May 6, 2021
Good Risk vs Bad Risk: Deconstructing the Features of 1000 German Loans
Attempting to discover the most influential features of a loan application when considering risk, our team built a model using the features of a loan application to predict whether an applicant would have a good or bad risk rating.
March 24, 2021
The Moneyball Method: Using Data to Build a Football Dream Team (On a Budget)
Our team set out to build an exceptional football team for less than 100M Euros. Using data provided in the FIFA 2020/2021 dataset - the video game - we built a prediction model in order to find the key performance attributes for each position. Then, we used this to pick out a team of excellent but undervalued players.
February 9, 2021
Patriotism, Animals, Comedy and Sex: Clustering 233 Superbowl Ads
We built a model clustering 233 Superbowl ads using data from FiveThirtyEight in order to work out what content brands use to sell their products during America's most-watched sporting event.
January 12, 2021
Finding the 'Perfect' Sales Candidate Using Clustering and Prediction: Graphext and "The Sales Acceleration Formula"
Exploring how Graphext's data-driven approach might be used to identify the characteristics of successful salespeople.
December 29, 2020
The Evolution of American Protests After the Death of George Floyd: COVID-19, BLM and the Election
2020 has been a turbulent year for every country but particularly in the USA. We clustered American protest events between May 24 - Nov 28, 2020 to investigate the relationships between types of protests, their violence and their geography.
December 21, 2020
The Top Stories in 2020 According to Every Tweet from 38 UK News Publishers
We collected every tweet in 2020 from 38 UK news organisations to find out what the media have been reporting on. Then we visualised categories of tweets as trends to see what the British media landscape looked like throughout the year.
August 7, 2020
Cómo el Congreso de los Diputados tuiteó durante el Estado de Alarma
Prácticamente todos los diputados del Congreso actual, la XIV legislatura, tienen cuenta en Twitter y gran parte de su trabajo consiste en leer y escribir tweets. Para bien o para mal, los políticos nunca han tenido más poder y facilidad para mandar directamente sus mensajes a los ciudadnos sin el filtro de la prensa. Analizamos las principales narrativas que cada partido movió.
August 5, 2020
How the US Congress Tweeted in 2020
We analyzed almost 200K tweets from members of Congress, comparing across parties and seeing what worked for each one. We found interesting insights by analyzing the way each party worded their opinions.
July 29, 2020
Finding Real Estate Opportunities in Madrid
We analyzed more than 20k advertisements in real estate websites to try to find underpriced houses with Graphext's predictive algorithms. Along the way we looked into the relationships between prices and factors such as education level or location index to try to find insights and patterns in the data.
July 9, 2020
The Lipstick Effect: Did the 2008 Financial Crisis Drive an Increase in Positive Airline Reviews?
We analyzed 30K airlines services reviews and saw that there are clear jumps in ratings marked by the 2008 financial crisis and subsequent economic recovery. Could these factors have impacted consumers, or have airlines improved their services?
June 26, 2020
Is Mark Cuban a Socialist, a Communist, a Globalist... or Something Else? How Trump Supporters Attack Another Billionaire
Mark Cuban is one of the wealthiest people in America, with an estimated net worth of $4.1 billion. He asked for a tool to work out why his Twitter supporters were calling him a socialist, a communist and a globalist - and to analyze whether their accusations were true!
May 24, 2020
Las 2 Españas a Palos de Golf: quiénes y cómo son los que reparten tweets a izquierda y derecha.
La semana pasada, millones de personas en España acabaron imaginando en su cabeza algo que realmente nunca pasó: a un señor rico del barrio de Salamanca destrozando mobilario urbano con un palo de golf. Analizamos con datos y Graphext cómo se originó este bulo, y sobre todo quién hay detrás a izquierda y derecha propagando estas nuevas narrativas políticas