September 10, 2021
Sentiment Analysis & Billboard Top 100: The Changing Mood of Popular Music
We used sentiment analysis to model 5100 Billboard chart-toppers between 1964 and 2015. Our analysis predicted whether song lyrics were positive, negative or neutral as well as detecting the topic and intent behind the most popular tunes in music history.
July 22, 2021
How to Study Brand Conversations with Advanced Text Analysis?
How can we use text analysis of data from Twitter to improve our understanding of markets? This is the question prompting Paul, a strategist in our business team, to scrape tweets about Lloyds bank and conduct a Twitter topic analysis using advanced NLP and network creation. First, he collected tweets using Tractor, Graphext's scraping tool for social media analysis. Then, he analyzed the topics of tweets using network analysis. Here's how he did it ...
July 20, 2021
A Beginners Guide to Market Segmentation: Types, Techniques & Examples to Better Understand Your Customer Base (with Data)
Market segmentation means splitting your customer base into distinct communities based on the similarity of their features. Depending on the data you use to segment customers, clustering a market dataset results in the grouping of customers based on geographic, demographic, behavioural and psychographic factors as well as their buying preferences.
June 8, 2021
The Method Behind Our Investigation of Reports of Adverse COVID-19 Vaccine Events
Taking on an investigation into the adverse reactions associated with the COVID-19 vaccination rollout in the USA, our team were aware of the increased need for transparency whilst conducting our analysis. This article documents the methodology behind our study of Vaccine Adverse Event Reporting System (VAERS) data.
June 8, 2021
Conspiracies, Complexity and Clustering: Investigating Reports of Adverse COVID-19 Vaccine Effects
Modelling data from the Vaccine Adverse Event Reporting System (VAERS) - a US government-sponsored vaccine reaction monitoring service - our team set out to investigate reports of adverse health effects related to the seismic rollout of the COVID-19 vaccination programme in the USA.
May 6, 2021
Good Risk vs Bad Risk: Deconstructing the Features of 1000 German Loans
Attempting to discover the most influential features of a loan application when considering risk, our team built a model using the features of a loan application to predict whether an applicant would have a good or bad risk rating.
March 24, 2021
The Moneyball Method: Using Data to Build a Football Dream Team (On a Budget)
Our team set out to build an exceptional football team for less than 100M Euros. Using data provided in the FIFA 2020/2021 dataset - the video game - we built a prediction model in order to find the key performance attributes for each position. Then, we used this to pick out a team of excellent but undervalued players.
February 9, 2021
Patriotism, Animals, Comedy and Sex: Clustering 233 Superbowl Ads
We built a model clustering 233 Superbowl ads using data from FiveThirtyEight in order to work out what content brands use to sell their products during America's most-watched sporting event.
January 12, 2021
Finding the 'Perfect' Sales Candidate Using Clustering and Prediction: Graphext and "The Sales Acceleration Formula"
Exploring how Graphext's data-driven approach might be used to identify the characteristics of successful salespeople.
December 29, 2020
The Evolution of American Protests After the Death of George Floyd: COVID-19, BLM and the Election
2020 has been a turbulent year for every country but particularly in the USA. We clustered American protest events between May 24 - Nov 28, 2020 to investigate the relationships between types of protests, their violence and their geography.
December 21, 2020
The Top Stories in 2020 According to Every Tweet from 38 UK News Publishers
We collected every tweet in 2020 from 38 UK news organisations to find out what the media have been reporting on. Then we visualised categories of tweets as trends to see what the British media landscape looked like throughout the year.
August 7, 2020
Cómo el Congreso de los Diputados tuiteó durante el Estado de Alarma
Prácticamente todos los diputados del Congreso actual, la XIV legislatura, tienen cuenta en Twitter y gran parte de su trabajo consiste en leer y escribir tweets. Para bien o para mal, los políticos nunca han tenido más poder y facilidad para mandar directamente sus mensajes a los ciudadnos sin el filtro de la prensa. Analizamos las principales narrativas que cada partido movió.