February 2, 2022
Data-Driven SEO: A Keyword Optimization Guide using Web Scraping & Co-occurrence Analysis (Graphext + Deepnote + Adwords)
To improve our SEO, we built a data-driven method to analyze the content of top-ranking Google search results as part of a keyword optimization process. Starting with a single search term, our technique uses web scraping + NLP techniques to find specific keywords that are already proven to boost the rank of similar pages.
January 10, 2022
22 Data YouTubers + Streamers to Watch in 2022
YouTube, Twitch and other streaming platforms are full of data professionals sharing hacks, tutorials and stories from their working lives. As well as content geared towards people starting out with data analysis, like Reuven Lerner's essential Python tips and walkthroughs, there are videos from data YouTubers and streamers that debate topics at the forefront of data science research, such as those by Cassie Kozyrkov.
January 5, 2022
19 Data Newsletters to Read in 2022
Newsletters are becoming a popular way to distil news, events and tips as the data landscape becomes busier and busier! These are the kind of emails we love to receive because they help us to stay ahead of the game ... and they are all about data. As well as data newsletters created for business analysts, such as The Modern Data Stack, which shares resources, opportunities and tools (we are very proud to have been featured), there are series geared towards data science and AI developments, such as The Batch.
December 20, 2021
36 Data Podcasts to Follow in 2022
The world of data science podcasting has become as varied as the input parameters to a linear regression model. From household names like Freakonomics to lesser-known up-and-comers like Big Data Beard, data professionals are sitting up from their computers to talk about business, the future of AI, data in the real world and much more ... if you know where to look.
December 8, 2021
When Dating Apps Met Survey Theory: Sampling, Weighting & Romance
A picture of a population is what most surveys hope to achieve. Who doesn't want to know which essential Tinder personality traits help a person to be successful in love? We're taking a look at the fundamentals of survey theory - sampling & weighting - through the lens of a Pew Research survey that examines American attitudes towards relationships and dating apps in 2021.
November 24, 2021
Reverse Engineering Infamous Marketing Strategies from Innocent Drinks
Why are the social media strategies of Innocent Drinks considered the gold standard for marketing teams the world over? We collected every tweet (10,521) posted by the communications department to deconstruct Innocent's content, style, reach and engagement with a simple topic analysis.
November 15, 2021
How Aquaservice Use Graphext To Improve Their Prediction Models
We spoke to the data science team at Aquaservice about how they used Graphext to build a clustering model to improve the way they forecast consumer demand. Their project grouped delivery routes using over 30 factors to calculate similarity and exposed patterns in the errors made by their prediction models, which forecast the number of water bottles that should be loaded into trucks for delivery on a huge number of routes across Spain.
November 11, 2021
What People Have Felt and Thought About The 2021 UN Climate Change Conference (COP26)
We collected every tweet published about the 2021 UN Climate Change Conference (COP26) to study how people have engaged with events during the summit. Using topic analysis and emotion detection, our project dives into people's visceral reactions to agreements on deforestation, commitments between China and the USA and the appearance of Barack Obama.
October 26, 2021
How to Perform Simple & Effective Customer Segmentation | A Walkthrough with Data from a Delicatessen
Customer segmentation involves splitting a customer base into distinct groups. These customer segments are defined by specific and shared characteristics, behaviours or preferences that help businesses to spot patterns and associate customers with one another. This article walks through the steps involved in a simple customer segmentation analysis. Using sales data from a delicatessen, we'll segment customers according to their buying preferences and behaviour. To achieve this, we'll use a powerful machine learning technique known as clustering.
October 19, 2021
Make or Break: After 5 Years ... Couples are Less Likely to Break Up
What's the most important milestone in a relationship? According to data from a Stanford study, it's a day like any other that occurs somewhere between the 4th and 5th anniversaries of a relationship.
September 10, 2021
Sentiment Analysis & Billboard Top 100: The Changing Mood of Popular Music
We used sentiment analysis to model 5,100 Billboard chart-toppers released between 1964 and 2015. Our analysis predicted whether song lyrics were positive, negative or neutral, and detected the topic and intent behind the most popular tunes in music history.
August 23, 2021
The 5 Most Extreme US Office Characters
Testing out our brand spanking new integration with Hugging Face models for NLP, we analyzed speech from characters in all 9 series of the US Office. Added into our Graphext project, the language models classified the dialogue of Michael, Dwight, Pam, Jim, Darryl and all the other characters by sentiment, emotion, offensive language, irony and hate speech.