Blog | Graphext | Graphext

January 7, 2025

How to implement churn predictive models that are actually useful and profitable for Revenue Operations teams.

How to implement churn predictive models that are actually useful and profitable for Revenue Operations teams.

January 8, 2025

Color in Graphext: A Silent Superpower

Gabriela Marante

Gabriela Marante

Luna Fidalgo

How thoughtful color choices became the secret weapon of Graphext: from dark modes that save your eyes to color palettes that make complex data click

December 31, 2023

Finding the most protein-rich, low-calorie, and healthy foods with data & Graphext.

Victoriano Izquierdo

Victoriano Izquierdo

Advanced analytics enables users to find the most protein-rich, low-calorie, and healthy foods at Mercadona, supporting muscle growth and overall health.

November 29, 2023

How we use Graphext to predict who will pay for Graphext with lead scoring

Thomas Buhrmann

Thomas Buhrmann

Victoriano Izquierdo

Victoriano Izquierdo

Using Graphext to predict potential customers through lead scoring by selecting the right variables and focusing on engagement with the tool in the near future.

November 15, 2023

Turn your Notion database into actionable insights

The Graphext Notion integration allows teams to generate performance reports, user segmentation reports, and analyze user interviews using their Notion database.

October 16, 2023

Why data without profound human interpretation, is meaningless, even in the era of AI

Many people are saying that ChatGPT is going to replace all data analysts and data scientists. Well, that might never happen… Humans bring contextual understanding, identify emerging trends, and build trust in the data analysis process.

How to Do Exploratory Data Analysis in Python

Who to follow to stay up to date with all things data science

A Complete Guide to Remarkable Data-Driven Content: YouTubers, Streamers, Podcasts, and Newsletters You Should Check Out in 2023

Mastering a Career in Data Science

We looked at the insights from the 2022 Kaggle Survey to provide guidance for those interested in pursuing a career in data science or programming, covering topics such as job roles, programming languages, machine learning, and compensation.

January 10, 2022

22 Data YouTubers + Streamers to Watch in 2022

Victoriano Izquierdo

Victoriano Izquierdo

Andy Clarke

Youtube, Twitch and other streaming platforms are full of data professionals sharing hacks, tutorials and stories of their working life. As well as content geared towards people starting out with data analysis - like Reuven Lerner covering essential Python tips and walkthroughs - there are videos posted by data Youtubers and streamers that debate topics at the forefront of data science research - Cassie Kozyrkov for instance.

January 5, 2022

19 Data Newsletters to Read in 2022

Victoriano Izquierdo

Victoriano Izquierdo

Andy Clarke

Newsletters are becoming a popular way to distil news, events and tips as the data landscape becomes busier and busier! These are the kind of emails we love to receive because they help us to stay ahead of the game ... and they are all about data. As well as data newsletters created for business analysts - The Modern Data Stack shares resources, opportunities and tools (we are very proud to have featured) - there are series geared towards data science and AI developments such as The Batch.

December 20, 2021

36 Data Podcasts to Follow in 2022

Victoriano Izquierdo

Victoriano Izquierdo

Andy Clarke

The world of data science podcasting has become as varied as the input parameters to a Linear Regression model. From household names like Freakonomics to less known up-and-comers like Big Data Beard, data professionals are sitting up from their computers to talk about business, the future of AI, data in the real world and much more ... if you know where to look.

December 8, 2021

What People Really Feel About Programming Languages

Victoriano Izquierdo

Victoriano Izquierdo

We collected and analyzed answers to a Twitter meme, wherein people were asked to express their relationship with programming languages.

November 24, 2021

Reverse Engineering Infamous Marketing Strategies from Innocent Drinks

Andy Clarke

Why are the social media strategies of Innocent Drinks considered as the gold standard for marketing teams the world over? We collected every tweet (10,521) posted by the communication department to deconstruct Innocent's content, style, reach and engagement with a simple topic analysis.

November 11, 2021

Using Mutual Information to Cluster Variables and Discover the Associations Between Survey Questions

Alex Gonzalez

Our team set out to build a type of analysis that could be used to measure the strength of association between variables in a dataset. Here's how we did it ...

October 26, 2021

How to Perform Simple & Effective Customer Segmentation | A Walkthrough with Data from a Delicatessen (Dataset interactive)

Andy Clarke

Customer segmentation involves splitting a customer base into distinct groups. These customer segments are defined by specific and shared characteristics, behaviours or preferences that help businesses to spot patterns and associate customers with one another. This article walks through the steps involved in a simple customer segmentation analysis. Using sales data from a delicatessen, we'll segment customers according to their buying preferences and behaviour. To achieve this, we'll use a powerful machine learning technique known as clustering.

October 19, 2021

Make or Break: After 5 Years ... Couples are Less Likely to Break Up

Victoriano Izquierdo

Victoriano Izquierdo

What's the most important milestone in a relationship? According to data from a Stanford study, it's a day like any other that occurs somewhere between the 4th and 5th anniversary of a relationship.

September 16, 2021

Nuevas perspectivas en analítica y detección de talento: webinar con D'Anchiano

María López

El pasado 15 de julio estuvimos en directo con Juan Palacios, CEO y fundador de D'Anchiano. Juan nos contó en detalle el uso que le da a Graphext para procesos de selección, así como una mayor visibilidad de qué utilidad tiene Graphext y el análisis de datos en RRHH y detección de talento.

September 10, 2021

Sentiment Analysis & Billboard Top 100: The Changing Mood of Popular Music .

Andy Clarke

We used sentiment analysis to model 5100 Billboard chart-toppers between 1964 and 2015. Our analysis predicted whether song lyrics were positive, negative or neutral as well as detecting the topic and intent behind the most popular tunes in music history.

How to Study Brand Conversations with Advanced Text Analysis?

Paul Suddon

How can we use text analysis of data from Twitter to improve our understanding of markets? This is the question prompting Paul, a strategist in our business team, to scrape tweets about Lloyds bank and conduct a Twitter topic analysis using advanced NLP and network creation. First, he collected tweets using Tractor, Graphext's scraping tool for social media analysis. Then, he analyzed the topics of tweets using network analysis. Here's how he did it ...

A Beginners Guide to Market Segmentation: Types, Techniques & Examples to Better Understand Your Customer Base (with Data)

Andy Clarke

Market segmentation means splitting your customer base into distinct communities based on the similarity of their features. Depending on the data you use to segment customers, clustering a market dataset results in the grouping of customers based on geographic, demographic, behavioural and psychographic factors as well as their buying preferences.

A Market Segmentation of 1000 Supermarket Customers Using Data on Sales, Income and Demographics

Andy Clarke

Paul Suddon

Our team clustered 1000 supermarket sales in order to segment customers according to their buying habits. Our market segmentation analysis uses data on the demographics, income and geography of customers to identify key buyer personas and inform marketing strategies and campaigns.

Graphext | Graphtex | Graphnext: Grouping Similar Spellings Using Chars2Vec and Agglomerative Clustering

Maria del Mar Ruiz

Maria del Mar Ruiz

'España' and 'Españha' are just spelling variations. We built a way of grouping words spelt differently but referring to the same concept.

Conspiracies, Complexity and Clustering: Investigating Reports of Adverse COVID-19 Vaccine Effects

Andy Clarke

Modelling data from the Vaccine Adverse Event Reporting System (VAERS) - a US government-sponsored vaccine reaction monitoring service - our team set out to investigate reports of adverse health effects related to the seismic rollout of the COVID-19 vaccination programme in the USA.

The Method Behind Our Investigation of Reports of Adverse COVID-19 Vaccine Events

Andy Clarke

Paul Suddon

Taking on an investigation into the adverse reactions associated with the COVID-19 vaccination rollout in the USA, our team were aware of the increased need for transparency whilst conducting our analysis. This article documents the methodology behind our study of Vaccine Adverse Event Reporting System (VAERS) data.

Good Risk vs Bad Risk: Deconstructing the Features of 1000 German Loans

Andy Clarke

Attempting to discover the most influential features of a loan application when considering risk, our team built a model using the features of a loan application to predict whether an applicant would have a good or bad risk rating.

Simple Solutions to Prevent Customer Churn

Paul Suddon

Our team analyzed 7043 current and former customers of a telecoms provider in order to better understand what types of people are most likely to cancel their contracts.

How Data Can Help You Keep Your Workers

Paul Suddon

To showcase how a company could reduce employee turnover, our team clustered a dataset containing information about IBM employees to discover the reasons why employees left their jobs.

Menhir & Graphext- Analyzing the Intangible Value of Financial Assets.

Andy Clarke

Working at the intersection of data science and finance, Menhir is using Graphext to understand the composition of financial portfolios, performing analysis that typically takes analysts between two and three weeks in just two days.

The Moneyball Method: Using Data to Build a Football Dream Team (On a Budget)

Andy Clarke

Victoriano Izquierdo

Victoriano Izquierdo

Our team set out to build an exceptional football team for less than 100M Euros. Using data provided in the FIFA 2020/2021 dataset - the video game - we built a prediction model in order to find the key performance attributes for each position. Then, we used this to pick out a team of excellent but undervalued players.

Understanding Employee Behaviour

Almudena Sanz

Why do people act the way they do? Why do they buy products, quit their jobs, or change partners? Many of these motives are people's behaviors and some can be found in data.

February 9, 2021

Patriotism, Animals, Comedy and Sex: Clustering 233 Superbowl Ads

Andy Clarke

Victoriano Izquierdo

Victoriano Izquierdo

We built a model clustering 233 Superbowl ads using data from FiveThirtyEight in order to work out what content brands use to sell their products during America's most-watched sporting event.

February 2, 2021

Health vs Economy: Using Twitter to Investigate How Latin American Leaders Have Responded to COVID-19

Andy Clarke

Julián Yunez, a political communications consultant, used Graphext to investigate how messages related to public health and economics have been balanced by Latin American leaders during the pandemic.

January 12, 2021

Finding the 'Perfect' Sales Candidate Using Clustering and Prediction: Graphext and "The Sales Acceleration Formula"

María López

Paul Suddon

Exploring how Graphext's data-driven approach might be used to identify the characteristics of successful salespeople.

January 4, 2021

How We Created a Pandemic-Resistant Team Building Session (Interactive)

Laura Castro

Motivated by the prospect of getting to know our new colleagues, our design team created a team-building session that negotiated the limitations brought by the pandemic.

December 29, 2020

The Evolution of American Protests After the Death of George Floyd: COVID-19, BLM and the Election

Andy Clarke

2020 has been a turbulent year for every country but particularly in the USA. We clustered American protest events between May 24 - Nov 28, 2020 to investigate the relationships between types of protests, their violence and their geography.

December 26, 2020

3 Mapas de Poder Político en España

Victoriano Izquierdo

Victoriano Izquierdo

Hoy publicamos en El País 3 grafos que representan 3 mapas de cómo están conectados en redes sociales los periodistas políticos, los diputados y las cuentas de Twitter políticas más relevantes en España.

October 27, 2020

ODS y Agenda 2030: Desarrollo sostenible en los medios de comunicación

Victoriano Izquierdo

Victoriano Izquierdo

En este proyecto nos planteamos analizar cuál es el rol que están asumiendo los medios de comunicación en la difusión del desarrollo sostenible y comprender el nivel de sensibilización y concienciación sobre la importancia respecto a la Agenda 2030.

September 13, 2020

Zara vs. Gap: An Instagram Analysis

Maximilian Zachow

Maximilian Zachow

What are unexpected insights and surprising patterns of the social media strategies of Inditex’s flagship Zara and Gap? In this article, we analyzed their Instagram campaigns since Feb 2017.

September 8, 2020

¿Es la clase obrera española la más xenóbofa de todas las clases sociales? Qué dicen los datos del CIS

Victoriano Izquierdo

Victoriano Izquierdo

Los obreros no cualificados y cualificados tienen un 36% y 16% más de posibilidades de decir que el número de inmigrantes es excesivo, analizamos qué otras variables pueda explicar en por qué tienen esa percepción sobre la población inmigrante en España.

Cómo el Congreso de los Diputados tuiteó durante el Estado de Alarma

Victoriano Izquierdo

Victoriano Izquierdo

Prácticamente todos los diputados del Congreso actual, la XIV legislatura, tienen cuenta en Twitter y gran parte de su trabajo consiste en leer y escribir tweets. Para bien o para mal, los políticos nunca han tenido más poder y facilidad para mandar directamente sus mensajes a los ciudadnos sin el filtro de la prensa. Analizamos las principales narrativas que cada partido movió.

How the US Congress Tweeted in 2020

Dani Vegara

We analyzed almost 200K tweets from members of Congress, comparing across parties and seeing what worked for each one. We found interesting insights by analyzing the way each party worded their opinions.

Finding Real Estate Opportunities in Madrid

Victoriano Izquierdo

Victoriano Izquierdo

We analyzed more than 20k advertisements in real estate websites to try to find underpriced houses with Graphext's predictive algorithms. Along the way we looked into the relationships between prices and factors such as education level or location index to try to find insights and patterns in the data.

The Lipstick Effect: Did the 2008 Financial Crisis Drive an Increase in Positive Airline Reviews?

Brais Ramos

We analyzed 30K airlines services reviews and saw that there are clear jumps in ratings marked by the 2008 financial crisis and subsequent economic recovery. Could these factors have impacted consumers, or have airlines improved their services?

Is Mark Cuban a Socialist, a Communist, a Globalist... or Something Else? How Trump Supporters Attack Another Billionaire

Victoriano Izquierdo

Victoriano Izquierdo

Mark Cuban is one of the wealthiest people in America, with an estimated net worth of $4.1 billion. He asked for a tool to work out why his Twitter supporters were calling him a socialist, a communist and a globalist - and to analyze whether their accusations were true!

Las 2 Españas a Palos de Golf: quiénes y cómo son los que reparten tweets a izquierda y derecha.

Victoriano Izquierdo

Victoriano Izquierdo

La semana pasada, millones de personas en España acabaron imaginando en su cabeza algo que realmente nunca pasó: a un señor rico del barrio de Salamanca destrozando mobilario urbano con un palo de golf. Analizamos con datos y Graphext cómo se originó este bulo, y sobre todo quién hay detrás a izquierda y derecha propagando estas nuevas narrativas políticas

Las cifras de muertos en España por COVID-19 en contexto.

Victoriano Izquierdo

Victoriano Izquierdo

¿Quiénes son y dónde están los excesos de muertes en España en los últimos años por edad, sexo y Comunidades Autónomas?

How to Look Good on Video Calls: Analyzing 1K Skype & Zoom rooms.

Victoriano Izquierdo

Victoriano Izquierdo

How do you look your best in a video call? Collecting almost 1000 reviews of video call backgrounds, we set out to find patterns in what makes people give off good or bad impressions.

Clustering countries by Covid-19 Mobility Trends changes

Victoriano Izquierdo

Victoriano Izquierdo

We used the Apple Mobility Trends Report to cluster with Graphext countries that are experiencing similar changes in people's walking behaviour.

How Couples Meet and Stay Together014100

Victoriano Izquierdo

Victoriano Izquierdo

Maximilian Zachow

Maximilian Zachow

Grabbing data from a Stanford University study on relationships, we set out to investigate what a 'happy' relationship looks like and how you find one.

How to Build your Brand Through Social Media

Maximilian Zachow

Maximilian Zachow

Many companies use Twitter in a traditional way: as promotion, presenting new features. But over 50% of all Tweets have another motivation.

Benchmarking McDonald's Store Performance

Almudena Sanz

Using Graphext, we helped McDonald’s to categorize their more than 500 spanish stores based on their typical customer profiles, to later analyze their sales transaction data.

Healthy Food: A Tweet Content Analysis

Maximilian Zachow

Maximilian Zachow

Is healthy food only about being vegan and following diets or is the community mentioning more narratives around that topic? We analyze 30k tweets to find out.

Overcoming Distribution Challenges with Bicycle Sharing

Maximilian Zachow

Maximilian Zachow

With Graphext we analyzed the vehicle distribution data (from Kaggle) of a bike sharing service based in Minnesota to try to understand bike shortage in Madrid

La imagen que transmite la política española:

Maximilian Zachow

Maximilian Zachow

Victoriano Izquierdo

Victoriano Izquierdo

Postureo en traje estrechando manos, Baños de masas y selfies, Memes y de relax en la naturaleza. Analizamos las fotos de los 10 políticos españoles más seguidos en Instagram

When Dating Apps Met Survey Theory: Sampling, Weighting & Romance

Alex Gonzalez

A picture of a population is what most surveys hope to achieve. Who doesn't want to know which essential Tinder personality traits help a person to be successful in love? We're taking a look at the fundamentals of survey theory - sampling & weighting - through the lens of a Pew Research survey that examines American attitudes towards relationships and dating apps in 2021.

The Importance of Blockchain to International Development

Almudena Sanz

We recently worked with the United Nations to understand the importance of Blockchain and other emerging technologies in the field of international development.

Analyzing a Spanish Election Poll with Aquienvoto

Victoriano Izquierdo

Victoriano Izquierdo

Opinion Poll Company Aquienvoto & Graphext teamed up to look behind the curtains of April 28th’s result. Join us as we analyze how people responded to 45 different questions.

Social Listening Using Graphext

Almudena Sanz

In this post we will identify and analyze the topics that interest the community of VCs in the US. To accomplish this we will use Graphext and Contexto.

Who Will Replace Cristiano Ronaldo in Real Madrid?

Asier Sarasua

Cristina Traba

Can data tell us who can replace Ronaldo in Real Madrid? Are there any similar players to Cristiano according to data? We analyze 18000 players to find out

The Top Stories in 2020 according to every tweet from 38 UK news publishers

Andy Clarke

We collected every tweet in 2020 from 38 UK news organisations to find out what the media have been reporting on. Then we visualised categories of tweets as trends to see what the British media landscape looked like throughout the year.

Finding a New Brand Identity with the Graphext Design Team

Victoriano Izquierdo

Victoriano Izquierdo

As a product evolves, the brand around it evolves too. Here we take you through the design process of finding a new logo for our business, following the experiments carried out by the Graphext design team.

Funnel Analysis and Network Visualisation with ING

Almudena Sanz

Working with data from ING, we created a network visualization of how users navigate hundreds of web pages, correlating their navigation patterns with specific user profiles.

Survey Analysis Using Graphext

Almudena Sanz

Surveys are an easy way to gather people’s opinion or behaviour, but the challenge has always been once you have all the data, what do you do with it?

How Spanish Politicians and the Media are Connected

Victoriano Izquierdo

Victoriano Izquierdo

In 2017, we analized the Spanish political community and how are they related with the most important media based on how their interactions from their twitter account.

Campus Madrid Community

Victoriano Izquierdo

Victoriano Izquierdo

During 2017 we were monitoring all the conversations on Twitter about and around Campus Madrid. This is the network of 4838 people who participated in the conversation.

Alexa Amazon Reviews

Maximilian Zachow

Maximilian Zachow

If you want to succeed with your product in the market you have to listen to what your customers say. Take a look to this post to see how to achieve this with Graphext.