Prepare Data

You might not always want to work with a full dataset. Sampling allows you to select a specific sample of your data to work with according to criteria that you set. If your dataset has more than 40K rows, sampling it will enable faster and more efficient analysis.

Use the sampling icon from the top right corner of the project setup wizard to start sampling your data. To remove a sample, click the trash icon inside the sampling menu.

‍

Sampling with Text or Categories

To sample using text or category filters, choose a variable, an operation and a value. When you execute your project your data will be sampled to include only values that meet the operation criteria.

For instance, choosing 'Retweet' as your variable, 'equals' as your operation and 'True' as your value would sample your data to include only data points where the 'Retweet' column of your data is true.

‍

‍

How to Sample Your Data Using Text or Category Filters?

  1. Start from the 'Datasets' of your Graphext workspace.
  2. Select a dataset to start working with.
  3. From the project setup wizard that appears on the right of your screen, select the icon from the top right representing the length of your dataset - the icon with 4 horizontal bars of different sizes.
  4. Select 'Add a filter'.
  5. Choose 'Filter by text or category' from the menu list.
  6. Select a variable from the dropdown list.
  7. Choose an operation from the dropdown list.
  8. Enter a value in the 'Value' text box.
  9. To add another filter, select 'Add a filter' and complete the additional filter form.
  10. Done ... The indicated number of rows in your dataset should change to reflect your filter. You can now continue with your project setup to start working with your sample.

‍


Sampling with Quantitative Variables

To sample using quantitative variables, choose a variable, an operation and a quantitative value. When you execute your project your data will be sampled to include only values that meet the operation criteria.

For instance, choosing 'Age' as your variable, 'less than' as your operation and '35' as your value would sample your data to include only data points where the 'Age' column of your data is less than 35.

‍

‍

How to Sample Your Data Using Quantitative Variable Filters?

  1. Start from the 'Datasets' of your Graphext workspace.
  2. Select a dataset to start working with.
  3. From the project setup wizard that appears on the right of your screen, select the icon from the top right representing the length of your dataset - the icon with 4 horizontal bars of different sizes.
  4. Select 'Add a filter'.
  5. Choose 'Filter by numerical value' from the menu list.
  6. Select a variable from the dropdown list.
  7. Choose an operation from the dropdown list.
  8. Enter a numerical value in the 'Value' text box.
  9. To add another filter, select 'Add a filter' and complete the additional filter form.
  10. Done ... The indicated number of rows in your dataset should change to reflect your filter. You can now continue with your project setup to start working with your sample.

‍


Sampling with Date Variables

To sample using date variables, choose a variable, an operation and a date value. When you execute your project your data will be sampled to include only values that meet the operation criteria.

For instance, choosing 'Birthday' as your variable, 'different to' as your operation and '31 July 2020' as your value would sample your data to include only data points where the 'Birthday' column of your data was different to 31 July 2020.

‍

‍

How to Sample Your Data using Date Variable Filters?

  1. Start from the 'Datasets' of your Graphext workspace.
  2. Select a dataset to start working with.
  3. From the project setup wizard that appears on the right of your screen, select the icon from the top right representing the length of your dataset - the icon with 4 horizontal bars of different sizes.
  4. Select 'Add a filter'.
  5. Choose 'Filter by date' from the menu list.
  6. Select a variable from the dropdown list.
  7. Choose an operation from the dropdown list.
  8. Select a date from the calendar dropdown.
  9. To add another filter, select 'Add a filter' and complete the additional filter form.
  10. Done ... The indicated number of rows in your dataset should change to reflect your filter. You can now continue with your project setup to start working with your sample.

‍

Need Something Different?

We know that data isn't always clean and simple.
Have a look through these topics if you can't see what you are looking for.