Exploring your data
Once you have defined a dataset, the first thing that DSS allows you to do is to view your data in the “Explore” component.
Explore shows you a sample of your dataset. It also provides:
-
Quick descriptive statistics, either on the sample on the entirety of the data
-
Meanings, i.e. rich types. See Schemas, storage types and meanings for more information
-
An overview of all columns
-
The ability to sort and filter the sample
-
The ability to choose which columns are displayed
All these abilities are also available while doing visual preparation of your data.
Note
For a step by step introduction to the exploration component of Data Science Studio, we recommend that you follow our Basic Courses . This section will focus on advanced and reference topics related to the exploration component.
-
Sampling
- Sampling in Explore
-
Sampling methods
- First records
- Random sampling (fixed number of records)
- Random sampling (approximate ratio)
- Random sampling (approximate number of records)
- Column values subset
- Stratified (fixed number of records)
- Stratified (approximate ratio)
- Class rebalancing (approximate number of records)
- Class rebalancing (approximate ratio)
- Last records
- Sampling and partitioning
- Refreshing the sample
- Analyze