Map values of a single column to the outputs datasets, either with discrete values or ranges.
Randomly send each row to an output dataset according to provided ratios.
Can choose one of the two available modes:
- Randomly splits the dataset according to the provided ratios (exact when using DSS engine, approximate otherwise).
- Randomly selects a subset of values of one ore more columns and send all rows with these values to an output, in order to obtain approximately the provided ratio for this output. Two outputs cannot contain the same values.
Build specific filters to dispatch data in each output dataset.
Filters are mutually exclusive, i.e first match takes it all.
Sort data according to a list of sort criteria. Then define percentile range of the data that is sent to each output dataset.