References → Split Recipe

The split tool is used to separate one dataset into two datasets. This can help sample data, create training datasets, and validate datasets.

Configuration

ConfigurationDescription
Recipe NameA freeform name of how a user would like to name a recipe
InputSelect a previously constructed recipe to process
Split RatioDefine a ratio to split the dataset into two. Enter a number between 0.0 and 1.0. The split defined will be assigned to split_1 while the remainder will be split_2. Example: If we have 1000 records and assign a split ratio of 0.1, ~100 records will be in split_1, and the remainder in split_2

Result

In the data explorer, the result set will have a new dropdown in the right corner where you can preview both split outputs in the pane. When mapping the output of a split recipe into a new recipe, users will select which split piece should be used.