Training data
The subset of the dataset used to train a model. Traditionally, examples in the dataset are divided into the following three distinct subsets:1
- a training set
- a Validation set
- a Test set
Ideally, each example in the dataset should belong to only one of the preceding subsets. For example, a single example shouldn’t belong to both the training set and the validation set.1