# Dataset splits vs. filtered training / test splits

The NAVSIM framework uses several dataset splits for standardized training and evaluation of agents. All of them are based on the OpenScene dataset, which is divided into the dataset splits `mini`, `trainval`, `test`, and `private_test_e2e`. Each of these can be downloaded separately.

You can run training and evaluation directly on these sets (see `Standard` in the table below). Alternatively, you can run training and evaluation on training and validation splits that were filtered for challenging scenarios (see `NAVSIM` in the table below). This is the recommended option for producing comparable and competitive results efficiently.

In contrast to the dataset splits, which refer to a downloadable set of logs, the training / test splits are implemented as scene filters that define how scenes are extracted from these logs. The NAVSIM training / test splits subsample the OpenScene dataset splits. Moreover, the NAVSIM splits include overlapping scenes, while the Standard splits are non-overlapping. Specifically, `navtrain` is based on the `trainval` data and `navtest` on the `test` data.

As the `trainval` sensor data is very large, we provide a separate download link that loads only the frames needed for `navtrain`. This eases access for users who only want to run the `navtrain` split and not the `trainval` split. If you have already downloaded the full `trainval` sensor data, it is **not necessary** to download the `navtrain` frames as well. The logs always comprise the complete dataset split.

## Overview

The table below gives an overview of the training and test splits supported by NAVSIM. In NAVSIM v1.1, the training / test split can be set with a single config parameter given in the table.

| | Name | Description | Logs | Sensors | Config parameters |
|---|---|---|---|---|---|
| Standard | trainval | Large split for training and validating agents with regular driving recordings. Corresponds to nuPlan, downsampled to 2 Hz. | 14GB | >2000GB | `train_test_split=trainval` |
| | test | Small split for testing agents with regular driving recordings. Corresponds to nuPlan, downsampled to 2 Hz. | 1GB | 217GB | `train_test_split=test` |
| | mini | Demo split with regular driving recordings. Corresponds to nuPlan, downsampled to 2 Hz. | 1GB | 151GB | `train_test_split=mini` |
| NAVSIM | navtrain | Standard split for training agents in NAVSIM with non-trivial driving scenes. Sensors available separately in `download_navtrain.sh`. | - | 445GB* | `train_test_split=navtrain` |
| | navtest | Standard split for testing agents in NAVSIM with non-trivial driving scenes. Available as a filter for the `test` split. | - | - | `train_test_split=navtest` |
| Competition | warmup_test_e2e | Warmup test split to validate submissions on Hugging Face. Available as a filter for the `mini` split. | - | - | `train_test_split=warmup_test_e2e` |
| | private_test_e2e | Private test split for the challenge leaderboard on Hugging Face. | <1GB | 25GB | `train_test_split=private_test_e2e` |
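
The relationship between the `train_test_split` values and the downloadable dataset splits can be summarized in a small lookup. The following is only an illustrative sketch built from the table above: `SplitInfo`, `SPLITS`, `required_dataset_split`, and `config_override` are hypothetical helper names and are not part of the NAVSIM codebase.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SplitInfo:
    """Hypothetical record describing one row of the table above."""
    group: str          # "Standard", "NAVSIM", or "Competition"
    dataset_split: str  # OpenScene dataset split whose logs the split draws from


# Mapping taken from the table: filtered splits point at the dataset
# split they are extracted from; dataset splits point at themselves.
SPLITS = {
    "trainval": SplitInfo("Standard", "trainval"),
    "test": SplitInfo("Standard", "test"),
    "mini": SplitInfo("Standard", "mini"),
    "navtrain": SplitInfo("NAVSIM", "trainval"),
    "navtest": SplitInfo("NAVSIM", "test"),
    "warmup_test_e2e": SplitInfo("Competition", "mini"),
    "private_test_e2e": SplitInfo("Competition", "private_test_e2e"),
}


def required_dataset_split(train_test_split: str) -> str:
    """Return the dataset split whose logs are needed for the given split."""
    if train_test_split not in SPLITS:
        raise ValueError(f"Unknown train_test_split: {train_test_split!r}")
    return SPLITS[train_test_split].dataset_split


def config_override(train_test_split: str) -> str:
    """Build the single config parameter listed in the table."""
    required_dataset_split(train_test_split)  # validates the name
    return f"train_test_split={train_test_split}"


if __name__ == "__main__":
    for name in SPLITS:
        print(f"{config_override(name):40s} logs from: {required_dataset_split(name)}")
```

For example, `required_dataset_split("navtrain")` returns `"trainval"`, reflecting that `navtrain` is a scene filter over the `trainval` logs even though its sensor frames can be downloaded separately.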