The OpenScene dataset is a large-scale dataset for end-to-end planning, visual pretraining, and occupancy prediction in the field of autonomous driving.
Based on nuPlan, we provide bounding box, occupancy, and flow annotations in 3D space.
- We reorganized the meta data files and organized them by their nuPlan log files to improve usability.
- We added more logs that have sensor data and uploaded the LiDAR raw sensor data.
-
For Track End-to-End Driving at Scale, please download the
meta_data
and thecamera
or/andLiDAR
sensor data, depend on modalities you intend to use. Note that there is no separate competition track for camera-only planners. -
For Track Predictive World Model, please download the
meta_data
, thecamera
andLiDAR
sensor data. -
The
private test
set utilized in the challenge leaderboards is exclusively provided by Motional and should not be confused with thetest
set. -
It is important to note that the private test sets for the two tracks are distinct and do not share any data.
-
The input data (metadata, sensors) for the private test set will be accessible upon the opening of test server. The ground truth data will be only available on the test server operated by Motional.
-
We recommended to download all data from OpenXLab and use provided command line interface (CLI) for acceleration.
-
The sensor data for both the trainval and test subsets amount to approximately 2TB. We recommend initially training and validating your models on the
mini
set. -
🔔 For those who already possess the nuPlan sensor data (over 20TB) locally, you have the option to directly link it to the OpenScene folder to avoid redundant downloads. We carefully aligned the folder structure with nuPlan and just downsampled the nuPlan sensor data to improve the accessibility.
-
🔔 If you already have the OpenScene v1.0
image
data, you can use it for OpenScene v1.1 as well, since almost (>98%) of all the data is present. If you want to use the occupancy label, please also download it from OpenScene v1.0. There are only a few instances of additional data in v1.1 that are missing. You can temporarily ignore those frames during training. -
If you can't access OpenXLab, please try the HuggingFace download links below. Alternatively, you may directly download all the sensor data from nuPlan.
File Name | Download Link | Size |
---|---|---|
openscene_metadata_mini.tgz | Google Drive / Hugging Face | 509.6 MB |
openscene_sensor_mini_camera | OpenXLab / Hugging Face | 84 GB |
openscene_sensor_mini_lidar | OpenXLab / Hugging Face | 60 GB |
File Name | Download Link | Size |
---|---|---|
openscene_metadata_trainval.tgz | Google Drive / Hugging Face | 6.6 GB |
openscene_sensor_trainval_camera | OpenXLab / Hugging Face | 1.1 TB |
openscene_sensor_trainval_lidar | OpenXLab / Hugging Face | 822 GB |
File Name | Download Link | Size |
---|---|---|
openscene_metadata_test.tgz | Google Drive / Hugging Face | 454 MB |
openscene_sensor_test_camera | OpenXLab / Hugging Face | 120 GB |
openscene_sensor_test_lidar | OpenXLab / Hugging Face | 87 GB |
File Name | Download Link | Size |
---|---|---|
openscene_metadata_private_test_wm.tgz | Hugging Face | 7.3 MB |
openscene_sensor_private_test_wm.tgz | Hugging Face | 15 GB |
openscene_metadata_private_test_e2e.tgz | Hugging Face | 4 MB |
openscene_sensor_private_test_e2e.tgz | Hugging Face | 23.6 GB |
private_test_wm
is the private test set forPredictive World Model
track.private_test_e2e
is the private test set forEnd-to-End Driving at Scale
track.- [2024-04-09] We fix some bugs and update the metadata of
private_test_wm
, please replace it!
Please follow the steps below to get familiar with the OpenScene v1.1 dataset.
- Download all the data manually and unzip them.
- Make sure the filesystem hierarchy is the same as the dataset stats.
- Modify and run
python DriveEngine/process_data/collect_data.py
to collect the meta_data in any custom split.
We recommended to download from OpenXLab and use provided command line interface (CLI) for acceleration. In addition, Google Drive and Baidu Cloud are also available. If you already have the nuPlan dataset, you only need to download the label
and meta data
.
Subset | Google Drive | Baidu Cloud | Approx. Size |
---|---|---|---|
mini | image / label | image / label | 81.2G / 6.7G |
trainval | image / label | image / label | 1.1T / 95.4G |
test | image | image | 118.5G |
meta data | meta file | meta file | 6.4G |
- Mini and trainval data contain three parts --
sensor_blobs (images)
,meta_data
, andoccupancy (label)
.
To ensure the integrity of the downloaded data, we recommend verifying the file using its MD5 checksum after the download is complete.
We provide a baseline model based on OccNet. The baseline is currently compatible with OpenScene dataset v1.0
.
Train model with 4 GTX3090 GPUs
./tools/dist_train.sh ./projects/configs/bevformer/bev_tiny_occ_r50_nuplan.py 4
Eval model with 4 GTX3090 GPUs
./tools/dist_test.sh ./projects/configs/bevformer/bev_tiny_occ_r50_nuplan.py ./path/to/ckpts.pth 4