2021 Smoky Mountains Conference Data Challenge Synthetic-to-Real Domain Adaptation for Autonomous Driving Dataset

Coletti, Mark; Chipka, Jordan

doi:10.13139/OLCF/1772569

Title: 2021 Smoky Mountains Conference Data Challenge Synthetic-to-Real Domain Adaptation for Autonomous Driving Dataset

Dataset
Other Related Research

Abstract

The dataset is comprised of both real and synthetic images from a vehicleâs forward-facing camera. Each camera image is accompanied by a corresponding pixel-level semantic segmentation image (all files are .png files). In total, the dataset contains 5600 images in the training/validation set and 1400 images in the testing set. The training dataset contains mostly synthetic RGB images collected with a wide range of weather and lighting conditions using the CARLA simulator [1]. In addition, the training data also includes a small pre-selected subset of data from the Cityscapes training dataset â which is comprised of RGB-segmentation image pairs from driving scenarios in various European cities [2]. The testing data is split into three sets. The first set contains synthetic CARLA images with weather/lighting conditions that were not present in the training set. The second set is a subset of the Cityscapes testing dataset. Finally, the third set is an unknown testing set which will not be revealed to the participants until after the submission deadline. [1] Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., & Koltun, V. (2017, October). CARLA: An open urban driving simulator. In Conference on robot learning (pp. 1-16). PMLR. [2] Cordts, M., Omran, M., Ramos,more »« less

Authors:

Coletti, Mark; Chipka, Jordan

ORNL-OLCF

Publication Date:: Fri Mar 26 04:00:00 UTC 2021

DOE Contract Number:: AC05-00OR22725

Research Org.:: Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States); General Motors

Sponsoring Org.:: Office of Science (SC)

Collaborations:: General Motors

Subject:: 99 GENERAL AND MISCELLANEOUS; autonomous driving, computer vision, semantic segmentation, domain adaptation, synthetic data

OSTI Identifier:: 1772569

DOI:: https://doi.org/10.13139/OLCF/1772569

Citation Formats


                    Coletti, Mark, and Chipka, Jordan. 2021 Smoky Mountains Conference Data Challenge Synthetic-to-Real Domain Adaptation for Autonomous Driving Dataset.  United States: N. p., 2021. 
        Web.  doi:10.13139/OLCF/1772569.

Copy to clipboard


                    Coletti, Mark, & Chipka, Jordan. 2021 Smoky Mountains Conference Data Challenge Synthetic-to-Real Domain Adaptation for Autonomous Driving Dataset.  United States.  doi:https://doi.org/10.13139/OLCF/1772569

Copy to clipboard


                    Coletti, Mark, and Chipka, Jordan. 2021.  
"2021 Smoky Mountains Conference Data Challenge Synthetic-to-Real Domain Adaptation for Autonomous Driving Dataset".  United States.  doi:https://doi.org/10.13139/OLCF/1772569.  https://www.osti.gov/servlets/purl/1772569. Pub date:Fri Mar 26 04:00:00 UTC 2021

Copy to clipboard


                    
@article{osti_1772569,

  title        = {2021 Smoky Mountains Conference Data Challenge Synthetic-to-Real Domain Adaptation for Autonomous Driving Dataset},

  author       = {Coletti, Mark and Chipka, Jordan},

  abstractNote = {The dataset is comprised of both real and synthetic images from a vehicleâs forward-facing camera. Each camera image is accompanied by a corresponding pixel-level semantic segmentation image (all files are .png files). In total, the dataset contains 5600 images in the training/validation set and 1400 images in the testing set. The training dataset contains mostly synthetic RGB images collected with a wide range of weather and lighting conditions using the CARLA simulator [1]. In addition, the training data also includes a small pre-selected subset of data from the Cityscapes training dataset â which is comprised of RGB-segmentation image pairs from driving scenarios in various European cities [2]. The testing data is split into three sets. The first set contains synthetic CARLA images with weather/lighting conditions that were not present in the training set. The second set is a subset of the Cityscapes testing dataset. Finally, the third set is an unknown testing set which will not be revealed to the participants until after the submission deadline. [1] Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., & Koltun, V. (2017, October). CARLA: An open urban driving simulator. In Conference on robot learning (pp. 1-16). PMLR. [2] Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R., ... & Schiele, B. (2016). The cityscapes dataset for semantic urban scene understanding. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3213-3223).},

  doi          = {10.13139/OLCF/1772569},

  journal      = {},

  number       = ,

  volume       = ,

  place        = {United States},

  year         = {Fri Mar 26 04:00:00 UTC 2021},

  month        = {Fri Mar 26 04:00:00 UTC 2021}

}

Copy to clipboard

Dataset:

View Dataset

DOI: https://doi.org/10.13139/OLCF/1772569

Save / Share:

Export Metadata

Save to My Library

Similar records in DOE Data Explorer and OSTI.GOV collections:

Neutron Imaging dataset for SMC 2021 data challenge

Dataset Peterson, Peter ; Granroth, Garrett ; Bilheux, Hassina ; ...

The neutron radiography (nR) dataset provides information of the neutron events as measured for the Siemens star mask using the Timepix3 detector. The recordings consist of the position (x and y axes), the time-stamp, and time-over-threshold (TOT) values of each neutron event.
VULCAN temperature dataset for SMC data challenge

Dataset Peterson, Peter

The VULCAN Beamline dataset provides the sample measurement, where temperatures is recorded in two physically different places on the sample. These are held in two different hdf5 groups in the data file.
COVID-19 Knowledge Graph -- Dataset for SMCDC 2021 Challenge 2

Dataset Herrmannova, Drahomira ; Kannan, Ramakrishnan ; Lim, Seung-Hwan ; ...

This repository contains the data for the 2021 Smoky Mountains Computational Sciences Data Challenge (SMCDC21) Challenge 2 -- Finding Novel Links in COVID-19 Knowledge Graph. The total size of all files in this repository is 285MB. Challenge website: https://smc-datachallenge.ornl.gov/2021-challenge-2/ More information about the challenge and the data: https://github.com/ORNL/smcdc-2021-covid-kg
NOMAD total scattering dataset for SMC data challenge

Dataset Peterson, Peter ; Neuefiend, Joerg ; Proffen, Thomas ; ...

The data provided for this challenge was measured using the Nanoscale-Ordered Materials Diffractometer (NOMAD) at the Spallation Neutron Source (SNS) at Oak Ridge National Laboratory. The data is stored in a hdf5 file following the NeXus standard and can be read with tools built for either. While the NeXus format is self-describing, there is benefit to explaining some details. The data is stored in 4 NXentries in the file. The NXentries that begin with âamorphous_SiO2â are for the amorphous data, and the NXentries that begin with âcrystalbolite_SiO2â are for the crystalline material. Solutions that were produced by the scientist aremore »« less
SNAP diffraction dataset for 2023 SMC data challenge

Dataset Peterson, Peter ; Guthrie, Malcom ; Granroth, Garrett

The data provided this challenge is ice under high pressure measured using the Spallation Neutrons and Pressure Diffractometer (SNAP) at the Spallation Neutron Source (SNS) at Oak Ridge National Laboratory. The data is stored in a hdf5 file following the NeXus standard and can be read with tools built for either. While the NeXus format is self-describing, there is benefit to explaining some details. The data is stored in a single NXdata entry within a single NXentry. The NXdata has several fields denoting the 3-dimensional data (signal), the axes (D0 is the Qx axis, D1 is the Qy axis, andmore »« less

Similar Records