Model America: Data and Models for every U.S. Building
- Oak Ridge National Laboratory
- B13 Engineering
- National Renewable Energy Laboratory
- Illinois Institute of Technology
- The University of Tennessee
The 5-year goal of the “Model America” concept was to generate a model of every building in the United States. This data repository delivers on that goal with "Model America v1".Oak Ridge National Laboratory (ORNL) has developed the Automatic Building Energy Modeling (AutoBEM) software suite to process multiple types of data, extract building-specific descriptors, generate building energy models, and simulate them on High Performance Computing (HPC) resources. For more information, see AutoBEM-related publications (bit.ly/AutoBEM).There were 125,715,609 buildings detected in the United States. Of this number, 122,146,671 (97.2%) buildings resulted in a successful generation and simulation of a building energy model. This dataset includes the full 125 million buildings. Future updates may include additional buildings, data improvements, or other algorithmic model enhancements in "Model America v2".This dataset contains OSM and IDF zip files for every U.S. county. Each zip file contains the generated buildings from that county.The .csv input data contains the following data fields:1. ID - unique building ID2. Centroid - building center location in latitude/longitude (from Footprint2D)3. Footprint2D - building polygon of 2D footprint (lat1/lon1_lat2/lon2_...)4. State_abbr - state name5. Area - estimate of total conditioned floor area (ft2)6. Area2D - footprint area (ft2)7. Height - building height (ft)8. NumFloors - number of floors (above-grade)9. WWR_surfaces - percent of each facade (pair of points from Footprint2D) covered by fenestration/windows (average 14.5% for residential, 40% for commercial buildings)10. CZ - ASHRAE Climate Zone designation11. BuildingType - DOE prototype building designation (IECC=residential) as implemented by OpenStudio-standards12. Standard - building vintageThis data is made free and openly available in hopes of stimulating any simulation-informed use case. Data is provided as-is with no warranties, express or implied, regarding fitness for a particular purpose. We wish to thank our sponsors which include Oak Ridge National Laboratory (ORNL) Laboratory Directed Research and Development (LDRD), U.S. Dept. of Energy’s (DOE) Building Technologies Office (BTO), Office of Electricity (OE), Biological and Environmental Research (BER), and National Nuclear Security Administration (NNSA).Update (March 2025): Corrected the ID field in all state-level .csv input files to ensure one-to-one consistency with the corresponding .osm and .idf output files. The schema and file structure are unchanged; only the values in the ID column were modified. No files were added or removed, and the .zip bundles (containing .osm/.idf) are unchanged. The corrected .csv inputs were re-extracted in March 2025 from the original data generated ~2021 (Theta supercomputer runs), and published here to align input IDs with model outputs.
- Research Organization:
- Southwest Urban Corridor Integrated Field Laboratory (SW-IFL)
- Sponsoring Organization:
- U.S. DOE > Office of Science > Biological and Environmental Research (BER)
- DOE Contract Number:
- AC02-05CH11231
- OSTI ID:
- 2283980
- Country of Publication:
- United States
- Language:
- English
Similar Records
Model America – data and models of every U.S. building
3D Reality Energy Modeling Software