DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Reviews and syntheses: The promise of big diverse soil data, moving current practices towards future potential

Journal Article · Biogeosciences (Online)

Abstract. In the age of big data, soil data are more available and richer than ever, but – outside of a few large soil survey resources – they remain largely unusable for informing soil management and understanding Earth system processes beyond the original study. Data science has promised a fully reusable research pipeline where data from past studies are used to contextualize new findings and reanalyzed for new insight. Yet synthesis projects encounter challenges at all steps of the data reuse pipeline, including unavailable data, labor-intensive transcription of datasets, incomplete metadata, and a lack of communication between collaborators. Here, using insights from a diversity of soil, data, and climate scientists, we summarize current practices in soil data synthesis across all stages of database creation: availability, input, harmonization, curation, and publication. We then suggest new soil-focused semantic tools to improve existing data pipelines, such as ontologies, vocabulary lists, and community practices. Our goal is to provide the soil data community with an overview of current practices in soil data and where we need to go to fully leverage big data to solve soil problems in the next century.

Sponsoring Organization:
USDOE
OSTI ID:
1878417
Journal Information:
Biogeosciences (Online), Vol. 19, Issue 14; ISSN 1726-4189
Publisher:
Copernicus GmbH
Country of Publication:
Germany
Language:
English
