skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: KGML-ag: a modeling framework of knowledge-guided machine learning to simulate agroecosystems: a case study of estimating N<sub>2</sub>O emission using data from mesocosm experiments

Journal Article · · Geoscientific Model Development (Online)

Agricultural nitrous oxide (N2O) emission accounts for a non-trivial fraction of global greenhouse gas (GHG) budget. To date, estimating N2O fluxes from cropland remains a challenging task because the related microbial processes (e.g., nitrification and denitrification) are controlled by complex interactions among climate, soil, plant and human activities. Existing approaches such as process-based (PB) models have well-known limitations due to insufficient representations of the processes or uncertainties of model parameters, and due to leverage recent advances in machine learning (ML) a new method is needed to unlock the “black box” to overcome its limitations such as low interpretability, out-of-sample failure and massive data demand. In this study, we developed a first-of-its-kind knowledge-guided machine learning model for agroecosystems (KGML-ag) by incorporating biogeophysical and chemical domain knowledge from an advanced PB model, ecosys, and tested it by comparing simulating daily N2O fluxes with real observed data from mesocosm experiments. The gated recurrent unit (GRU) was used as the basis to build the model structure. To optimize the model performance, we have investigated a range of ideas, including (1) using initial values of intermediate variables (IMVs) instead of time series as model input to reduce data demand; (2) building hierarchical structures to explicitly estimate IMVs for further N2O prediction; (3) using multi-task learning to balance the simultaneous training on multiple variables; and (4) pre-training with millions of synthetic data generated from ecosys and fine-tuning with mesocosm observations. Six other pure ML models were developed using the same mesocosm data to serve as the benchmark for the KGML-ag model. Results show that KGML-ag did an excellent job in reproducing the mesocosm N2O fluxes (overall r2=0.81, and RMSE=3.6 mgNm-2d-1 from cross validation). Importantly, KGML-ag always outperforms the PB model and ML models in predicting N2O fluxes, especially for complex temporal dynamics and emission peaks. Besides, KGML-ag goes beyond the pure ML models by providing more interpretable predictions as well as pinpointing desired new knowledge and data to further empower the current KGML-ag. We believe the KGML-ag development in this study will stimulate a new body of research on interpretable ML for biogeochemistry and other related geoscience processes.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Advanced Research Projects Agency - Energy (ARPA-E); USDOE Office of Science (SC), Biological and Environmental Research (BER); National Science Foundation (NSF)
Grant/Contract Number:
AR0001382; AC02-05CH11231; 2034385
OSTI ID:
1861922
Alternate ID(s):
OSTI ID: 1882849
Journal Information:
Geoscientific Model Development (Online), Journal Name: Geoscientific Model Development (Online) Vol. 15 Journal Issue: 7; ISSN 1991-9603
Publisher:
Copernicus GmbHCopyright Statement
Country of Publication:
Germany
Language:
English

References (45)

The global nitrous oxide budget revisited journal February 2011
Temperature sensitivity of N 2 O emissions from fertilized agricultural soils: Mathematical modeling in ecosys: MODELING N 2 O EMISSIONS journal December 2008
Predicting lake surface water phosphorus dynamics using process-guided machine learning journal August 2020
General model for N 2 O and N 2 gas emissions from soils due to dentrification journal December 2000
Quantifying nitrogen loss hotspots and mitigation potential for individual fields in the US Corn Belt with a metamodeling approach journal July 2021
On the Properties of Neural Machine Translation: Encoder–Decoder Approaches
  • Cho, Kyunghyun; van Merrienboer, Bart; Bahdanau, Dzmitry
  • Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation https://doi.org/10.3115/v1/W14-4012
conference January 2014
Response of nitrous oxide emissions to individual rain events and future changes in precipitation journal March 2022
Enforcing Analytic Constraints in Neural Networks Emulating Physical Systems journal March 2021
Mathematical modeling of nitrous oxide emissions from an agricultural field during spring thaw journal June 1999
Modeling the Effects of Fertilizer Application Rate on Nitrous Oxide Emissions journal January 2006
Changes in net ecosystem productivity and greenhouse gas exchange with fertilization of Douglas fir: Mathematical modeling in ecosys journal January 2010
Nitrous oxide emissions from soils: how well do we understand the processes and their controls?
  • Butterbach-Bahl, Klaus; Baggs, Elizabeth M.; Dannenmann, Michael
  • Philosophical Transactions of the Royal Society B: Biological Sciences, Vol. 368, Issue 1621 https://doi.org/10.1098/rstb.2013.0122
journal July 2013
Process‐Guided Deep Learning Predictions of Lake Water Temperature journal November 2019
Testing a Conceptual Model of Soil Emissions of Nitrous and Nitric Oxides journal January 2000
Machine learning for predicting greenhouse gas emissions from agricultural soils journal November 2020
The development of the DNDC plant growth sub-model and the application of DNDC in agriculture: A review journal August 2016
APSIM – Evolution towards a new generation of agricultural systems simulation journal December 2014
Modelling variability in N2O emissions from fertilized agricultural fields journal February 2003
Globally important nitrous oxide emissions from croplands induced by freeze–thaw cycles journal March 2017
Towards hybrid modeling of the global hydrological cycle journal January 2022
Long Short-Term Memory journal November 1997
Fertilizer Source and Tillage Effects on Yield-Scaled Nitrous Oxide Emissions in a Corn Cropping System journal January 2011
Continental-scale water and energy flux analysis and validation for the North American Land Data Assimilation System project phase 2 (NLDAS-2): 1. Intercomparison and application of model products: WATER AND ENERGY FLUX ANALYSIS journal February 2012
An overview of APSIM, a model designed for farming systems simulation journal January 2003
Sampling frequency affects estimates of annual nitrous oxide fluxes journal November 2015
Stacking ecosystem services journal April 2014
A comprehensive quantification of global nitrous oxide sources and sinks journal October 2020
Acceleration of global N2O emissions seen from two decades of atmospheric inversion journal November 2019
Physics-Guided Machine Learning for Scientific Discovery: An Application in Simulating Lake Temperature Profiles journal May 2021
Deep learning and process understanding for data-driven Earth system science journal February 2019
Physics Guided RNNs for Modeling Dynamical Systems: A Case Study in Simulating Lake Temperature Profiles book May 2019
Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data journal October 2017
Using the ecosys mathematical model to simulate temporal variability of nitrous oxide emissions from a fertilized agricultural soil journal December 2009
Evaluation of carbon isotope flux partitioning theory under simplified and controlled environmental conditions journal February 2012
Machine learning improves predictions of agricultural nitrous oxide (N 2 O) emissions from intensively managed cropping systems journal January 2021
Automated, Low‐Power Chamber System for Measuring Nitrous Oxide Emissions journal March 2013
Towards a multiscale crop modelling framework for climate change adaptation assessment journal April 2020
A simulation model linking crop growth and soil biogeochemistry for sustainable agriculture journal May 2002
Understanding the DayCent model: Calibration, sensitivity, and identifiability through inverse modeling journal April 2015
Denitrification in soil as a function of oxygen availability at the microscale journal January 2021
Towards neural Earth system modelling by integrating artificial intelligence in Earth system science journal August 2021
The digital revolution of Earth-system science journal February 2021
Ecological controls on N 2 O emission in surface litter and near-surface soil of a managed grassland: modelling and measurements journal January 2016
Quantifying carbon budget, crop yields and their responses to environmental variability using the ecosys model for U.S. Midwestern agroecosystems journal September 2021
Uncertainties in the Emissions Database for Global Atmospheric Research (EDGAR) emission inventory of greenhouse gases journal January 2021