Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Using automated machine learning for the upscaling of gross primary productivity

Journal Article · · Biogeosciences (Online)
 [1];  [2];  [3];  [2]
  1. Univ. of California, Berkeley, CA (United States); Univ. of Copenhagen (Denmark)
  2. Univ. of California, Berkeley, CA (United States); Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
  3. Univ. of Copenhagen (Denmark)
Estimating gross primary productivity (GPP) over space and time is fundamental for understanding the response of the terrestrial biosphere to climate change. Eddy covariance flux towers provide in situ estimates of GPP at the ecosystem scale, but their sparse geographical distribution limits larger-scale inference. Machine learning (ML) techniques have been used to address this problem by extrapolating local GPP measurements over space using satellite remote sensing data. However, the accuracy of the regression model can be affected by uncertainties introduced by model selection, parameterization, and choice of explanatory features, among others. Recent advances in automated ML (AutoML) provide a novel automated way to select and synthesize different ML models. In this work, we explore the potential of AutoML by training three major AutoML frameworks on eddy covariance measurements of GPP at 243 globally distributed sites. We compared their ability to predict GPP and its spatial and temporal variability based on different sets of remote sensing explanatory variables. Explanatory variables from only Moderate Resolution Imaging Spectroradiometer (MODIS) surface reflectance data and photosynthetically active radiation explained over 70 % of the monthly variability in GPP, while satellite-derived proxies for canopy structure, photosynthetic activity, environmental stressors, and meteorological variables from reanalysis (ERA5-Land) further improved the frameworks' predictive ability. We found that the AutoML framework Auto-sklearn consistently outperformed other AutoML frameworks as well as a classical random forest regressor in predicting GPP but with small performance differences, reaching an r2 of up to 0.75. We deployed the best-performing framework to generate global wall-to-wall maps highlighting GPP patterns in good agreement with satellite-derived reference data. This research benchmarks the application of AutoML in GPP estimation and assesses its potential and limitations in quantifying global photosynthetic activity.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
National Aeronautics and Space Administration (NASA); USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
AC02-05CH11231; SC0021023
OSTI ID:
2404343
Journal Information:
Biogeosciences (Online), Journal Name: Biogeosciences (Online) Journal Issue: 10 Vol. 21; ISSN 1726-4189
Publisher:
Copernicus Publications, EGUCopyright Statement
Country of Publication:
United States
Language:
English

References (78)

Effect of spatial sampling from European flux towers for estimating carbon and water fluxes with artificial neural networks: SAMPLING EFFECT ON FLUXES UPSCALING journal October 2015
Spatiotemporal patterns of terrestrial gross primary production: A review: GPP Spatiotemporal Patterns journal August 2015
Canopy and physiological controls of GPP during drought and heat wave journal April 2016
Estimating morning change in land surface temperature from MODIS day/night observations: Applications for surface energy balance modeling journal October 2017
Chlorophyll Fluorescence Better Captures Seasonal and Interannual Gross Primary Productivity Dynamics Across Dryland Ecosystems of Southwestern North America journal January 2018
A global study of GPP focusing on light-use efficiency in a random forest regression model journal May 2017
Automated Machine Learning: Methods, Systems, Challenges book January 2019
River flow forecasting through conceptual models part I — A discussion of principles journal April 1970
Estimation of net ecosystem carbon exchange for the conterminous United States by combining MODIS and AmeriFlux data journal October 2008
Modeling gross primary production of irrigated and rain-fed maize using MODIS imagery and CO2 flux tower data journal December 2011
Global comparison of light use efficiency models for simulating terrestrial vegetation gross primary production based on the LaThuile database journal July 2014
Revisiting the choice of the driving temperature for eddy covariance CO2 flux partitioning journal May 2017
Satellite-based reflectances capture large fraction of variability in global gross primary production (GPP) at weekly time scales journal September 2020
Representativeness of Eddy-Covariance flux footprints for areas surrounding AmeriFlux sites journal May 2021
A changing Amazon rainforest: Historical trends and future projections under post-Paris climate scenarios journal December 2020
AutoGluon: A revolutionary framework for landslide hazard analysis journal September 2021
Evaluation of MODIS NPP and GPP products across multiple biomes journal June 2006
ESA CCI Soil Moisture for improved Earth system understanding: State-of-the art and future directions journal December 2017
MODIS-derived global land products of shortwave radiation and diffuse and total photosynthetically active radiation at 5 km resolution from 2000 journal January 2018
Understanding the relationship between vegetation greenness and productivity across dryland ecosystems through the integration of PhenoCam, satellite, and eddy covariance data journal March 2019
Remote sensing of dryland ecosystem structure and function: Progress, challenges, and opportunities journal November 2019
Estimation of root zone soil moisture from ground and remotely sensed soil information with multisensor data fusion and automated machine learning journal July 2021
Summary for Policymakers book July 2023
Assessing and improving the representativeness of monitoring networks: The European flux tower network example journal January 2011
Global patterns of land-atmosphere fluxes of carbon dioxide, latent heat, and sensible heat derived from eddy covariance, satellite, and meteorological observations journal January 2011
Midwest US Croplands Determine Model Divergence in North American Carbon Fluxes journal May 2021
Direct human influence on atmospheric CO2 seasonality from increased cropland productivity journal November 2014
Recent pause in the growth rate of atmospheric CO2 due to enhanced terrestrial carbon uptake journal November 2016
Spatial validation reveals poor predictive performance of large-scale ecological mapping models journal September 2020
Widespread inhibition of daytime ecosystem respiration journal February 2019
Drought impacts on terrestrial primary production underestimated by satellite monitoring journal March 2019
Dominant role of soil moisture in mediating carbon and water fluxes in dryland ecosystems journal January 2024
Large influence of soil moisture on long-term terrestrial carbon uptake journal January 2019
Deep learning and process understanding for data-driven Earth system science journal February 2019
Large Chinese land carbon sink estimated from atmospheric carbon dioxide data journal October 2020
Author Correction: The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data journal February 2021
China and India lead in greening of the world through land-use management journal February 2019
Improved dryland carbon flux predictions with explicit consideration of water-carbon coupling journal December 2021
Heatwave effects on gross primary production of northern mid-latitude ecosystems journal July 2020
Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools conference November 2019
A Comparison of AutoML Tools for Machine Learning, Deep Learning and XGBoost conference July 2021
Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure journal March 2017
Effects of climate extremes on the terrestrial carbon cycle: concepts, processes and potential future impacts journal May 2015
Partitioning net carbon dioxide fluxes into photosynthesis and respiration using neural networks journal July 2020
Surface temperatures reveal the patterns of vegetation water stress and their environmental drivers across the tropical Americas journal March 2022
Environmental controls on the light use efficiency of terrestrial gross primary production journal November 2022
On the separation of net ecosystem exchange into assimilation and ecosystem respiration: review and improved algorithm journal September 2005
Quantifying soil moisture impacts on light use efficiency across biomes journal March 2018
Terrestrial Gross Carbon Dioxide Uptake: Global Distribution and Covariation with Climate journal July 2010
The dominant role of semi-arid ecosystems in the trend and variability of the land CO2 sink journal May 2015
Ensemble selection from libraries of models conference January 2004
Auto-WEKA conference August 2013
Modeling watershed nutrient concentrations with AutoML conference September 2020
Can AutoML outperform humans? An evaluation on popular OpenML datasets using AutoML Benchmark conference December 2020
AutoML to Date and Beyond: Challenges and Opportunities journal October 2021
First- and Second-Order Conservative Remapping Schemes for Grids in Spherical Coordinates journal September 1999
Compact Neural Architecture Search for Local Climate Zones Classification conference January 2021
Benchmark and Survey of Automated Machine Learning Frameworks journal January 2021
A Continuous Satellite-Derived Measure of Global Terrestrial Primary Production journal January 2004
Super Learner journal January 2007
Development of Multi-Inflow Prediction Ensemble Model Based on Auto-Sklearn Using Combined Approach: Case Study of Soyang River Dam journal April 2023
Estimation of Global Vegetation Productivity from Global LAnd Surface Satellite Data journal February 2018
Estimation of Terrestrial Global Gross Primary Production (GPP) with Satellite Data-Driven Models and Eddy Covariance Flux Data journal August 2018
Water-Quality Prediction Based on H2O AutoML and Explainable AI Techniques journal January 2023
Predicting carbon dioxide and energy fluxes across global FLUXNET sites with regression algorithms journal January 2016
Impacts of droughts and extreme-temperature events on gross primary production and ecosystem respiration: a systematic assessment across ecosystems and climate zones journal January 2018
Does predictability of fluxes vary between FLUXNET sites? journal January 2018
A global spatially contiguous solar-induced fluorescence (CSIF) dataset using neural networks journal January 2018
Scaling carbon fluxes from eddy covariance sites to globe: synthesis and evaluation of the FLUXCOM approach journal January 2020
Large-scale biospheric drought response intensifies linearly with drought duration in arid regions journal May 2020
Machine learning estimates of eddy covariance carbon flux in a scrub in the Mexican highland journal January 2021
Technical note: A view from space on global flux towers by MODIS and Landsat: the FluxnetEO data set journal June 2022
Upscaling dryland carbon and water fluxes with artificial neural networks of optical, thermal, and microwave satellite remote sensing journal January 2023
Upscaled diurnal cycles of land–atmosphere fluxes: a new global half-hourly data product journal January 2018
Global Carbon Budget 2019 journal January 2019
Evolution of the ESA CCI Soil Moisture climate data records and their underlying merging methodology journal January 2019
ERA5-Land: a state-of-the-art global reanalysis dataset for land applications journal January 2021
Assessing vegetation variability and trends in north-eastern Brazil using AVHRR and MODIS NDVI time series journal January 2013

Similar Records

Increased photosynthesis during spring drought in energy-limited ecosystems
Journal Article · Tue Nov 28 19:00:00 EST 2023 · Nature Communications · OSTI ID:2222964

Chlorophyll Fluorescence Better Captures Seasonal and Interannual Gross Primary Productivity Dynamics Across Dryland Ecosystems of Southwestern North America
Journal Article · Wed Jan 03 19:00:00 EST 2018 · Geophysical Research Letters · OSTI ID:1537304

Related Subjects