DOE PAGES · U.S. Department of Energy
Office of Scientific and Technical Information

Title: Predicting Biomass Yields of Advanced Switchgrass Cultivars for Bioenergy and Ecosystem Services Using Machine Learning

Journal Article · Energies
DOI: https://doi.org/10.3390/en16104168 · OSTI ID:2368572

The production of advanced perennial bioenergy crops within marginal areas of the agricultural landscape is gaining interest due to its potential to sustainably produce feedstocks for biofuels and bioproducts while also improving the sustainability and resilience of commodity crop production. However, predicting the biomass yields of this production system is challenging because marginal areas are often relatively small and spread around agricultural fields and are typically associated with various abiotic conditions that limit crop production. Machine learning (ML) offers a viable solution as a biomass yield prediction tool because it is suited to predicting relationships with complex functional associations. The objectives of this study were to (1) evaluate the accuracy of ML algorithms commonly applied in agriculture for predicting the biomass yields of advanced switchgrass cultivars for bioenergy and ecosystem services and (2) determine the most important biomass yield predictors. Datasets on biomass yield, weather, land marginality, soil properties, and agronomic management were generated from three field study sites in two U.S. Midwest states (Illinois and Iowa) over three growing seasons. The ML algorithms evaluated in the study included random forests (RFs), gradient boosting machines (GBMs), artificial neural networks (ANNs), K-neighbors regressor (KNR), AdaBoost regressor (ABR), and partial least squares regression (PLSR). The coefficient of determination (R²) and mean absolute error (MAE) were used to evaluate the predictive accuracy of the tested algorithms. Results showed that the ensemble methods, RF (R² = 0.86, MAE = 0.62 Mg/ha), GBM (R² = 0.88, MAE = 0.57 Mg/ha), and GBM (R² = 0.78, MAE = 0.66 Mg/ha), were the most accurate in predicting biomass yields of the Independence, Liberty, and Shawnee switchgrass cultivars, respectively. This agrees with similar studies that applied ML to multi-feature problems in which traditional statistical methods are less applicable and the datasets were considered relatively small for ANNs. Consistent with previous studies on switchgrass, the most important predictors of biomass yield included average annual temperature, average growing season temperature, sum of the growing season precipitation, field slope, and elevation. This study helps pave the way for applying ML as a management tool for alternative bioenergy landscapes where understanding the agronomic and environmental performance of a multifunctional cropping system seasonally and interannually at the sub-field scale is critical.
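
The abstract reports model accuracy as a coefficient of determination and a mean absolute error. As a minimal sketch of how these two metrics are conventionally computed (not the authors' code), the example below uses the standard definitions: MAE as the mean of the absolute errors, and the coefficient of determination as one minus the ratio of the residual sum of squares to the total sum of squares. The function names and the yield values are hypothetical and invented purely for illustration; they are not data from the study.

```python
from statistics import mean


def mean_absolute_error(observed, predicted):
    """Mean of |observed - predicted|, in the units of the data (here Mg/ha)."""
    return mean(abs(o - p) for o, p in zip(observed, predicted))


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    obs_mean = mean(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - obs_mean) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Hypothetical biomass yields in Mg/ha, made up for this illustration only.
observed = [6.0, 8.0, 10.0, 12.0]
predicted = [6.5, 7.5, 10.5, 11.0]

mae = mean_absolute_error(observed, predicted)
r2 = r_squared(observed, predicted)
print(f"MAE = {mae:.3f} Mg/ha, R2 = {r2:.4f}")
```

MAE stays in the units of the target, so it reads directly as a typical yield error in Mg/ha, while the coefficient of determination is unitless and expresses the share of yield variance the model explains. Reporting both, as the abstract does, lets accuracy be compared across cultivars with different yield ranges.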

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Energy Efficiency and Renewable Energy (EERE), Office of Sustainable Transportation. Bioenergy Technologies Office (BETO)
Grant/Contract Number:
AC02-06CH11357; EE0008521
OSTI ID:
2368572
Alternate ID(s):
OSTI ID: 2537901
Journal Information:
Energies, Vol. 16, Issue 10; ISSN 1996-1073
Publisher:
MDPI
Country of Publication:
United States
Language:
English
