Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Predicting measures of soil health using the microbiome and supervised machine learning

Journal Article · · Soil Biology and Biochemistry
Soil health encompasses a range of biological, chemical, and physical soil properties that sustain the commercial and ecological value of agroecosystems. Monitoring soil health requires a comprehensive set of diagnostics that can be cost-prohibitive for routine analyses. The soil microbiome provides a rich source of information about soil properties, which can be assayed in a high-throughput, cost-effective way. We evaluated the accuracy of random forest (RF) and support vector machine (SVM) regression and classification models in predicting 12 measures of soil health, tillage status, and soil texture from 16S rRNA gene amplicon data with an operationally relevant sample set. We validated the efficacy of the best performing models against independent datasets and also tested best practices for processing microbiome data for use in machine learning. Soil health metrics could be predicted from microbiome data with the best models achieving a Kappa value of ~0.65, for categorical assessments, and a R2 value of ~0.8, for numerical scores. Biological health ratings were better predicted than chemical or physical ratings. Validation with independent datasets revealed that models had general predictive value for soil properties, including yield. The ecological profiles of several taxa important for model accuracy matched the observed relationships with soil health, including Pyrinomonadaceae, Nitrososphaeraceae, and Candidatus Udeaobacter. Models trained at the highest taxonomic resolution proved most accurate, with losses in accuracy resulting from rarefying, sparsity filtering, and aggregating at higher taxonomic ranks. Furthermore, our study provides the groundwork for developing scalable technology to use microbiome-based diagnostics for the assessment of soil health.
Research Organization:
Cornell Univ., Ithaca, NY (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
Grant/Contract Number:
SC0016364
OSTI ID:
1863901
Journal Information:
Soil Biology and Biochemistry, Journal Name: Soil Biology and Biochemistry Vol. 164; ISSN 0038-0717
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (71)

Introducing the North American project to evaluate soil health measurements journal May 2020
Soil health and global sustainability: translating science into practice journal February 2002
Harvested perennial grasslands provide ecological benchmarks for agricultural sustainability journal April 2010
Legacy effects of contrasting organic grain cropping systems on soil health indicators, soil invertebrates, weeds, and crop yield journal January 2020
Soil health characterization in smallholder agricultural catchments in India journal June 2019
Human-Associated Microbial Signatures: Examining Their Predictive Value journal October 2011
Soil Health Gap: A concept to establish a benchmark for soil health management journal September 2020
No-till and cropping system diversification improve soil health and crop yield journal October 2018
The influence of soil management on soil health: An on-farm study in southern Sweden journal February 2020
Linking microbial taxa and the effect of mineral nitrogen forms on residue decomposition at the early stage in arable soil by DNA-qSIP journal October 2021
Effects of biological soil crusts on surface roughness and implications for runoff and erosion journal April 2012
Effect of soil sample preservation, compared to the effect of other environmental variables, on bacterial and eukaryotic diversity journal March 2009
Soil health assessment: A critical review of current methodologies and a proposed new approach journal January 2019
Survival of bacterial DNA and culturable bacteria in archived soils from the Rothamsted Broadbalk experiment journal May 2008
Prevalent root-derived phenolics drive shifts in microbial community composition and prime decomposition in forest soil journal June 2020
Plant Growth‐Promoting Rhizobacteria (PGPR) Reduce Evaporation and Increase Soil Water Retention journal May 2018
UniFrac: an effective distance metric for microbial community comparison journal September 2010
Spatial distribution of ammonia-oxidizing bacteria and archaea across a 44-hectare farm related to ecosystem functioning journal January 2011
Soil bacterial and fungal communities across a pH gradient in an arable soil journal May 2010
Distinct soil microbial diversity under long-term organic and conventional farming journal October 2014
Microbial and biochemical basis of a Fusarium wilt-suppressive soil journal June 2015
Biogeography and organic matter removal shape long-term effects of timber harvesting on forest soil microbial communities journal July 2017
DADA2: High-resolution sample inference from Illumina amplicon data journal May 2016
Soil pH mediates the balance between stochastic and deterministic assembly of bacteria journal March 2018
Establishing microbial composition measurement standards with reference frames journal June 2019
Palaeoclimate explains a unique proportion of the global variation in soil bacterial communities journal August 2017
Soil biota contributions to soil aggregation journal October 2017
Microbiome analyses of blood and tissues suggest cancer diagnostic approach journal March 2020
Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2 journal July 2019
The concept and future prospects of soil health journal August 2020
Microbiomic Signatures of Psoriasis: Feasibility and Methodology Comparison journal September 2013
Soil quality change 50 years after forestland conversion to tea farming journal January 2018
Bacterial community structures are unique and resilient in full-scale bioenergy systems journal February 2011
Estimating active carbon for soil quality assessment: A simplified method for laboratory and field use journal March 2003
The role of plant–microbiome interactions in weed establishment and control journal July 2016
Microbial indicators are better predictors of wheat yield and quality than N fertilization journal December 2019
The SILVA ribosomal RNA gene database project: improved data processing and web-based tools journal November 2012
Effects of plant community history, soil legacy and plant diversity on soil microbial communities journal June 2021
Effect of preservation method on the assessment of bacterial community structure in soil and water samples journal June 2014
Methods for normalizing microbiome data: An ecological perspective journal November 2018
Effect of storage conditions on the assessment of bacterial community structure in soil and human-associated samples: Influence of short-term storage conditions on microbiota journal March 2010
Long‐term stability of soil bacterial and fungal community structures revealed in their abundant and rare fractions journal July 2021
Deciphering the Rhizosphere Microbiome for Disease-Suppressive Bacteria journal May 2011
Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform journal June 2013
Characterizing the Key Agents in a Disease-Suppressed Soil Managed by Reductive Soil Disinfestation journal April 2019
A Framework for Effective Application of Machine Learning to Microbiome-Based Classification Problems journal June 2020
Fecal Short-Chain Fatty Acids Are Not Predictive of Colonic Tumor Status and Cannot Be Predicted Based on Bacterial Community Structure journal August 2019
Competitive Exclusion and Metabolic Dependency among Microorganisms Structure the Cellulose Economy of an Agricultural Soil journal February 2021
Longitudinal survey of microbiome associated with particulate matter in a megacity journal March 2020
Using soil bacterial communities to predict physico-chemical variables and soil quality journal June 2020
Complete genome sequence of the thermophilic Acidobacteria, Pyrinomonas methylaliphatogenes type strain K22T journal November 2015
Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights journal July 2016
phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data journal April 2013
Comparative Analysis of Prokaryotic Communities Associated with Organic and Conventional Farming Systems journal December 2015
Machine learning to predict microbial community functions: An analysis of dissolved organic carbon from litter decomposition journal July 2019
Linking ecology and systematics of acidobacteria: Distinct habitat preferences of the Acidobacteriia and Blastocatellia in tundra soils journal March 2020
Building Predictive Models in R Using the caret Package journal January 2008
Soil Protein as a Rapid Soil Health Indicator of Potentially Available Organic Nitrogen journal January 2018
What We Talk about When We Talk about Soil Health journal January 2018
Statistics, Scoring Functions, and Regional Analysis of a Comprehensive Soil Health Database journal May 2017
Reanalysis Validates Soil Health Indicator Sensitivity and Correlation with Long-term Crop Yields journal May 2019
Use of Soil Protein Pools as Indicators of Soil Nitrogen Mineralization Potential journal July 2019
Cyanobacteria Inoculation Improves Soil Stability and Fertility on Different Textured Soils: Gaining Insights for Applicability in Soil Restoration journal June 2018
A Review and Tutorial of Machine Learning Methods for Microbiome Host Trait Prediction journal June 2019
Ca. Nitrososphaera and Bradyrhizobium are inversely correlated and related to agricultural practices in long-term field experiments journal January 2013
The Community Structures of Prokaryotes and Fungi in Mountain Pasture Soils are Highly Correlated and Primarily Influenced by pH journal November 2015
Metagenome-Wide Association Study and Machine Learning Prediction of Bulk Soil Microbiome and Crop Productivity journal April 2017
Temporal Dynamics of Soil Microbial Communities below the Seedbed under Two Contrasting Tillage Regimes journal June 2017
The Microbiome Stress Project: Toward a Global Meta-Analysis of Environmental Stressors and Their Effects on Microbial Communities journal January 2019
Evidence of Soil Health Benefits of Flooded Rice Compared to Fallow Practice journal July 2018
GMPR: A robust normalization method for zero-inflated count data with application to microbiome sequencing data journal January 2018