Predicting measures of soil health using the microbiome and supervised machine learning
Journal Article
·
· Soil Biology and Biochemistry
- Cornell Univ., Ithaca, NY (United States); Cornell
- Cornell Univ., Ithaca, NY (United States)
Soil health encompasses a range of biological, chemical, and physical soil properties that sustain the commercial and ecological value of agroecosystems. Monitoring soil health requires a comprehensive set of diagnostics that can be cost-prohibitive for routine analyses. The soil microbiome provides a rich source of information about soil properties, which can be assayed in a high-throughput, cost-effective way. We evaluated the accuracy of random forest (RF) and support vector machine (SVM) regression and classification models in predicting 12 measures of soil health, tillage status, and soil texture from 16S rRNA gene amplicon data with an operationally relevant sample set. We validated the efficacy of the best performing models against independent datasets and also tested best practices for processing microbiome data for use in machine learning. Soil health metrics could be predicted from microbiome data with the best models achieving a Kappa value of ~0.65, for categorical assessments, and a R2 value of ~0.8, for numerical scores. Biological health ratings were better predicted than chemical or physical ratings. Validation with independent datasets revealed that models had general predictive value for soil properties, including yield. The ecological profiles of several taxa important for model accuracy matched the observed relationships with soil health, including Pyrinomonadaceae, Nitrososphaeraceae, and Candidatus Udeaobacter. Models trained at the highest taxonomic resolution proved most accurate, with losses in accuracy resulting from rarefying, sparsity filtering, and aggregating at higher taxonomic ranks. Furthermore, our study provides the groundwork for developing scalable technology to use microbiome-based diagnostics for the assessment of soil health.
- Research Organization:
- Cornell Univ., Ithaca, NY (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
- Grant/Contract Number:
- SC0016364
- OSTI ID:
- 1863901
- Journal Information:
- Soil Biology and Biochemistry, Journal Name: Soil Biology and Biochemistry Vol. 164; ISSN 0038-0717
- Publisher:
- ElsevierCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Ecological insights into soil health according to the genomic traits and environment-wide associations of bacteria in agricultural soils
Human limits in machine learning: prediction of potato yield and disease using soil microbiome data
Journal Article
·
Sun Jan 08 23:00:00 EST 2023
· ISME Communications
·
OSTI ID:1908294
Human limits in machine learning: prediction of potato yield and disease using soil microbiome data
Journal Article
·
Mon Nov 25 23:00:00 EST 2024
· BMC Bioinformatics
·
OSTI ID:2478895