DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Combining compositional data sets introduces error in covariance network reconstruction

Journal Article · · ISME Communications

Microbial communities are diverse biological systems that include taxa from across multiple kingdoms of life. Notably, interactions between bacteria and fungi play a significant role in determining community structure. However, these statistical associations across kingdoms are more difficult to infer than intra-kingdom associations due to the nature of the data involved using standard network inference techniques. We quantify the challenges of cross-kingdom network inference from both theoretical and practical points of view using synthetic and real-world microbiome data. We detail the theoretical issue presented by combining compositional data sets drawn from the same environment, e.g. 16S and ITS sequencing of a single set of samples, and we survey common network inference techniques for their ability to handle this error. We then test these techniques for the accuracy and usefulness of their intra- and inter-kingdom associations by inferring networks from a set of simulated samples for which a ground-truth set of associations is known. We show that while the two methods mitigate the error of cross-kingdom inference, there is little difference between techniques for key practical applications including identification of strong correlations and identification of possible keystone taxa (i.e. hub nodes in the network). Furthermore, we identify a signature of the error caused by transkingdom network inference and demonstrate that it appears in networks constructed using real-world environmental microbiome data.

Research Organization:
Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
89233218CNA000001
OSTI ID:
2426739
Report Number(s):
LA-UR--23-32660
Journal Information:
ISME Communications, Journal Name: ISME Communications Journal Issue: 1 Vol. 4; ISSN 2730-6151
Publisher:
Springer NatureCopyright Statement
Country of Publication:
United States
Language:
English

References (52)

Absolute quantitation of microbes using 16S rRNA gene metabarcoding: A rapid normalization of relative abundances by quantitative PCR targeting a 16S rRNA gene spike‐in standard journal January 2020
A review of normalization and differential abundance methods for microbiome counts data journal May 2022
On criteria for measures of compositional difference journal May 1992
JuMP 1.0: recent improvements to a modeling language for mathematical optimization journal June 2023
Compositional data analysis of the microbiome: fundamentals, tools, and challenges journal May 2016
Comprehensive evaluation of shotgun metagenomics, amplicon sequencing, and harmonization of these platforms for epidemiological studies journal January 2023
PCR-based quantification of taxa-specific abundances in microbial communities: Quantifying and avoiding common pitfalls journal October 2018
Network analysis reveals functional redundancy and keystone taxa amongst bacterial and fungal communities during organic matter decomposition in an arable soil journal June 2016
Resource-allocation constraint governs structure and function of microbial communities in metabolic modeling journal March 2022
Modular community structure suggests metabolic plasticity during the transition to polar night in ice-covered Antarctic lakes journal October 2013
Correlation detection strategies in microbial data sets vary widely in sensitivity and precision journal February 2016
Absolute quantification of microbial taxon abundances journal September 2016
Shotgun metagenomics, from sampling to analysis journal September 2017
Soil bacterial networks are less stable under drought than fungal networks journal August 2018
Top-down identification of keystone taxa in the microbiome journal July 2023
Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4 journal February 2023
Challenges in benchmarking metagenomic profilers journal May 2021
Characterizing both bacteria and fungi improves understanding of the Arabidopsis root microbiome journal January 2019
Metagenomic profiling pipelines improve taxonomic classification for 16S amplicon sequencing data journal August 2023
Fast unfolding of communities in large networks journal October 2008
Learning Microbial Interaction Networks from Metagenomic Count Data journal June 2016
CCLasso: correlation inference for compositional data through Lasso journal June 2015
DIABLO: an integrative approach for identifying key molecular drivers from multi-omics assays journal January 2019
Sparse inverse covariance estimation with the graphical lasso journal December 2007
Accurate read-based metagenome characterization using a hierarchical suite of unique signatures journal March 2015
Ergosterol extraction: a comparison of methodologies journal April 2023
A Graphical Model for Fusing Diverse Microbiome Data journal January 2023
Determinants of bacterial communities in Canadian agroforestry systems: Co-occurrence patterns of soil bacterial communities journal August 2015
Keystone taxa predict compositional change in microbial communities journal June 2018
The quest for absolute abundance: The use of internal standards for DNA‐based community ecology journal September 2020
SCNIC: Sparse correlation network investigation for compositional data journal September 2022
Pitfalls in the statistical analysis of microbiome amplicon sequencing data journal November 2022
The Statistical Analysis of Compositional Data journal January 1982
A step forward in fungal biomass estimation – a new protocol for more precise measurements of soil ergosterol with liquid chromatography‐mass spectrometry and comparison of extraction methods journal November 2023
MICOM: Metagenome-Scale Modeling To Infer Metabolic Interactions in the Gut Microbiota journal January 2020
Fungal microbiota dysbiosis in IBD journal February 2016
Fungi participate in the dysbiosis of gut microbiota in patients with primary sclerosing cholangitis journal April 2019
MDSINE: Microbial Dynamical Systems INference Engine for microbiome time-series analyses journal June 2016
Improved metagenomic analysis with Kraken 2 journal November 2019
MOFA+: a statistical framework for comprehensive integration of multi-modal single-cell data journal May 2020
Fungi stabilize connectivity in the lung and skin microbial ecosystems journal January 2018
Opportunities and challenges of using metagenomic data to bring uncultured microbes into cultivation journal May 2022
Microbial Hub Taxa Link Host and Abiotic Factors to Plant Microbiome Variation journal January 2016
Microbial Co-occurrence Relationships in the Human Microbiome journal July 2012
Inferring Correlation Networks from Genomic Survey Data journal September 2012
Sparse and Compositionally Robust Inference of Microbial Ecological Networks journal May 2015
Identification of fungi in shotgun metagenomics datasets journal February 2018
The Keystone-Species Concept in Ecology and Conservation journal April 1993
Deciphering microbial interactions and detecting keystone species with co-occurrence networks journal May 2014
Microbiome Datasets Are Compositional: And This Is Not Optional journal November 2017
Cross-kingdom co-occurrence networks in the plant microbiome: Importance and ecological interpretations journal July 2022
Detection of stable community structures within gut microbiota co-occurrence networks from different human populations journal February 2018