DOE Data Explorer title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: JGI QC impact on assembly, binning, phylogenomics, and functional analysis

Abstract

Background Investigators using metagenomic sequencing to study their microbiomes are often provided data that has been trimmed and decontaminated or do it themselves without knowing the effect these procedures can have on their downstream analyses. Here we evaluated the impact that JGI trimming and decontamination procedures had on assembly and binning metrics, placement of metagenome assembled genomes into species trees, and functional profiles of metagenome-assembled genomes (MAGs) extracted from twenty three complex rhizosphere metagenomes. We also investigated how more aggressive trimming impacts these binning metrics. Results We found that JGI trimmed and decontamination of input reads had some significant impacts in assembly and binning metrics compared to raw reads, and that differences in placement of MAGs in species trees increased with decreasing completeness and contamination thresholds. More aggressive trimming beyond those used by JGI were found to reduce MAG counts. Conclusions Mild trimming and decontamination of metagenomics reads prior to assembly can change an investigator’s answer to the questions, “Who is there and what are they doing? However, mild trimming and decontamination of metagenomic reads with high quality scores is recommended for those who elect to do so.

Authors:
ORCiD logo
  1. North Carolina State Univ., Raleigh, NC (United States); Department of Energy Systems Biology Knowledgebase (KBase)
Publication Date:
Research Org.:
North Carolina State Univ., Raleigh, NC (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
Subject:
59 BASIC BIOLOGICAL SCIENCES; metagenomics, decontamination, assembly, binning, phylogenomics, functional analysis
OSTI Identifier:
1779219
DOI:
https://doi.org/10.25982/62657.1515/1779219

Citation Formats

Whitham, Jason. JGI QC impact on assembly, binning, phylogenomics, and functional analysis. United States: N. p., 2021. Web. doi:10.25982/62657.1515/1779219.
Whitham, Jason. JGI QC impact on assembly, binning, phylogenomics, and functional analysis. United States. doi:https://doi.org/10.25982/62657.1515/1779219
Whitham, Jason. 2021. "JGI QC impact on assembly, binning, phylogenomics, and functional analysis". United States. doi:https://doi.org/10.25982/62657.1515/1779219. https://www.osti.gov/servlets/purl/1779219. Pub date:Fri Jan 01 04:00:00 UTC 2021
@article{osti_1779219,
title = {JGI QC impact on assembly, binning, phylogenomics, and functional analysis},
author = {Whitham, Jason},
abstractNote = {Background Investigators using metagenomic sequencing to study their microbiomes are often provided data that has been trimmed and decontaminated or do it themselves without knowing the effect these procedures can have on their downstream analyses. Here we evaluated the impact that JGI trimming and decontamination procedures had on assembly and binning metrics, placement of metagenome assembled genomes into species trees, and functional profiles of metagenome-assembled genomes (MAGs) extracted from twenty three complex rhizosphere metagenomes. We also investigated how more aggressive trimming impacts these binning metrics. Results We found that JGI trimmed and decontamination of input reads had some significant impacts in assembly and binning metrics compared to raw reads, and that differences in placement of MAGs in species trees increased with decreasing completeness and contamination thresholds. More aggressive trimming beyond those used by JGI were found to reduce MAG counts. Conclusions Mild trimming and decontamination of metagenomics reads prior to assembly can change an investigator’s answer to the questions, “Who is there and what are they doing? However, mild trimming and decontamination of metagenomic reads with high quality scores is recommended for those who elect to do so.},
doi = {10.25982/62657.1515/1779219},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Fri Jan 01 04:00:00 UTC 2021},
month = {Fri Jan 01 04:00:00 UTC 2021}
}