DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: CONSTAX2: improved taxonomic classification of environmental DNA markers

Abstract

Abstract Summary CONSTAX—the CONSensus TAXonomy classifier—was developed for accurate and reproducible taxonomic annotation of fungal rDNA amplicon sequences and is based upon a consensus approach of RDP, SINTAX and UTAX algorithms. CONSTAX2 extends these features to classify prokaryotes as well as eukaryotes and incorporates BLAST-based classifiers to reduce classification errors. Additionally, CONSTAX2 implements a conda-installable command-line tool with improved classification metrics, faster training, multithreading support, capacity to incorporate external taxonomic databases and new isolate matching and high-level taxonomy tools, replete with documentation and example tutorials. Availability and implementation CONSTAX2 is available at https://github.com/liberjul/CONSTAXv2, and is packaged for Linux and MacOS from Bioconda with use under the MIT License. A tutorial and documentation are available at https://constax.readthedocs.io/en/latest/. Data and scripts associated with the manuscript are available at https://github.com/liberjul/CONSTAXv2_ms_code. Supplementary information Supplementary data are available at Bioinformatics online.

Authors:
ORCiD logo; ; ;
Publication Date:
Research Org.:
Univ. of Wisconsin, Madison, WI (United States); Michigan State Univ., East Lansing, MI (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER); National Science Foundation (NSF)
OSTI Identifier:
1829161
Alternate Identifier(s):
OSTI ID: 1979482
Grant/Contract Number:  
SC0018409; DEB 1737898
Resource Type:
Published Article
Journal Name:
Bioinformatics
Additional Journal Information:
Journal Name: Bioinformatics Journal Volume: 37 Journal Issue: 21; Journal ID: ISSN 1367-4803
Publisher:
Oxford University Press
Country of Publication:
United Kingdom
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES

Citation Formats

Liber, Julian A., Bonito, Gregory, Benucci, Gian Maria Niccolò, and Marschall, ed., Tobias. CONSTAX2: improved taxonomic classification of environmental DNA markers. United Kingdom: N. p., 2021. Web. doi:10.1093/bioinformatics/btab347.
Liber, Julian A., Bonito, Gregory, Benucci, Gian Maria Niccolò, & Marschall, ed., Tobias. CONSTAX2: improved taxonomic classification of environmental DNA markers. United Kingdom. https://doi.org/10.1093/bioinformatics/btab347
Liber, Julian A., Bonito, Gregory, Benucci, Gian Maria Niccolò, and Marschall, ed., Tobias. Fri . "CONSTAX2: improved taxonomic classification of environmental DNA markers". United Kingdom. https://doi.org/10.1093/bioinformatics/btab347.
@article{osti_1829161,
title = {CONSTAX2: improved taxonomic classification of environmental DNA markers},
author = {Liber, Julian A. and Bonito, Gregory and Benucci, Gian Maria Niccolò and Marschall, ed., Tobias},
abstractNote = {Abstract Summary CONSTAX—the CONSensus TAXonomy classifier—was developed for accurate and reproducible taxonomic annotation of fungal rDNA amplicon sequences and is based upon a consensus approach of RDP, SINTAX and UTAX algorithms. CONSTAX2 extends these features to classify prokaryotes as well as eukaryotes and incorporates BLAST-based classifiers to reduce classification errors. Additionally, CONSTAX2 implements a conda-installable command-line tool with improved classification metrics, faster training, multithreading support, capacity to incorporate external taxonomic databases and new isolate matching and high-level taxonomy tools, replete with documentation and example tutorials. Availability and implementation CONSTAX2 is available at https://github.com/liberjul/CONSTAXv2, and is packaged for Linux and MacOS from Bioconda with use under the MIT License. A tutorial and documentation are available at https://constax.readthedocs.io/en/latest/. Data and scripts associated with the manuscript are available at https://github.com/liberjul/CONSTAXv2_ms_code. Supplementary information Supplementary data are available at Bioinformatics online.},
doi = {10.1093/bioinformatics/btab347},
journal = {Bioinformatics},
number = 21,
volume = 37,
place = {United Kingdom},
year = {Fri May 07 00:00:00 EDT 2021},
month = {Fri May 07 00:00:00 EDT 2021}
}

Works referenced in this record:

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
journal, September 1997

  • Altschul, Stephen F.; Madden, Thomas L.; Schäffer, Alejandro A.
  • Nucleic Acids Research, Vol. 25, Issue 17, p. 3389-3402
  • DOI: 10.1093/nar/25.17.3389

CONSTAX: a tool for improved taxonomic resolution of environmental fungal ITS sequences
journal, December 2017

  • Gdanetz, Kristi; Benucci, Gian Maria Niccolò; Vande Pol, Natalie
  • BMC Bioinformatics, Vol. 18, Issue 1
  • DOI: 10.1186/s12859-017-1952-x

Naive Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy
journal, June 2007

  • Wang, Q.; Garrity, G. M.; Tiedje, J. M.
  • Applied and Environmental Microbiology, Vol. 73, Issue 16
  • DOI: 10.1128/AEM.00062-07

Bioconda: sustainable and comprehensive software distribution for the life sciences
journal, July 2018


High throughput sequencing methods and analysis for microbiome research
journal, December 2013

  • Di Bella, Julia M.; Bao, Yige; Gloor, Gregory B.
  • Journal of Microbiological Methods, Vol. 95, Issue 3
  • DOI: 10.1016/j.mimet.2013.08.011

Improved metagenomic analysis with Kraken 2
journal, November 2019


SPINGO: a rapid species-classifier for microbial amplicon sequences
journal, October 2015


The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications
journal, October 2018

  • Nilsson, Rolf Henrik; Larsson, Karl-Henrik; Taylor, Andy F. S.
  • Nucleic Acids Research, Vol. 47, Issue D1
  • DOI: 10.1093/nar/gky1022

Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2’s q2-feature-classifier plugin
journal, May 2018


Scraping the bottom of the barrel: are rare high throughput sequences artifacts?
journal, February 2015


Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities
journal, October 2009

  • Schloss, P. D.; Westcott, S. L.; Ryabin, T.
  • Applied and Environmental Microbiology, Vol. 75, Issue 23, p. 7537-7541
  • DOI: 10.1128/AEM.01541-09

25 years of serving the community with ribosomal RNA gene reference databases and tools
journal, November 2017


Improved tools for biological sequence comparison.
journal, April 1988

  • Pearson, W. R.; Lipman, D. J.
  • Proceedings of the National Academy of Sciences, Vol. 85, Issue 8, p. 2444-2448
  • DOI: 10.1073/pnas.85.8.2444

UPARSE: highly accurate OTU sequences from microbial amplicon reads
journal, August 2013