Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

CONSTAX2: improved taxonomic classification of environmental DNA markers

Journal Article · · Bioinformatics
Abstract Summary

CONSTAX—the CONSensus TAXonomy classifier—was developed for accurate and reproducible taxonomic annotation of fungal rDNA amplicon sequences and is based upon a consensus approach of RDP, SINTAX and UTAX algorithms. CONSTAX2 extends these features to classify prokaryotes as well as eukaryotes and incorporates BLAST-based classifiers to reduce classification errors. Additionally, CONSTAX2 implements a conda-installable command-line tool with improved classification metrics, faster training, multithreading support, capacity to incorporate external taxonomic databases and new isolate matching and high-level taxonomy tools, replete with documentation and example tutorials.

Availability and implementation

CONSTAX2 is available at https://github.com/liberjul/CONSTAXv2, and is packaged for Linux and MacOS from Bioconda with use under the MIT License. A tutorial and documentation are available at https://constax.readthedocs.io/en/latest/. Data and scripts associated with the manuscript are available at https://github.com/liberjul/CONSTAXv2_ms_code.

Supplementary information

Supplementary data are available at Bioinformatics online.

Research Organization:
Michigan State University, East Lansing, MI (United States); University of Wisconsin, Madison, WI (United States)
Sponsoring Organization:
National Science Foundation (NSF); USDOE; USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
SC0018409
OSTI ID:
1829161
Alternate ID(s):
OSTI ID: 1979482
Journal Information:
Bioinformatics, Journal Name: Bioinformatics Journal Issue: 21 Vol. 37; ISSN 1367-4803
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (14)

Scraping the bottom of the barrel: are rare high throughput sequences artifacts? journal February 2015
25 years of serving the community with ribosomal RNA gene reference databases and tools journal November 2017
High throughput sequencing methods and analysis for microbiome research journal December 2013
UPARSE: highly accurate OTU sequences from microbial amplicon reads journal August 2013
Bioconda: sustainable and comprehensive software distribution for the life sciences journal July 2018
Improved tools for biological sequence comparison. journal April 1988
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs journal September 1997
The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications journal October 2018
Naive Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy journal June 2007
Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities journal October 2009
SPINGO: a rapid species-classifier for microbial amplicon sequences journal October 2015
CONSTAX: a tool for improved taxonomic resolution of environmental fungal ITS sequences journal December 2017
Improved metagenomic analysis with Kraken 2 journal November 2019
Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2’s q2-feature-classifier plugin journal May 2018

Similar Records

VPF-Class: taxonomic assignment and host prediction of uncultivated viruses based on viral protein families
Journal Article · Tue Jan 19 19:00:00 EST 2021 · Bioinformatics · OSTI ID:1810588

Related Subjects