OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: MetaBAT

Software · OSTI ID: 1231763

Assembling individual genomes from shotgun metagenomic sequences derived from complex microbial communities is so far one of the most challenging problems in bioinformatics. Because it is impractical to assemble full-length genomes directly, a first step called metagenome binning, which groups contigs from the same organism, has been developed to provide insights into individual organisms. However, current binning methods perform poorly on large, complex communities, and as a result they fail to recover many novel genomes. To overcome this limitation, we developed integrated software, called MetaBAT, which automatically forms hundreds of individual genome bins from metagenome contigs. Probabilistic models of abundance and tetranucleotide frequency were trained through extensive empirical studies and integrated to decide the membership of contigs iteratively. To test its performance, we applied MetaBAT to both synthetic and several large-scale real-world metagenome datasets. Using two independent metrics, we demonstrate that in all the datasets tested MetaBAT achieves good sensitivity (16–87%) and very high specificity (56–99%) in forming genome bins. Further analyses of the novel genomes recovered from the human gut microbiome suggest that a subset of these genomes is potentially associated with pathological conditions. In conclusion, we believe MetaBAT is a powerful tool
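To make the tetranucleotide frequency signal mentioned in the abstract concrete, the following is a minimal Python sketch of how such a profile might be computed for a single contig. It is an illustration only and is not taken from MetaBAT: the function name `tetranucleotide_frequencies` is hypothetical, and the choices to skip windows containing ambiguous bases and to count only the forward strand are assumptions made for simplicity.

```python
from collections import Counter
from itertools import product


def tetranucleotide_frequencies(sequence: str) -> dict[str, float]:
    """Return the relative frequency of each of the 256 possible 4-mers.

    Hypothetical helper for illustration; not MetaBAT's implementation.
    """
    seq = sequence.upper()
    # All 4^4 = 256 tetranucleotides over the alphabet A, C, G, T.
    all_kmers = ["".join(p) for p in product("ACGT", repeat=4)]
    valid_bases = set("ACGT")

    counts = Counter()
    # Slide a window of width 4 across the contig, one base at a time.
    for i in range(len(seq) - 3):
        kmer = seq[i:i + 4]
        # Assumption: windows with ambiguous bases (e.g. N) are skipped.
        if set(kmer) <= valid_bases:
            counts[kmer] += 1

    total = sum(counts.values())
    # Normalize counts so the profile sums to 1 (all zeros if no valid window).
    return {k: (counts[k] / total if total else 0.0) for k in all_kmers}


if __name__ == "__main__":
    profile = tetranucleotide_frequencies("ACGTACGT")
    print(profile["ACGT"])
```

Contigs from the same genome tend to have similar profiles, which is why a binning method can use the distance between two such vectors as one line of evidence. A fuller treatment would typically also merge each tetranucleotide with its reverse complement, which this sketch does not do.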

Short Name / Acronym:
METABAT; 003006MLTPL00
Site Accession Number:
2014-075
Version:
00
Programming Language(s):
Medium: X; OS: Linux, Mac, Windows; Compatibility: Multi-Platform
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC02-05CH11231
OSTI ID:
1231763
Country of Origin:
United States
