U.S. Department of Energy
Office of Scientific and Technical Information

The standard operating procedure of the DOE-JGI Metagenome Annotation Pipeline (MAP v.4)

Journal Article · Standards in Genomic Sciences

Abstract

The DOE-JGI Metagenome Annotation Pipeline (MAP v.4) performs structural and functional annotation of metagenomic sequences submitted to the Integrated Microbial Genomes with Microbiomes (IMG/M) system for comparative analysis. The pipeline runs on nucleotide sequences provided via the IMG submission site. Users must first define their analysis projects in the Genomes OnLine Database (GOLD) and then submit the associated sequence datasets, which consist of scaffolds/contigs with optional coverage information and/or unassembled reads, in FASTA and FASTQ formats. MAP processing begins with structural annotation: feature prediction that identifies protein-coding genes, non-coding RNAs and regulatory RNAs, and CRISPR elements. Functional annotation follows, assigning protein product names and linking predicted proteins to various protein family databases.
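
As a rough illustration of the submission inputs described above, the following Python sketch reads scaffold/contig records from a FASTA file and guesses whether a file is FASTA or FASTQ from its first character. It is not part of MAP or the IMG submission site; the function names `detect_format` and `read_fasta` and the example record names are hypothetical, and real submissions may be subject to checks not shown here.

```python
import io
from typing import Iterator, TextIO, Tuple


def detect_format(first_line: str) -> str:
    """Guess the file format from the first record marker.

    Assumption: FASTA records start with '>' and FASTQ records with '@'.
    """
    if first_line.startswith(">"):
        return "fasta"
    if first_line.startswith("@"):
        return "fastq"
    raise ValueError("input does not look like FASTA or FASTQ")


def read_fasta(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (record_id, sequence) pairs from a FASTA-formatted text stream."""
    name = None
    chunks = []
    for raw in handle:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            fields = line[1:].split()
            # The record ID is the first whitespace-delimited token of the header.
            name = fields[0] if fields else ""
            chunks = []
        else:
            if name is None:
                raise ValueError("sequence data found before any FASTA header")
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


# Usage with an in-memory example (hypothetical scaffold names).
example = io.StringIO(">scaffold_1 cov=3\nACGT\nGG\n>scaffold_2\nTTAA\n")
records = list(read_fasta(example))
```

Here `records` holds one `(id, sequence)` tuple per scaffold, with multi-line sequences joined into a single string.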

Sponsoring Organization:
USDOE
Grant/Contract Number:
NONE; AC02-05CH11231
OSTI ID:
1618964
Alternate ID(s):
OSTI ID: 1379106
Journal Information:
Standards in Genomic Sciences, Vol. 11, Issue 1; ISSN 1944-3277
Publisher:
Springer Science + Business Media
Country of Publication:
United Kingdom
Language:
English
