Title: MannDB – A microbial database of automated protein sequence analyses and evidence integration for protein characterization

Journal Article · BMC Bioinformatics
 [1];  [1];  [1];  [1];  [2];  [1];  [1];  [1]
  1. Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States). Pathogen Bio-informatics
  2. Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States). Virginia Bioinformatics Inst.

Background: MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data.
Description: MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-source tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from GenBank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the GenBank entry or when identical sequences are identified. Currently, 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO.
Conclusion: MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high-priority agents on the websites of several governmental organizations concerned with bio-terrorism. MannDB provides the user with a BLAST interface for comparison of native and non-native sequences and a query tool for conveniently selecting proteins of interest. In addition, the user has access to a web-based browser that compiles comprehensive and extensive reports. Access to MannDB is freely available at http://manndb.llnl.gov/.
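
The abstract describes a relational database that stores proteins, attaches results from many analysis tools to each protein, and supports structured and free-text search. A minimal sketch of that idea follows. It is an illustration only: the table names, column names, tool labels, accession, and helper functions are all hypothetical and are not taken from MannDB's actual schema or code, and an in-memory SQLite database stands in for whatever database system MannDB uses.

```python
import sqlite3

# Hypothetical schema: every table and column name here is an assumption
# made for illustration, not MannDB's real design.
SCHEMA = """
CREATE TABLE protein (
    protein_id INTEGER PRIMARY KEY,
    accession  TEXT UNIQUE NOT NULL,
    organism   TEXT NOT NULL,
    sequence   TEXT NOT NULL
);
CREATE TABLE analysis (
    analysis_id INTEGER PRIMARY KEY,
    protein_id  INTEGER NOT NULL REFERENCES protein(protein_id),
    tool        TEXT NOT NULL,   -- name of the tool that produced the result
    category    TEXT NOT NULL,   -- e.g. 'motifs', 'secondary structure'
    result      TEXT NOT NULL    -- free-text summary of the prediction
);
"""


def build_db():
    """Create an in-memory database with the illustrative schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def add_protein(conn, accession, organism, sequence):
    """Insert one protein record and return its primary key."""
    cur = conn.execute(
        "INSERT INTO protein (accession, organism, sequence) VALUES (?, ?, ?)",
        (accession, organism, sequence),
    )
    return cur.lastrowid


def add_analysis(conn, protein_id, tool, category, result):
    """Attach one tool result to a protein."""
    conn.execute(
        "INSERT INTO analysis (protein_id, tool, category, result) "
        "VALUES (?, ?, ?, ?)",
        (protein_id, tool, category, result),
    )


def find_proteins(conn, category, text):
    """Structured filter on category plus a free-text match on the result."""
    rows = conn.execute(
        "SELECT p.accession, a.tool, a.result FROM protein p "
        "JOIN analysis a ON a.protein_id = p.protein_id "
        "WHERE a.category = ? AND a.result LIKE ? "
        "ORDER BY p.accession, a.tool",
        (category, f"%{text}%"),
    )
    return rows.fetchall()
```

In use, one would call `build_db()`, load each parsed proteome with `add_protein`, record each tool's output with `add_analysis`, and then query with something like `find_proteins(conn, "motifs", "signal")`. Keeping one row per protein, tool, and result is one simple way to let new tools be added without changing the schema.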

Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division; US Department of Homeland Security (DHS)
Grant/Contract Number:
AC52-07NA27344; W-7405-ENG-48
OSTI ID:
1626328
Journal Information:
BMC Bioinformatics, Vol. 7, Issue 1; ISSN 1471-2105
Publisher:
BioMed Central
Country of Publication:
United States
Language:
English
