DOE PAGES · U.S. Department of Energy
Office of Scientific and Technical Information

Title: Protein Sequence Annotation Tool (PSAT): a centralized web-based meta-server for high-throughput sequence annotations

Journal Article · BMC Bioinformatics

Abstract

Background: Here we introduce the Protein Sequence Annotation Tool (PSAT), a web-based sequence annotation meta-server for performing integrated, high-throughput, genome-wide sequence analyses. Our goals in building PSAT were to (1) create an extensible platform for integrating multiple sequence-based bioinformatics tools, (2) enable functional annotations and enzyme predictions over large input protein FASTA data sets, and (3) provide a web interface for convenient execution of the tools.

Results: In this paper, we demonstrate the utility of PSAT by annotating the predicted peptide gene products of Herbaspirillum sp. strain RV1423, importing the PSAT results into EC2KEGG, and using the resulting functional comparisons to identify a putative catabolic pathway, thereby distinguishing RV1423 from a well-annotated Herbaspirillum species. This analysis demonstrates that high-throughput enzyme predictions, provided by PSAT processing, can be used to identify metabolic potential in an otherwise poorly annotated genome.

Conclusions: PSAT is a meta-server that combines the results from several sequence-based annotation and function prediction codes, and is available at http://psat.llnl.gov/psat/ . PSAT stands apart from other sequence-based genome annotation systems in providing a high-throughput platform for rapid de novo enzyme predictions and sequence annotations over large input protein sequence data sets in FASTA format. PSAT is most appropriately applied to the annotation of large protein FASTA sets that may or may not be associated with a single genome.

Sponsoring Organization:
USDOE
OSTI ID:
1618521
Journal Information:
Journal Name: BMC Bioinformatics; Journal Volume: 17; Journal Issue: 1; ISSN 1471-2105
Publisher:
Springer Science + Business Media
Country of Publication:
United Kingdom
Language:
English
