skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Comparative Omics-Driven Genome Annotation Refinement: Application across Yersiniae

Journal Article · · PLoS One

Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. To date, the perceived value of manual curation for genome annotations is not offset by the real cost and time associated with the process. In order to balance the large number of sequences generated, the annotation process is now performed almost exclusively in an automated fashion for most genome sequencing projects. One possible way to reduce errors inherent to automated computational annotations is to apply data from 'omics' measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. This approach does require additional experimental and bioinformatics methods to include omics technologies; however, the approach is readily automatable and can benefit from rapid developments occurring in those research domains as well. The annotation process can be improved by experimental validation of transcription and translation and aid in the discovery of annotation errors. Here the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species, as is becoming common in sequencing efforts. Transcriptomic and proteomic data derived from three highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 previously incorrect protein-coding sequences (e.g., observed frameshifts, extended start sites, and translated pseudogenes) within the three current Yersinia genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent pathogen, thus the discovery of many translated pseudogenes underscores a need for functional analyses to investigate hypotheses related to divergence. Refinements included the discovery of a seemingly essential ribosomal protein, several virulence-associated factors, and a transcriptional regulator, among other proteins, most of which are annotated as hypothetical, that were missed during annotation.

Research Organization:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States). Environmental Molecular Sciences Lab. (EMSL)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1038637
Report Number(s):
PNNL-SA-79961; 33210; 400412000; 400412000; TRN: US201208%%729
Journal Information:
PLoS One, Vol. 7, Issue 3
Country of Publication:
United States
Language:
English

Similar Records

A multi-omic systems approach to elucidating Yersinia virulence mechanisms
Journal Article · Tue Jan 01 00:00:00 EST 2013 · Molecular Biosystems · OSTI ID:1038637

Insights into the evolution of Yersinia pestis through whole-genome comparison with Yersinia pseudotuberculosis
Journal Article · Wed Sep 01 00:00:00 EDT 2004 · Proceedings of the National Academy of Sciences of the United States of America · OSTI ID:1038637

Insights into the genome evolution of Yersinia pestis through whole genome comparison with Yersinia pseudotuberculosis
Journal Article · Sat Jan 24 00:00:00 EST 2004 · Proceedings of the National Academy of Sciences of the United States of America · OSTI ID:1038637