Multi-species Identification of Polymorphic Peptide Variants via Propagation in Spectral Networks
The spectral networks approach enables the detection of pairs of spectra from related peptides and thus allows for the propagation of annotations from identified peptides to unidentified spectra. Beyond allowing for unbiased discovery of unexpected post-translational modifications, spectral networks are also applicable to multi-species comparative proteomics or metaproteomics to identify numerous orthologous versions of a protein. We present algorithmic and statistical advances in spectral networks that have made it possible to rigorously assess the statistical significance of spectral pairs and accurately estimate the error rate of identifications via propagation. In the analysis of three related Cyanothece species, a model organism for biohydrogen production, spectral networks identified peptides with highly divergent sequences with up to dozens of variants per peptide, including many novel peptides in species that lack a sequenced genome. Furthermore, spectral networks strongly suggested the presence of novel peptides even in genomically characterized species (i.e. missing from databases) in that a significant portion of unidentified multi-species networks included at least two polymorphic peptide variants.
- Research Organization:
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States). Environmental Molecular Sciences Lab. (EMSL)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1340839
- Report Number(s):
- PNNL-SA-110651; 48582; KP1601010
- Journal Information:
- Molecular and Cellular Proteomics, Vol. 15, Issue 11; ISSN 1535-9476
- Publisher:
- American Society for Biochemistry and Molecular Biology
- Country of Publication:
- United States
- Language:
- English
Similar Records
Protein-based forensic identification using genetically variant peptides in human bone
Constructing the Nitrogen Flux Maps (NFMs) of Plants