Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A robust linear regression based algorithm for automated evaluation of peptide identifications from shotgun proteomics by use of reversed-phase liquid chromatography retention time

Journal Article · · BMC Bioinformatics
 [1];  [2];  [3]
  1. The Ohio State Univ., Columbus, OH (United States). Comprehensive Cancer Center. Dept. of Molecular Virology Immunology and Medical Genetics; DOE/OSTI
  2. The Ohio State Univ., Columbus, OH (United States). Dept. of Chemistry
  3. The Ohio State Univ., Columbus, OH (United States). Comprehensive Cancer Center. Dept. of Molecular Virology Immunology and Medical Genetics

Background: Rejection of false positive peptide matches in database searches of shotgun proteomic experimental data is highly desirable. Several methods have been developed to use the peptide retention time as to refine and improve peptide identifications from database search algorithms. This report describes the implementation of an automated approach to reduce false positives and validate peptide matches. Results: A robust linear regression based algorithm was developed to automate the evaluation of peptide identifications obtained from shotgun proteomic experiments. The algorithm scores peptides based on their predicted and observed reversed-phase liquid chromatography retention times. The robust algorithm does not require internal or external peptide standards to train or calibrate the linear regression model used for peptide retention time prediction. The algorithm is generic and can be incorporated into any database search program to perform automated evaluation of the candidate peptide matches based on their retention times. It provides a statistical score for each peptide match based on its retention time. Conclusion: Analysis of peptide matches where the retention time score was included resulted in a significant reduction of false positive matches with little effect on the number of true positives. Overall higher sensitivities and specificities were achieved for database searches carried out with MassMatrix, Mascot and X!Tandem after implementation of the retention time based score algorithm.

Research Organization:
Battelle Memorial Institute, Columbus, OH (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
Grant/Contract Number:
AC05-76RL01830
OSTI ID:
1626360
Journal Information:
BMC Bioinformatics, Journal Name: BMC Bioinformatics Journal Issue: 1 Vol. 9; ISSN 1471-2105
Publisher:
BioMed CentralCopyright Statement
Country of Publication:
United States
Language:
English

References (46)

An evaluation, comparison, and accurate benchmarking of several publicly available MS/MS search algorithms: Sensitivity and specificity analysis journal August 2005
The isolation of peptides by high-performance liquid chromatography using predicted elution positions journal July 1982
Prediction of peptide retention times in reversed-phase high-performance liquid chromatography I. Determination of retention coefficients of amino acid residues of model synthetic peptides journal January 1986
Prediction of peptide retention times in reversed-phase high-performance liquid chromatography II. Correlation of observed and predicted peptide retention times factors and influencing the retention times of peptides journal January 1986
Differential expression of histone post-translational modifications in acute myeloid and chronic lymphocytic leukemia determined by high-pressure liquid chromatography and mass spectrometry journal January 2004
The Utility of Accurate Mass and LC Elution Time Information in the Analysis of Complex Proteomes journal August 2005
Protein identification by liquid chromatography–mass spectrometry using retention time prediction journal April 2004
Liquid chromatography mass spectrometry profiling of histones journal May 2007
Factors affecting retention and resolution of peptides in high-performance liquid chromatography journal June 1981
Effect of peptide chain length on peptide retention behaviour in reversed-phase chromatogrphy journal December 1988
Prediction of peptide retention times journal January 1988
Prediction of peptide retention times in reversed-phases high-performance liquid chromatography during linear gradient elution journal May 1982
Analysis, statistical validation and dissemination of large-scale proteomics datasets generated by tandem MS journal February 2004
Use of Artificial Neural Networks for the Accurate Prediction of Peptide Liquid Chromatography Elution Times in Proteome Analyses journal March 2003
Prediction of Chromatographic Retention and Protein Identification in Liquid Chromatography/Mass Spectrometry journal November 2002
Robust Algorithm for Alignment of Liquid Chromatography−Mass Spectrometry Analyses in an Accurate Mass and Time Tag Data Analysis Pipeline journal November 2006
Improved Peptide Elution Time Prediction for Reversed-Phase Liquid Chromatography-MS by Incorporating Peptide Sequence Information journal July 2006
Use of Peptide Retention Time Prediction for Protein Identification by off-line Reversed-Phase HPLC−MALDI MS/MS journal September 2006
Liquid Chromatography at Critical Conditions:  Comprehensive Approach to Sequence-Dependent Retention Time Prediction journal November 2006
Histone-specific acetyltransferases from calf thymus. Isolation, properties, and substrate specificity of three different enzymes journal March 1980
Prediction of Peptide Retention at Different HPLC Conditions from Multiple Linear Regression Models journal February 2005
Application of Peptide LC Retention Time Information in a Discriminant Function for Peptide Identification by Tandem Mass Spectrometry journal July 2004
Prediction of Error Associated with False-Positive Rate Determination for Peptide Identification in Large-Scale Proteomics Experiments Using a Combined Reverse and Forward Peptide Sequence Database Strategy journal November 2006
Monte Carlo Simulation-Based Algorithms for Analysis of Shotgun Proteomic Data journal June 2008
Phosphate dysregulation via the XPR1–KIDINS220 protein complex is a therapeutic vulnerability in ovarian cancer journal April 2022
Prediction of peptide retention times in high-pressure liquid chromatography on the basis of amino acid composition journal March 1980
Prediction of Peptide Retention in RP-LC journal October 2005
Probability-based protein identification by searching sequence databases using mass spectrometry data journal December 1999
Informatics for peptide retention properties in proteomic LC-MS journal February 2008
The isolation of peptides by high-performance liquid chromatography using predicted elution positions journal July 1982
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database journal November 1994
Factors affecting retention and resolution of peptides in high-performance liquid chromatography journal June 1981
Effect of peptide chain length on peptide retention behaviour in reversed-phase chromatogrphy journal December 1988
Prediction of peptide retention times journal January 1988
Prediction of peptide retention times in reversed-phases high-performance liquid chromatography during linear gradient elution journal May 1982
Analysis, statistical validation and dissemination of large-scale proteomics datasets generated by tandem MS journal February 2004
Requirements for prediction of peptide retention time in reversed-phase high-performance liquid chromatography: Hydrophilicity/hydrophobicity of side-chains at the N- and C-termini of peptides are dramatically affected by the end-groups and location journal February 2007
Improving Tandem Mass Spectrum Identification Using Peptide Retention Time Prediction across Diverse Chromatography Conditions journal July 2007
Open Source System for Analyzing, Validating, and Storing Protein Identification Data journal December 2004
A Platform for Accurate Mass and Time Analyses of Mass Spectrometry Data journal June 2007
Mass spectrometry-based proteomics journal March 2003
The need for a public proteomics repository journal April 2004
Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry journal February 2007
Large-scale database searching using tandem mass spectra: Looking up the answer in the back of the book journal November 2004
An Improved Model for Prediction of Retention Times of Tryptic Peptides in Ion Pair Reversed-phase HPLC journal September 2004
A mass accuracy sensitive probability based scoring algorithm for database searching of tandem mass spectrometry data journal April 2007

Cited By (7)

Chemical Modifications in Aggregates of Recombinant Human Insulin Induced by Metal-Catalyzed Oxidation: Covalent Cross-Linking via Michael Addition to Tyrosine Oxidation Products journal May 2012
Comparative cross-linking and mass spectrometry of an intact F-type ATPase suggest a role for phosphorylation journal June 2013
Vesicle-based secretion in schistosomes: Analysis of protein and microRNA (miRNA) content of exosome-like vesicles derived from Schistosoma mansoni journal February 2018
In-silico prediction of disorder content using hybrid sequence representation journal June 2011
The proteomic and metabolomic characterization of exercise-induced sweat for human performance monitoring: A pilot investigation journal November 2018
Prediction of Gene Expression Patterns With Generalized Linear Regression Model journal March 2019
Identification of replication-dependent and replication-independent linker histone complexes: Tpr specifically promotes replication-dependent linker histone stability journal October 2016

Figures / Tables (9)