Combined Statistical Analyses of Peptide Intensities and Peptide Occurrences Improves Identification of Significant Peptides from MS-based Proteomics Data
Journal Article · Journal of Proteome Research, 9(11):5748-5756
Liquid chromatography-mass spectrometry-based (LC-MS) proteomics uses peak intensities of proteolytic peptides to infer the differential abundance of peptides/proteins. However, substantial run-to-run variability in peptide intensities and observations (presence/absence) of peptides makes data analysis quite challenging. The missing abundance values in LC-MS proteomics data are difficult to address with traditional imputation-based approaches because the mechanisms by which data are missing are unknown a priori. Data can be missing due to random mechanisms such as experimental error, or non-random mechanisms such as a true biological effect. We present a statistical approach that uses a test of independence known as a G-test to test the null hypothesis of independence between the number of missing values and the experimental groups. We pair the G-test results evaluating independence of missing data (IMD) with a standard analysis of variance (ANOVA) that uses only means and variances computed from the observed data. Each peptide is therefore represented by two statistical confidence metrics, one for qualitative differential observation and one for quantitative differential intensity. We use two simulated and two real LC-MS datasets to demonstrate the robustness and sensitivity of the ANOVA-IMD approach for assigning confidence to peptides with significant differential abundance among experimental groups.
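As a rough illustration of the independence-of-missing-data idea described above, the sketch below computes a G statistic for one peptide from a 2 x k table of observed versus missing counts per experimental group. It is a minimal sketch, not the authors' implementation: the function names `g_statistic` and `p_value_two_groups`, the example counts, and the choice to report a p-value only for the two-group case (one degree of freedom) are all assumptions made for illustration.

```python
import math


def g_statistic(present, absent):
    """Return (G, degrees of freedom) for a 2 x k table.

    present[j] / absent[j] are the numbers of runs in group j where the
    peptide was observed / missing. Hypothetical helper for illustration.
    """
    k = len(present)
    table = [list(present), list(absent)]
    row_totals = [sum(row) for row in table]
    col_totals = [present[j] + absent[j] for j in range(k)]
    n = sum(row_totals)

    g = 0.0
    for i in range(2):
        for j in range(k):
            obs = table[i][j]
            if obs > 0:  # cells with zero count contribute nothing to G
                expected = row_totals[i] * col_totals[j] / n
                g += 2.0 * obs * math.log(obs / expected)
    return g, k - 1


def p_value_two_groups(g):
    """Chi-square upper-tail probability for 1 degree of freedom.

    Valid only when there are exactly two groups (assumption of this sketch).
    """
    return math.erfc(math.sqrt(max(g, 0.0) / 2.0))


# Made-up counts: peptide seen in all 5 runs of group A, none of 5 in group B.
g, df = g_statistic(present=[5, 0], absent=[0, 5])
p = p_value_two_groups(g)
print(f"G = {g:.3f}, df = {df}, p = {p:.2g}")
```

A small p-value here suggests that missingness depends on the experimental group, which is the qualitative signal the abstract pairs with the ANOVA on observed intensities. For more than two groups, a general chi-square tail function would be needed in place of `p_value_two_groups`.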
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (US), Environmental Molecular Sciences Laboratory (EMSL)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1000135
- Report Number(s):
- PNNL-SA-72886; 33200; 33706; 30796; 600306000
- Journal Information:
- Journal Name: Journal of Proteome Research; Journal Volume: 9; Journal Issue: 11; Pages: 5748-5756; ISSN 1535-3893; ISSN 1535-3907
- Country of Publication:
- United States
- Language:
- English
Similar Records
Improved Quality Control Processing of Peptide-centric LC-MS Proteomics Data
Journal Article · Sep 20, 2011 · Bioinformatics · OSTI ID:1031421
A statistical framework for protein quantitation in bottom-up MS-based proteomics
Journal Article · Aug 15, 2009 · Bioinformatics, 25(16):2028-2034 · OSTI ID:965993
Detecting differential protein abundance by combining peptide level P-values
Journal Article · Sep 11, 2020 · Molecular Omics · OSTI ID:1752963