skip to main content

Title: Signatures for Mass Spectrometry Data Quality

Ensuring data quality and proper instrument functionality is a prerequisite for scientific investigation. Manual validation for quality assurance is time consuming, expensive and subjective. Metrics for describing various features of LC-MS data have been developed to assist operators in discriminating poor (out of control) and good (in control) datasets. However, the wide variety of instrument specifications and LC-MS configurations precludes applying a simple range of acceptable values or cutoffs for such metrics. We explored a variety of statistical modeling approaches to predict the quality of LC-MS data. Using 1164 manually classified quality control (QC) LC-MS datasets, we fit logistic regression classification models to the QC data to predict whether a dataset is in or out of control. Model parameters were optimized by minimizing a loss function that accounts for the tradeoff between false positive and false negative errors. The optimal logistic regression classifier models detected bad data sets with high sensitivity (i.e. low false negative rate) while maintaining high specificity (i.e. controlling the false positive rate). As an example, predictions for Velos-Orbitrap instrumentation data had a sensitivity of 93.7% in detecting out of control datasets with a false positive rate of 8.3%. In comparison, we investigated the performance of severalmore » single metrics in predicting dataset quality. While maintaining a sensitivity of 93.7%, the corresponding false positive rates for these single-metric models unacceptably ranged from 32% to 97.7%. Finally, we evaluated the performance of the« less
 [1] ;  [1] ;  [1] ;  [1] ;  [1] ;  [1] ;  [1] ;  [1] ;  [1] ;  [1]
  1. Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Publication Date:
OSTI Identifier:
Report Number(s):
Journal ID: ISSN 1535-3893; 47418; KP1601010
DOE Contract Number:
Resource Type:
Journal Article
Resource Relation:
Journal Name: Journal of Proteome Research; Journal Volume: 13; Journal Issue: 4
American Chemical Society (ACS)
Research Org:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States), Environmental Molecular Sciences Laboratory (EMSL)
Sponsoring Org:
Country of Publication:
United States
37 INORGANIC, ORGANIC, PHYSICAL, AND ANALYTICAL CHEMISTRY Environmental Molecular Sciences Laboratory