An efficient data format for mass spectrometry based proteomics

Shah, Anuj R; Davidson, Jennifer L; Monroe, Matthew E; Mayampurath, Anoop M; Danielson, William F; Shi, Yan; Robinson, Aaron C; Clowers, Brian H; Belov, Mikhail E; Anderson, Gordon A; Smith, Richard D

doi:10.1016/j.jasms.2010.06.014

Title: An efficient data format for mass spectrometry based proteomics

Journal Article · Fri Oct 01 00:00:00 EDT 2010 · Journal of the American Society for Mass Spectrometry, 21(10):1784-1788

DOI:https://doi.org/10.1016/j.jasms.2010.06.014· OSTI ID:1000611

Shah, Anuj R; Davidson, Jennifer L; Monroe, Matthew E; Mayampurath, Anoop M; Danielson, William F; Shi, Yan; Robinson, Aaron C; Clowers, Brian H; Belov, Mikhail E; Anderson, Gordon A; Smith, Richard D

The diverse range of mass spectrometry (MS) instrumentation along with corresponding proprietary and non-proprietary data formats has generated a proteomics community driven call for a standardized format to facilitate management, processing, storing, visualization, and exchange of both experimental and processed data. To date, significant efforts have been extended towards standardizing XML-based formats for mass spectrometry data representation, despite the recognized inefficiencies associated with storing large numeric datasets in XML. The proteomics community has periodically entertained alternate strategies for data exchange, e.g., using a common application programming interface or a database-derived format. However these efforts have yet to garner significant attention, mostly because they haven’t illustrated significant performance benefits over existing standards, but also due to issues such as extensibility to multi-dimensional separation systems, robustness of operation, and incomplete or mismatched vocabulary. Here, we describe a format based on standard database principles that offers multiple benefits over existing formats in terms of storage size, ease of processing, data retrieval times and extensibility to accommodate multi-dimensional separation systems.

Cite

Export

Save

Research Organization:: Pacific Northwest National Lab. (PNNL), Richland, WA (United States)

Sponsoring Organization:: USDOE

DOE Contract Number:: AC05-76RL01830

OSTI ID:: 1000611

Report Number(s):: PNNL-SA-69363; JAMSEF; KP1601010; TRN: US201101%%411

Journal Information:: Journal of the American Society for Mass Spectrometry, 21(10):1784-1788, Vol. 21, Issue 10; ISSN 1044-0305

Country of Publication:: United States

Language:: English

Similar Records

A common open representation of mass spectrometry data and its application to proteomics research

Journal Article · Mon Nov 01 00:00:00 EST 2004 · Nature Biotechnology · OSTI ID:1000611

Pedrioli, Patrick G; Eng, Jimmy K; Hubley, Robert; +21 more

Long-term data archiving

Journal Article · Thu Jan 01 00:00:00 EST 2009 · Analytical and Bioanalytical Chemistry · OSTI ID:1000611

Moore, David Steven

Broadening the horizon – level 2.5 of the HUPO-PSI format for molecular interactions

Journal Article · Tue Oct 09 00:00:00 EDT 2007 · BMC Biology · OSTI ID:1000611

Kerrien, Samuel; Orchard, Sandra; Montecchi-Palazzi, Luisa; +27 more

Related Subjects

99 GENERAL AND MISCELLANEOUS//MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE
MANAGEMENT
MASS SPECTROSCOPY
PERFORMANCE
PROCESSING
PROGRAMMING
STORAGE

Title: An efficient data format for mass spectrometry based proteomics

Citation Formats

Similar Records

Related Subjects