DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: mzapy: An Open-Source Python Library Enabling Efficient Extraction and Processing of Ion Mobility Spectrometry-Mass Spectrometry Data in the MZA File Format

Journal Article · · Analytical Chemistry

We have recently reported MZA, a new and simple mass spectrometry data structure based on the broadly supported HDF5 format and created to facilitate software development. While this format is inherently supportive of application development, the availability of a core library with standard mass spectrometry utilities greatly facilitates fast software development. Here, we present a Python library, mzapy, for efficient extraction and processing of mass spectrometry data in the MZA format. In addition to raw data extraction, mzapy contains supporting utilities enabling tasks including calibration, signal processing, peak finding, and generating plots. Being implemented in pure Python with minimal and largely standardized dependencies makes mzapy uniquely suited to application development in the multi-omics domain. The free and open source mzapy is built with extensibility in mind, and future development will support cloud computing and artificial intelligence/machine learning applications. The software source code is freely available at https://github.com/PNNL-m-q/mzapy.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES). Scientific User Facilities (SUF); USDOE Laboratory Directed Research and Development (LDRD) Program
Grant/Contract Number:
AC05-76RL01830
OSTI ID:
1994861
Report Number(s):
PNNL-SA-177972
Journal Information:
Analytical Chemistry, Vol. 95, Issue 25; ISSN 0003-2700
Publisher:
American Chemical Society (ACS)Copyright Statement
Country of Publication:
United States
Language:
English

References (13)

SciPy 1.0: fundamental algorithms for scientific computing in Python journal February 2020
The arc of Mass Spectrometry Exchange Formats is long, but it bends toward HDF5: PLAIN HDF5 AS A MASS SPECTROMETRY EXCHANGE FORMAT journal October 2016
mz5: Space- and Time-efficient Storage of Mass Spectrometry Data Sets journal January 2012
pymzML v2.0: introducing a highly compressed and seekable gzip format journal January 2018
MZA: A Data Conversion Tool to Facilitate Software Development and Artificial Intelligence Research in Multidimensional Mass Spectrometry journal November 2022
Array programming with NumPy journal September 2020
imzML — A common data format for the flexible exchange and processing of mass spectrometry imaging data journal August 2012
Fast and Efficient XML Data Access for Next-Generation Mass Spectrometry journal April 2015
mzML—a Community Standard for Mass Spectrometry Data journal August 2010
Matplotlib: A 2D Graphics Environment journal January 2007
LipidOz enables automated elucidation of lipid carbon–carbon double bond positions from ozone-induced dissociation mass spectrometry data journal April 2023
Breaking Down Structural Diversity for Comprehensive Prediction of Ion-Neutral Collision Cross Sections journal February 2020
An Interlaboratory Evaluation of Drift Tube Ion Mobility–Mass Spectrometry Collision Cross Section Measurements journal August 2017