Axolotl: a scalable genomics library based on Apache Spark (Axolotl) v1.0.0

RESOURCE

Abstract

Axolotl is a Python library for scalable distributed genome and metagenome data analysis. Existing tools and systems that we rely on are struggling to keep up with the rapid explosion of genomic data. Compounding this issue, developing scalable solutions require a steep learning curve in parallel programming, which presents a barrier to academic researchers. While we do have scalable solutions for specific tasks, we lack comprehensive, end-to-end solutions. It's this gap in our toolkit that we aim to address with Axolotl. The Axolotl library is built for easy parallel processing, efficiently handling multiple tasks or large datasets simultaneously, and scaling up to meet the demands of extensive genomic data analysis.
Developers:
Wang, Zhong [1] Foster, Bryce [1] Ho, Harrison [1] Kautsar, Satri [1]
  1. Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Release Date:
2024-04-17
Project Type:
Open Source, Publicly Available Repository
Software Type:
Scientific
Licenses:
BSD 3-clause "New" or "Revised" License
Sponsoring Org.:
Code ID:
128495
Site Accession Number:
2024-059
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Country of Origin:
United States

RESOURCE

Citation Formats

Wang, Zhong, Foster, Bryce, Ho, Harrison, and Kautsar, Satri. Axolotl: a scalable genomics library based on Apache Spark (Axolotl) v1.0.0. Computer Software. https://github.com/JGI-Bioinformatics/axolotl. USDOE. 17 Apr. 2024. Web. doi:10.11578/dc.20240612.1.
Wang, Zhong, Foster, Bryce, Ho, Harrison, & Kautsar, Satri. (2024, April 17). Axolotl: a scalable genomics library based on Apache Spark (Axolotl) v1.0.0. [Computer software]. https://github.com/JGI-Bioinformatics/axolotl. https://doi.org/10.11578/dc.20240612.1.
Wang, Zhong, Foster, Bryce, Ho, Harrison, and Kautsar, Satri. "Axolotl: a scalable genomics library based on Apache Spark (Axolotl) v1.0.0." Computer software. April 17, 2024. https://github.com/JGI-Bioinformatics/axolotl. https://doi.org/10.11578/dc.20240612.1.
@misc{ doecode_128495,
title = {Axolotl: a scalable genomics library based on Apache Spark (Axolotl) v1.0.0},
author = {Wang, Zhong and Foster, Bryce and Ho, Harrison and Kautsar, Satri},
abstractNote = {Axolotl is a Python library for scalable distributed genome and metagenome data analysis. Existing tools and systems that we rely on are struggling to keep up with the rapid explosion of genomic data. Compounding this issue, developing scalable solutions require a steep learning curve in parallel programming, which presents a barrier to academic researchers. While we do have scalable solutions for specific tasks, we lack comprehensive, end-to-end solutions. It's this gap in our toolkit that we aim to address with Axolotl. The Axolotl library is built for easy parallel processing, efficiently handling multiple tasks or large datasets simultaneously, and scaling up to meet the demands of extensive genomic data analysis.},
doi = {10.11578/dc.20240612.1},
url = {https://doi.org/10.11578/dc.20240612.1},
howpublished = {[Computer Software] \url{https://doi.org/10.11578/dc.20240612.1}},
year = {2024},
month = {apr}
}