U.S. Department of Energy
Office of Scientific and Technical Information

Axolotl: a scalable genomics library based on Apache Spark (Axolotl) v1.0.0

Software · DOI: https://doi.org/10.11578/dc.20240612.1 · OSTI ID: code-128495 · Code ID: 128495
Author affiliation: Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)

Axolotl is a Python library for scalable, distributed analysis of genome and metagenome data. Existing tools and systems are struggling to keep up with the rapid growth of genomic data. Compounding the problem, developing scalable solutions requires climbing a steep learning curve in parallel programming, which is a barrier for academic researchers. Scalable solutions exist for specific tasks, but comprehensive, end-to-end solutions are lacking. Axolotl aims to close this gap. Built on Apache Spark, the library is designed for easy parallel processing: it efficiently handles multiple tasks or large datasets simultaneously and scales up to meet the demands of extensive genomic data analysis.

Short Name / Acronym:
Axolotl v1.0.0
Site Accession Number:
2024-059
Software Type:
Scientific
License(s):
BSD 3-clause "New" or "Revised" License
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE

Primary Award/Contract Number:
AC02-05CH11231
DOE Contract Number:
AC02-05CH11231
Code ID:
128495
OSTI ID:
code-128495
Country of Origin:
United States

Similar Records

SpaRC: scalable sequence clustering using Apache Spark
Journal Article · August 23, 2018 · Bioinformatics · OSTI ID: 1542383

Computational Strategies for Scalable Genomics Analysis
Journal Article · December 5, 2019 · Genes · OSTI ID: 1599823
