---
code_id: 52870
site_ownership_code: "LBNL"
open_source: true
landing_page: "https://sourceforge.net/projects/dibella"
project_type: "ON"
software_type: "S"
official_use_only: {}
developers:
- email: "MEllis@lbl.gov"
  orcid: ""
  first_name: "Marquita"
  last_name: "Ellis"
  middle_name: ""
  affiliations:
  - "Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)"
contributors: []
sponsoring_organizations:
- organization_name: "USDOE"
  funding_identifiers: []
  primary_award: "AC02-05CH11231"
  DOE: true
- organization_name: "Oak Ridge National Laboratory"
  funding_identifiers:
  - identifier_type: "AwardNumber"
    identifier_value: "AWD3408 "
  primary_award: "AWD1896"
  DOE: true
contributing_organizations: []
research_organizations:
- organization_name: "Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United\
    \ States)"
  DOE: true
related_identifiers: []
release_date: "2020-10-22"
software_title: "Distributed Berkeley Efficient Long-Read to Long-Read Aligner and\
  \ Overlapper (DiBELLA) v1.0.0"
acronym: "Distributed Berkeley Efficient Long-Read to Long-Read Aligner and Overlapper"
doi: "https://doi.org/10.11578/dc.20210316.1"
description: "We present a parallel algorithm and scalable implementation for genome\
  \ analysis, specifically the problem of finding overlaps and alignments for data\
  \ from \"third generation\" long read sequencers. While long sequences of DNA offer\
  \ enormous advantages for biological analysis and insight, current long read sequencing\
  \ instruments have high error rates and therefore require different approaches to\
  \ analysis than their short read counterparts. Our work focuses on an efficient\
  \ distributed-memory parallelization of an accurate single-node algorithm for overlapping\
  \ and aligning long reads. We achieve scalability of this irregular algorithm by\
  \ addressing the competing issues of increasing parallelism, minimizing communication,\
  \ constraining the memory footprint, and ensuring good load balance. The resulting\
  \ application, DiBELLA, is the first distributed memory overlapper and aligner specifically\
  \ designed for long reads and parallel scalability."
programming_languages: []
country_of_origin: "United States"
project_keywords: []
licenses:
- "BSD 3-clause \"New\" or \"Revised\" License"
recipient_org: "Lawrence Berkeley National Laboratory (LBNL)"
site_accession_number: "2020-158"
file_name: "dibella-code-08e23505dc060ba15dd2ca3e8185dadc03bc7236.zip"
date_record_added: "2021-03-16"
date_record_updated: "2021-03-16"
is_file_certified: false
is_limited: false
links:
- rel: "citation"
  href: "https://www.osti.gov/doecode/biblio/52870"
