skip to main content

SciTech ConnectSciTech Connect

Title: A case study for cloud based high throughput analysis of NGS data using the globus genomics system

Next generation sequencing (NGS) technologies produce massive amounts of data requiring a powerful computational infrastructure, high quality bioinformatics software, and skilled personnel to operate the tools. We present a case study of a practical solution to this data management and analysis challenge that simplifies terabyte scale data handling and provides advanced tools for NGS data analysis. These capabilities are implemented using the “Globus Genomics” system, which is an enhanced Galaxy workflow system made available as a service that offers users the capability to process and transfer data easily, reliably and quickly to address end-to-end NGS analysis requirements. The Globus Genomics system is built on Amazon's cloud computing infrastructure. The system takes advantage of elastic scaling of compute resources to run multiple workflows in parallel and it also helps meet the scale-out analysis needs of modern translational genomics research.
 [1] ;  [2] ;  [1] ;  [3] ;  [2] ;  [2] ;  [2] ;  [2] ;  [1] ;  [1]
  1. Georgetown Univ., Washington, DC (United States). Innovation Center for Biomedical Informatics (ICBI).
  2. Argonne National Lab. (ANL), Argonne, IL (United States); Globus Genomics, Chicago, IL (United States)
  3. Argonne National Lab. (ANL), Argonne, IL (United States)
Publication Date:
OSTI Identifier:
Grant/Contract Number:
Accepted Manuscript
Journal Name:
Computational and Structural Biotechnology Journal
Additional Journal Information:
Journal Volume: 13; Journal Issue: C; Journal ID: ISSN 2001-0370
Research Org:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Org:
Country of Publication:
United States
97 MATHEMATICS AND COMPUTING next generation sequencing; galaxy; cloud computing; translational research