U.S. Department of Energy
Office of Scientific and Technical Information

Architecture and method for a burst buffer using flash technology

Patent · OSTI ID: 1243041

A parallel supercomputing cluster includes compute nodes interconnected in a mesh of data links for executing an MPI job; solid-state storage nodes, each linked to a respective group of the compute nodes to receive checkpoint data from that group; and magnetic disk storage linked to each of the solid-state storage nodes, to which the checkpoint data is migrated asynchronously. Each solid-state storage node presents a file system interface to the MPI job. Multiple MPI processes of the MPI job write the checkpoint data to a shared file in the solid-state storage in a strided fashion. The solid-state storage node then asynchronously migrates the checkpoint data from the shared file to the magnetic disk storage, writing it to the magnetic disk storage in a sequential fashion.
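
The write and drain pattern in the abstract can be illustrated with a small sketch. It is not the patented implementation: threads stand in for MPI processes, two temporary files stand in for the solid-state and magnetic tiers, and the names `strided_offset`, `write_checkpoint`, `migrate_sequentially`, and `run_demo` are hypothetical. The offset formula is an assumption based on the common round-robin layout for shared-file checkpoints, which the abstract does not spell out.

```python
import os
import tempfile
import threading

BLOCK = 4  # bytes per block; tiny and hypothetical, chosen for readability


def strided_offset(rank, block_index, num_ranks, block_size=BLOCK):
    """Byte offset of a rank's block in the shared file (assumed round-robin layout)."""
    return (block_index * num_ranks + rank) * block_size


def write_checkpoint(path, rank, num_ranks, blocks):
    """One simulated MPI process writes its blocks at strided offsets."""
    with open(path, "r+b") as f:
        for i, block in enumerate(blocks):
            f.seek(strided_offset(rank, i, num_ranks))
            f.write(block)


def migrate_sequentially(src, dst, chunk=8):
    """Copy the shared file front to back, standing in for the drain to disk."""
    with open(src, "rb") as fin, open(dst, "wb") as fout:
        while True:
            data = fin.read(chunk)
            if not data:
                break
            fout.write(data)


def run_demo(num_ranks=3, blocks_per_rank=2):
    tmp = tempfile.mkdtemp()
    flash = os.path.join(tmp, "flash_shared.ckpt")  # stand-in for the solid-state tier
    disk = os.path.join(tmp, "disk.ckpt")           # stand-in for the magnetic tier

    # Preallocate the shared file so every rank can open it for in-place writes.
    with open(flash, "wb") as f:
        f.truncate(num_ranks * blocks_per_rank * BLOCK)

    # Each "process" fills its blocks with its own letter: A for rank 0, B for rank 1, ...
    writers = []
    for rank in range(num_ranks):
        blocks = [bytes([ord("A") + rank]) * BLOCK for _ in range(blocks_per_rank)]
        t = threading.Thread(target=write_checkpoint,
                             args=(flash, rank, num_ranks, blocks))
        writers.append(t)
        t.start()
    for t in writers:
        t.join()

    # The drain runs in the background; the compute job could resume at this point.
    migrator = threading.Thread(target=migrate_sequentially, args=(flash, disk))
    migrator.start()
    migrator.join()

    with open(disk, "rb") as f:
        return f.read()


if __name__ == "__main__":
    print(run_demo())
```

With three ranks and two blocks each, the blocks from ranks 0, 1, and 2 interleave in the shared file, and the drain copies that file to the second tier in a single front-to-back pass.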

Research Organization:
Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC52-06NA25396
Assignee:
EMC Corporation (Hopkinton, MA); Los Alamos National Security, LLC (Los Alamos, NM)
Patent Number(s):
9,286,261
Application Number:
13/676,000
OSTI ID:
1243041
Country of Publication:
United States
Language:
English
