Ad Hoc File Systems for High-Performance Computing
- Johannes Gutenberg Univ., Mainz (Germany)
- Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
- Florida State Univ., Tallahassee, FL (United States)
- Argonne National Lab. (ANL), Argonne, IL (United States)
- Univ. Politecnica de Catalunya, Barcelona (Spain)
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Barcelona Supercomputing Center, Barcelona (Spain)
- Fraunhofer Inst. for Industrial Mathematics ITWM, Kaiserslautern (Germany)
Storage backends of parallel compute clusters are still based mostly on magnetic disks, while newer and faster storage technologies such as flash-based SSDs or non-volatile random access memory (NVRAM) are deployed within compute nodes. Including these new storage technologies into scientific workflows is unfortunately today a mostly manual task, and most scientists therefore do not take advantage of the faster storage media. One approach to systematically include nodelocal SSDs or NVRAMs into scientific workflows is to deploy ad hoc file systems over a set of compute nodes, which serve as temporary storage systems for single applications or longer-running campaigns. This paper presents results from the Dagstuhl Seminar 17202 “Challenges and Opportunities of User-Level File Systems for HPC” and discusses application scenarios as well as design strategies for ad hoc file systems using node-local storage media. The discussion includes open research questions, such as how to couple ad hoc file systems with the batch scheduling environment and how to schedule stage-in and stage-out processes of data between the storage backend and the ad hoc file systems. Also presented are strategies to build ad hoc file systems by using reusable components for networking and how to improve storage device compatibility. Various interfaces and semantics are presented, for example those used by the three ad hoc file systems BeeOND, GekkoFS, and BurstFS. Their presentation covers a range from file systems running in production to cutting-edge research focusing on reaching the performance limits of the underlying devices.
- Research Organization:
- Argonne National Laboratory (ANL), Argonne, IL (United States); Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR); German Research Foundation (DFG); European Union (EU); Spanish Ministry of Science and Innovation (MICINN); National Science Foundation (NSF); USDOE National Nuclear Security Administration (NNSA)
- Grant/Contract Number:
- AC02-06CH11357; 1561041; 1564647; 1744336; 1763547; 1822737; 2014-SGR-1051; TIN2015-65316; 671591; AC52-07NA27344
- OSTI ID:
- 1596689
- Alternate ID(s):
- OSTI ID: 1606092
- Report Number(s):
- LLNL-JRNL-779789; 155300
- Journal Information:
- Journal of Computer Science and Technology, Vol. 35, Issue 1; ISSN 1000-9000
- Publisher:
- Springer NatureCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Web of Science
Characterizing output bottlenecks in a supercomputer
|
conference | November 2012 |
TRIO: Burst Buffer Based I/O Orchestration
|
conference | September 2015 |
Scaling Embedded In-Situ Indexing with DeltaFS
|
conference | November 2018 |
An introduction to disk drive modeling
|
journal | March 1994 |
‘Big data’, Hadoop and cloud computing in genomics
|
journal | October 2013 |
Flexible IO and integration for scientific codes through the adaptable IO system (ADIOS)
|
conference | January 2008 |
The IBM Blue Gene/Q interconnection network and message unit
|
conference | January 2011 |
Entropy-Aware I/O Pipelining for Large-Scale Deep Learning on HPC Systems
|
conference | September 2018 |
Brochure on Storage Systems and I/O: Organizing, Storing, and Accessing Data for Scientific Discovery | report | July 2019 |
LPCC: hierarchical persistent client caching for lustre
|
conference | November 2019 |
Task-based programming in COMPSs to converge from HPC to big data
|
journal | April 2017 |
On the role of burst buffers in leadership-class storage systems
|
conference | April 2012 |
FusionFS: Toward supporting data-intensive scientific applications on extreme-scale high-performance computing systems
|
conference | October 2014 |
MCREngine: A scalable checkpointing system using data-aware aggregation and compression
|
conference | November 2012 |
Direct lookup and hash-based metadata placement for local file systems
|
conference | January 2013 |
An overview of the HDF5 technology suite and its applications
|
conference | January 2011 |
Cray Cascade: A scalable HPC system based on a Dragonfly network
|
conference | November 2012 |
Efficient Data-Movement for Lightweight I/O
|
conference | September 2006 |
Harmonia: An Interference-Aware Dynamic I/O Scheduler for Shared Non-volatile Burst Buffers
|
conference | September 2018 |
Design, Modeling, and Evaluation of a Scalable Multi-level Checkpointing System
|
conference | November 2010 |
Methodology for the Rapid Development of Scalable HPC Data Services
|
conference | November 2018 |
Data Elevator: Low-Contention Data Movement in Hierarchical Storage System
|
conference | December 2016 |
Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In-Situ Workflows
|
conference | November 2018 |
ROOT — A C++ framework for petabyte data storage, statistical analysis and visualization
|
journal | June 2011 |
A Brief Introduction to the OpenFabrics Interfaces - A New Network API for Maximizing High Performance Application Efficiency
|
conference | August 2015 |
Deduplication Potential of HPC Applications’ Checkpoints
|
conference | September 2016 |
UCX: An Open Source Framework for HPC Network APIs and Beyond
|
conference | August 2015 |
GekkoFS - A Temporary Distributed File System for HPC Applications
|
conference | September 2018 |
High-Performance Design of YARN MapReduce on Modern HPC Clusters with Lustre and RDMA
|
conference | May 2015 |
Improving Collective I/O Performance Using Non-volatile Memory Devices
|
conference | September 2016 |
On the Root Causes of Cross-Application I/O Interference in HPC Storage Systems
|
conference | May 2016 |
NetCDF: an interface for scientific data access
|
journal | July 1990 |
Optimizing a hybrid SSD/HDD HPC storage system based on file size distributions
|
conference | May 2013 |
Apache Spark: a unified engine for big data processing
|
journal | October 2016 |
Search and clustering orders of magnitude faster than BLAST
|
journal | August 2010 |
A configurable rule based classful token bucket filter network request scheduler for the lustre file system
|
conference | January 2017 |
The Hadoop Distributed File System
|
conference | May 2010 |
Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web
|
conference | January 1997 |
Scientific computing meets big data technology: An astronomy use case
|
conference | October 2015 |
Challenges and Solutions for Tracing Storage Systems: A Case Study with Spectrum Scale
|
journal | April 2018 |
Understanding and Improving Computational Science Storage Access through Continuous Characterization
|
journal | October 2011 |
PLFS: a checkpoint filesystem for parallel applications | conference | January 2009 |
An Overview of the Atmospheric Component of the Energy Exascale Earth System Model
|
journal | August 2019 |
On the Quality of Wall Time Estimates for Resource Allocation Prediction
|
conference | January 2019 |
Mercury: Enabling remote procedure call for high-performance computing
|
conference | September 2013 |
Qthreads: An API for programming with millions of lightweight threads
|
conference | April 2008 |
Managing Variability in the IO Performance of Petascale Storage Systems
|
conference | November 2010 |
Snakemake—a scalable bioinformatics workflow engine
|
journal | May 2018 |
Managing I/O Interference in a Shared Burst Buffer System
|
conference | August 2016 |
SSD Failures in Datacenters: What? When? and Why?
|
conference | January 2016 |
Argobots: A Lightweight Low-Level Threading and Tasking Framework
|
journal | March 2018 |
Exascale Deep Learning for Climate Analytics
|
conference | November 2018 |
Poster: Portals 4 Network Programming Interface
|
conference | November 2012 |
NORNS: Extending Slurm to Support Data-Driven Workflows through Asynchronous Data Staging
|
conference | September 2019 |
A Large-Scale Study of Flash Memory Failures in the Field
|
journal | June 2015 |
High Performance RDMA-Based MPI Implementation over InfiniBand
|
journal | June 2004 |
A 1 PB/s file system to checkpoint three million MPI tasks
|
conference | January 2013 |
Parallel netCDF: A High-Performance Scientific I/O Interface
|
conference | January 2003 |
Performance and extension of user space file systems
|
conference | January 2010 |
An Ephemeral Burst-Buffer File System for Scientific Applications
|
conference | November 2016 |
File System Scalability with Highly Decentralized Metadata on Independent Storage Devices
|
conference | May 2016 |
A 1 PB/s file system to checkpoint three million MPI tasks
|
conference | October 2018 |
Similar Records
Towards a Scalable and Adaptive Application Support Platform for Large-Scale Distributed E-Sciences in High-Performance Network Environments
Virtual machine-based simulation platform for mobile ad-hoc network-based cyber infrastructure