FlexIO: I/O Middleware for Location-Flexible Scientific Data Analytics
Conference
·
· IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013)
Increasingly severe I/O bottlenecks on High-End Computing machines are prompting scientists to process simulation output data online while simulations are running and before storing data on disk. There are several options to place data analytics along the I/O path: on compute nodes, on separate nodes dedicated to analytics, or after data is stored on persistent storage. Since different placements have different impact on performance and cost, there is a consequent need for flexibility in the location of data analytics. The FlexIO middleware described in this paper makes it easy for scientists to obtain such flexibility, by offering simple abstractions and diverse data movement methods to couple simulation with analytics. Various placement policies can be built on top of FlexIO to exploit the trade-offs in performing analytics at different levels of the I/O hierarchy. Experimental results demonstrate that FlexIO can support a variety of simulation and analytics workloads at large scale through flexible placement options, efficient data movement, and dynamic deployment of data manipulation functionalities.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC)
- OSTI ID:
- 1567340
- Conference Information:
- Journal Name: IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013)
- Country of Publication:
- United States
- Language:
- English
Similar Records
PreDatA - Preparatory Data Analytics on Peta-Scale Machines
Performance analysis of emerging data analytics and HPC workloads
Proactive Data Containers for Scientific Storage (Final Report)
Conference
·
Thu Dec 31 23:00:00 EST 2009
·
OSTI ID:982176
Performance analysis of emerging data analytics and HPC workloads
Journal Article
·
Sat Dec 31 23:00:00 EST 2016
·
OSTI ID:1544357
Proactive Data Containers for Scientific Storage (Final Report)
Technical Report
·
Mon Dec 09 23:00:00 EST 2019
·
OSTI ID:1577855