Combining In-situ and In-transit Processing to Enable Extreme-Scale Scientific Analysis
- Sandia National Laboratories (SNL)
- ORNL
- Lawrence Livermore National Laboratory (LLNL)
- National Renewable Energy Laboratory (NREL)
- University of Utah
- Rutgers University
- Kitware
With the onset of extreme-scale computing, I/O constraints make it increasingly difficult for scientists to save a sufficient amount of raw simulation data to persistent storage. One potential solution is to change the data analysis pipeline from a post-process centric to a concurrent approach based on either in-situ or in-transit processing. In this context computations are considered in-situ if they utilize the primary compute resources, while in-transit processing refers to offloading computations toa set of secondary resources using asynchronous data transfers. In this paper we explore the design and implementation of three common analysis techniques typically performed on large-scale scientific simulations: topological analysis, descriptive statistics, and visualization. We summarize algorithmic developments, describe a resource scheduling system to coordinate the execution ofvarious analysis workflows, and discuss our implementation using the DataSpaces and ADIOS frameworks that support efficient data movement between in-situ and in-transit computations. Wedemonstrate the efficiency of our lightweight, flexible framework by deploying it on the Jaguar XK6 to analyze data generated by S3D, a massively parallel turbulent combustion code. Our framework allows scientists dealing with the data deluge at extreme scale to perform analyses at increased temporal resolutions, mitigate I/O costs, and significantly improve the time to insight.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1096981
- Country of Publication:
- United States
- Language:
- English
Similar Records
A scalable messaging system for accelerating discovery from large scale scientific simulations
Current parallel I/O limitations to scalable data analysis.
Final Technical Report for: A Unified Data-Driven Approach for Programming In Situ Analysis and Visualization
Conference
·
Sat Dec 31 23:00:00 EST 2011
·
OSTI ID:1096347
Current parallel I/O limitations to scalable data analysis.
Technical Report
·
Fri Jul 01 00:00:00 EDT 2011
·
OSTI ID:1022200
Final Technical Report for: A Unified Data-Driven Approach for Programming In Situ Analysis and Visualization
Technical Report
·
Sat May 26 00:00:00 EDT 2018
·
OSTI ID:1439415