RISE: Reducing I/O Contention in Staging-based Extreme-Scale In-situ Workflows
Abstract
While in-situ workflow formulations have addressed some of the data-related challenges associated with extreme-scale scientific workflows, these workflows involve complex interactions and different modes of data exchange. In the context of increasing system complexity, such workflows present significant resource management challenges, requiring complex cost-performance tradeoffs. This paper presents RISE, an intelligent staging-based data management middleware, which builds on the DataSpaces framework and performs intelligent scheduling of data management operations to reduce I/O contention. In RISE, data are always written immediately to local buffers to reduce the effect of the transfer impact upon application performance. RISE identifies applications’ data access patterns and moves data towards data consumers only when the network is expected to be idle, reducing the impact of asynchronous background data movement upon critical data read/write requests. Here, we experimentally demonstrate that RISE can take advantage of staging nodes to offload data during writes without degrading application data movement performance.
- Authors:
-
- Univ. of Utah, Salt Lake City, UT (United States)
- Publication Date:
- Research Org.:
- Rutgers Univ., Piscataway, NJ (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR). Scientific Discovery through Advanced Computing (SciDAC)
- OSTI Identifier:
- 1907691
- Grant/Contract Number:
- SC0021326; AC05-00OR22725
- Resource Type:
- Journal Article: Accepted Manuscript
- Journal Name:
- Proceedings - IEEE International Conference on Cluster Computing (Online)
- Additional Journal Information:
- Journal Volume: 2021; Conference: 2021 IEEE International Conference on Cluster Computing (CLUSTER), Portland, OR (United States), 7-10 Sep 2021; Journal ID: ISSN 2168-9253
- Publisher:
- IEEE
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING; Extreme Scale Data Staging; Machine Learning; Data Management; High Performance Computing
Citation Formats
Subedi, Pradeep, Davis, Philip E., and Parashar, Manish. RISE: Reducing I/O Contention in Staging-based Extreme-Scale In-situ Workflows. United States: N. p., 2021.
Web. doi:10.1109/cluster48925.2021.00021.
Subedi, Pradeep, Davis, Philip E., & Parashar, Manish. RISE: Reducing I/O Contention in Staging-based Extreme-Scale In-situ Workflows. United States. https://doi.org/10.1109/cluster48925.2021.00021
Subedi, Pradeep, Davis, Philip E., and Parashar, Manish. 2021.
"RISE: Reducing I/O Contention in Staging-based Extreme-Scale In-situ Workflows". United States. https://doi.org/10.1109/cluster48925.2021.00021. https://www.osti.gov/servlets/purl/1907691.
@article{osti_1907691,
title = {RISE: Reducing I/O Contention in Staging-based Extreme-Scale In-situ Workflows},
author = {Subedi, Pradeep and Davis, Philip E. and Parashar, Manish},
abstractNote = {While in-situ workflow formulations have addressed some of the data-related challenges associated with extreme-scale scientific workflows, these workflows involve complex interactions and different modes of data exchange. In the context of increasing system complexity, such workflows present significant resource management challenges, requiring complex cost-performance tradeoffs. This paper presents RISE, an intelligent staging-based data management middleware, which builds on the DataSpaces framework and performs intelligent scheduling of data management operations to reduce I/O contention. In RISE, data are always written immediately to local buffers to reduce the effect of the transfer impact upon application performance. RISE identifies applications’ data access patterns and moves data towards data consumers only when the network is expected to be idle, reducing the impact of asynchronous background data movement upon critical data read/write requests. Here, we experimentally demonstrate that RISE can take advantage of staging nodes to offload data during writes without degrading application data movement performance.},
doi = {10.1109/cluster48925.2021.00021},
url = {https://www.osti.gov/biblio/1907691},
journal = {Proceedings - IEEE International Conference on Cluster Computing (Online)},
issn = {2168-9253},
number = ,
volume = 2021,
place = {United States},
year = {2021},
month = {9}
}
Works referenced in this record:
In-memory staging and data-centric task placement for coupled scientific simulation workflows: In-memory staging and data-centric task placement for coupled scientific simulation workflows
journal, April 2017
- Zhang, Fan; Jin, Tong; Sun, Qian
- Concurrency and Computation: Practice and Experience, Vol. 29, Issue 12
Dual space analysis of turbulent combustion particle data
conference, March 2011
- Wei, Jishang; Yu, Hongfeng; Grout, Ray W.
- 2011 IEEE Pacific Visualization Symposium
Opportunities for Nonvolatile Memory Systems in Extreme-Scale High-Performance Computing
journal, March 2015
- Vetter, Jeffrey S.; Mittal, Sparsh
- Computing in Science & Engineering, Vol. 17, Issue 2
DeStager: feature guided in-situ data management in distributed deep memory hierarchies
journal, August 2018
- Zhang, Xuechen; Zheng, Fang; Nguyen, Bao
- Distributed and Parallel Databases, Vol. 37, Issue 1
DataSpaces: an interaction and coordination framework for coupled simulation workflows
journal, February 2011
- Docan, Ciprian; Parashar, Manish; Klasky, Scott
- Cluster Computing, Vol. 15, Issue 2
ActiveSpaces: Exploring dynamic code deployment for extreme scale data processing: ActiveSpaces: Exploring dynamic code deployment for extreme scale data processing
journal, October 2014
- Docan, Ciprian; Zhang, Fan; Jin, Tong
- Concurrency and Computation: Practice and Experience, Vol. 27, Issue 14
Omnisc'IO: A Grammar-Based Approach to Spatial and Temporal I/O Patterns Prediction
conference, November 2014
- Dorier, Matthieu; Ibrahim, Shadi; Antoniu, Gabriel
- SC14: International Conference for High Performance Computing, Networking, Storage and Analysis
Addressing data resiliency for staging based scientific workflows
conference, November 2019
- Duan, Shaohua; Subedi, Pradeep; Davis, Philip E.
- Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
Scalable Data Resilience for In-memory Data Staging
conference, May 2018
- Duan, Shaohua; Subedi, Pradeep; Teranishi, Keita
- 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
Computing Just What You Need: Online Data Analysis and Reduction at Extreme Scales
conference, December 2017
- Foster, Ian
- 2017 IEEE 24th International Conference on High Performance Computing (HiPC)
ADIOS 2: The Adaptable Input Output System. A framework for high-performance data management
journal, July 2020
- Godoy, William F.; Podhorszki, Norbert; Wang, Ruonan
- SoftwareX, Vol. 12
Exploring Data Staging Across Deep Memory Hierarchies for Coupled Data Intensive Simulation Workflows
conference, May 2015
- Jin, Tong; Zhang, Fan; Sun, Qian
- 2015 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
Full-f gyrokinetic particle simulation of centrally heated global ITG turbulence from magnetic axis to edge pedestal top in a realistic tokamak geometry
journal, September 2009
- Ku, S.; Chang, C. S.; Diamond, P. H.
- Nuclear Fusion, Vol. 49, Issue 11
Lynx: a learning linux prefetching mechanism for SSD performance model
conference, August 2016
- Laga, Arezki; Boukhobza, Jalil; Koskas, Michel
- 2016 5th Non-Volatile Memory Systems and Applications Symposium (NVMSA)
Leveraging Machine Learning for Anticipatory Data Delivery in Extreme Scale In-situ Workflows
conference, September 2019
- Subedi, Pradeep; Davis, Philip E.; Parashar, Manish
- 2019 IEEE International Conference on Cluster Computing (CLUSTER)
Terascale direct numerical simulations of turbulent combustion using S3D
journal, January 2009
- Chen, J. H.; Choudhary, A.; de Supinski, B.
- Computational Science & Discovery, Vol. 2, Issue 1
Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In-Situ Workflows
conference, November 2018
- Subedi, Pradeep; Davis, Philip; Duan, Shaohua
- SC18: International Conference for High Performance Computing, Networking, Storage and Analysis
Flexpath: Type-Based Publish/Subscribe System for Large-Scale Science Analytics
conference, May 2014
- Dayal, Jai; Bratcher, Drew; Eisenhauer, Greg
- 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)
Adaptive data placement for staging-based coupled scientific workflows
conference, January 2015
- Sun, Qian; Parashar, Manish; Jin, Tong
- Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '15
Practical prefetching via data compression
conference, January 1993
- Curewitz, Kenneth M.; Krishnan, P.; Vitter, Jeffrey Scott
- Proceedings of the 1993 ACM SIGMOD international conference on Management of data - SIGMOD '93
The future of scientific workflows
journal, April 2017
- Deelman, Ewa; Peterka, Tom; Altintas, Ilkay
- The International Journal of High Performance Computing Applications, Vol. 32, Issue 1
Combining in-situ and in-transit processing to enable extreme-scale scientific analysis
conference, November 2012
- Bennett, Janine C.; Abbasi, Hasan; Bremer, Peer-Timo
- 2012 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis
Moving the Code to the Data - Dynamic Code Deployment Using ActiveSpaces
conference, May 2011
- Docan, Ciprian; Parashar, Manish; Cummings, Julian
- Distributed Processing Symposium (IPDPS), 2011 IEEE International Parallel & Distributed Processing Symposium
DataStager: scalable data staging services for petascale applications
journal, June 2010
- Abbasi, Hasan; Wolf, Matthew; Eisenhauer, Greg
- Cluster Computing, Vol. 13, Issue 3
Scientific workflow management and the Kepler system
journal, January 2006
- Ludäscher, Bertram; Altintas, Ilkay; Berkley, Chad
- Concurrency and Computation: Practice and Experience, Vol. 18, Issue 10
Identifying Hierarchical Structure in Sequences: A linear-time algorithm
journal, September 1997
- Nevill-Manning, C. G.; Witten, I. H.
- Journal of Artificial Intelligence Research, Vol. 7
In Situ Visualization at Extreme Scale: Challenges and Opportunities
journal, November 2009
- Kwan-Liu Ma,
- IEEE Computer Graphics and Applications, Vol. 29, Issue 6
Taverna: a tool for the composition and enactment of bioinformatics workflows
journal, June 2004
- Oinn, T.; Addis, M.; Ferris, J.
- Bioinformatics, Vol. 20, Issue 17
Mercury: Enabling remote procedure call for high-performance computing
conference, September 2013
- Soumagne, Jerome; Kimpe, Dries; Zounmevo, Judicael
- 2013 IEEE International Conference on Cluster Computing (CLUSTER)
Mochi: Composing Data Services for High-Performance Computing Environments
journal, January 2020
- Ross, Robert B.; Amvrosiadis, George; Carns, Philip
- Journal of Computer Science and Technology, Vol. 35, Issue 1