Collective I/O Tuning Using Analytical and Machine-Learning Models

Isaila, Florin; Balaprakash, Prasanna; Wild, Stefan M.; Kimpe, Dries; Latham, Rob; Ross, Rob; Hovland, Paul

doi:10.1109/CLUSTER.2015.29

Title: Collective I/O Tuning Using Analytical and Machine-Learning Models

Conference · Thu Jan 01 00:00:00 EST 2015

DOI:https://doi.org/10.1109/CLUSTER.2015.29· OSTI ID:1351298

Isaila, Florin; Balaprakash, Prasanna; Wild, Stefan M.; Kimpe, Dries; Latham, Rob; Ross, Rob; Hovland, Paul

The ever larger demand of scientific applications for computation and data is currently driving a continuous increase in scale of parallel computers. The inherent complexity of scaling up a computing systems in terms of both hardware and software stack exposes an increasing number of factors impacting the performance and complicating the process of optimization. In particular, the optimization of parallel I/O has become increasingly challenging due to increasing storage hierarchy and well known performance variability of shared storage systems. This paper focuses on model-based autotuning of the two-phase collective I/O algorithm from a popular MPI distribution on the Blue Gene/Q architecture. We propose a novel hybrid model, constructed as a composition of analytical models for communication and storage operations and black-box models for the performance of the individual operations. We perform an in-depth study of the complexity involved in performance modeling including architecture, software stack and noise. In particular we address this challenges of modeling the performance of shared storage systems by building a benchmark that helps synthesizing factors such as topology, file caching, and noise. The experimental results show that the hybrid approach produces significantly better results than state-of-the-art machine learning approaches and shows a higher robustness to noise, at the cost of a higher modeling complexity

OSTI does not have a digital full text copy available. For more information, please see document availability, search WorldCat, or search Google Scholar.

Cite

Export

Save

Research Organization:: Argonne National Lab. (ANL), Argonne, IL (United States)

Sponsoring Organization:: USDOE Office of Science - Office of Advanced Scientific Computing Research

DOE Contract Number:: AC02-06CH11357

OSTI ID:: 1351298

Resource Relation:: Conference: 2015 IEEE Cluster , 09/08/15 - 09/11/15, Chicago, IL, US

Country of Publication:: United States

Language:: English

Similar Records

Data Locality Enhancement of Dynamic Simulations for Exascale Computing (Final Report)

Technical Report · Fri Nov 29 00:00:00 EST 2019 · OSTI ID:1351298

Shen, Xipeng

Roofline Analysis in the Intel® Advisor to Deliver Optimized Performance for applications on Intel® Xeon Phi™ Processor

Conference · Tue May 23 00:00:00 EDT 2017 · OSTI ID:1351298

Koskela, Tuomas S.; Lobet, Mathieu; Deslippe, Jack; +1 more

Quantum Monte Carlo Endstation for Petascale Computing

Technical Report · Wed Mar 02 00:00:00 EST 2011 · OSTI ID:1351298

Ceperley, David

Related Subjects

I/O performance modeling
model-based tuning
statistical and analytical performance models

Title: Collective I/O Tuning Using Analytical and Machine-Learning Models

Citation Formats

Similar Records

Related Subjects