skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Investigating Operating System Noise in Extreme-Scale High-Performance Computing Systems using Simulation

Conference ·
OSTI ID:1073673

Hardware/software co-design for future-generation high-performance computing (HPC) systems aims at closing the gap between the peak capabilities of the hardware and the performance realized by applications (application-architecture performance gap). Performance profiling of architectures and applications is a crucial part of this iterative process. The work in this paper focuses on operating system (OS) noise as an additional factor to be considered for co-design. It represents the first step in including OS noise in HPC hardware/software co-design by adding a noise injection feature to an existing simulation-based co-design toolkit. It reuses an existing abstraction for OS noise with frequency (periodic recurrence) and period (duration of each occurrence) to enhance the processor model of the Extreme-scale Simulator (xSim) with synchronized and random OS noise simulation. The results demonstrate this capability by evaluating the impact of OS noise on MPI_Bcast() and MPI_Reduce() in a simulated future-generation HPC system with 2,097,152 compute nodes.

Research Organization:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC); USDOE Laboratory Directed Research and Development (LDRD) Program
DOE Contract Number:
DE-AC05-00OR22725
OSTI ID:
1073673
Resource Relation:
Conference: 11th IASTED International Conference on Parallel and Distributed Computing and Networks (PDCN) 2013, Innsbruck, Austria, 20130211, 20130213
Country of Publication:
United States
Language:
English

Similar Records

Scaling To A Million Cores And Beyond: Using Light-Weight Simulation to Understand The Challenges Ahead On The Road To Exascale
Journal Article · Wed Jan 01 00:00:00 EST 2014 · Future Generation Computer Systems · OSTI ID:1073673

A new deadlock resolution protocol and message matching algorithm for the extreme-scale simulator
Journal Article · Tue Mar 22 00:00:00 EDT 2016 · Concurrency and Computation. Practice and Experience · OSTI ID:1073673

Improving the Performance of the Extreme-scale Simulator
Conference · Wed Jan 01 00:00:00 EST 2014 · OSTI ID:1073673

Related Subjects