On the memory attribution problem: A solution and case study using MPI
Journal Article
·
· Concurrency and Computation. Practice and Experience
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Univ. of New Mexico, Albuquerque, NM (United States)
- Emory Univ., Atlanta, GA (United States)
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
As parallel applications running on large–scale computing systems become increasingly memory constrained, the ability to attribute memory usage to the various components of the application is becoming increasingly important. We present the design and implementation of memnesia, a novel memory usage profiler for parallel and distributed message–passing applications. Our approach captures both application– and message–passing library–specific memory usage statistics from unmodified binaries dynamically linked to a message–passing communication library. Using microbenchmarks and proxy applications, we evaluated our profiler across three Message Passing Interface (MPI) implementations and two hardware platforms. Furthermore, the results show that our approach and the corresponding implementation can accurately quantify memory resource usage as a function of time, scale, communication workload, and software or hardware system architecture, clearly distinguishing between application and MPI library memory usage at a per–process level. With this new capability, we show that job size, communication workload, and hardware/software architecture influence peak runtime memory usage. In practice, this tool provides a potentially valuable source of information for application developers seeking to measure and optimize memory usage.
- Research Organization:
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA)
- Grant/Contract Number:
- 89233218CNA000001; AC52-06NA25396
- OSTI ID:
- 1495167
- Alternate ID(s):
- OSTI ID: 1493495
- Report Number(s):
- LA-UR--18-30292
- Journal Information:
- Concurrency and Computation. Practice and Experience, Journal Name: Concurrency and Computation. Practice and Experience Journal Issue: 3 Vol. 32; ISSN 1532-0626
- Publisher:
- WileyCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Performance Characterization of a Hierarchical MPI Implementations on Large-scale Distributed-memory Platforms
Compiling MPI for Many-Core Systems
MPI as a coordination layer for communicating HPF tasks
Conference
·
Wed Dec 31 23:00:00 EST 2008
·
OSTI ID:1024230
Compiling MPI for Many-Core Systems
Technical Report
·
Mon Jun 03 00:00:00 EDT 2013
·
OSTI ID:1088441
MPI as a coordination layer for communicating HPF tasks
Conference
·
Mon Dec 30 23:00:00 EST 1996
·
OSTI ID:418494