U.S. Department of Energy
Office of Scientific and Technical Information

On the memory attribution problem: A solution and case study using MPI

Journal Article · Concurrency and Computation: Practice and Experience
DOI: https://doi.org/10.1002/cpe.5159 · OSTI ID: 1495167
 [1];  [2];  [3];  [3]
  1. Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Univ. of New Mexico, Albuquerque, NM (United States)
  2. Emory Univ., Atlanta, GA (United States)
  3. Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
As parallel applications running on large-scale computing systems become increasingly memory constrained, the ability to attribute memory usage to the various components of the application grows in importance. We present the design and implementation of memnesia, a novel memory usage profiler for parallel and distributed message-passing applications. Our approach captures both application- and message-passing-library-specific memory usage statistics from unmodified binaries dynamically linked to a message-passing communication library. Using microbenchmarks and proxy applications, we evaluated our profiler across three Message Passing Interface (MPI) implementations and two hardware platforms. The results show that our approach and the corresponding implementation can accurately quantify memory resource usage as a function of time, scale, communication workload, and software or hardware system architecture, clearly distinguishing between application and MPI library memory usage at a per-process level. With this new capability, we show that job size, communication workload, and hardware/software architecture influence peak runtime memory usage. In practice, this tool provides a potentially valuable source of information for application developers seeking to measure and optimize memory usage.
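
The abstract above describes collecting application- and MPI-library-attributed memory usage from unmodified binaries that are dynamically linked against an MPI library. The following sketch illustrates one conventional way such per-process attribution can be done, using malloc interposition via LD_PRELOAD together with the standard PMPI profiling interface. It is a simplified illustration of the general technique, not memnesia's implementation; the identifiers in_mpi, app_bytes, and mpi_bytes, and the choice of a single wrapped MPI call, are assumptions made for the example.

    /*
     * Minimal sketch (not memnesia itself): attribute per-process heap usage
     * to "application" versus "MPI library" code in an unmodified,
     * dynamically linked MPI program.
     *
     * Build as a shared object and inject it with LD_PRELOAD. malloc is
     * interposed to count bytes; PMPI wrappers mark when control is inside
     * the MPI library so allocations made there are charged to MPI.
     * Assumes a single-threaded (MPI_THREAD_SINGLE) application.
     */
    #define _GNU_SOURCE
    #include <dlfcn.h>
    #include <stdio.h>
    #include <stdlib.h>
    #include <mpi.h>

    static int in_mpi = 0;            /* nonzero while inside an MPI call      */
    static size_t app_bytes = 0;      /* bytes charged to the application      */
    static size_t mpi_bytes = 0;      /* bytes charged to the MPI library      */

    void *malloc(size_t size)
    {
        /* Look up the real malloc lazily; glibc's dlsym allocates with calloc,
           which is not interposed here, so the lookup does not recurse. */
        static void *(*real_malloc)(size_t) = NULL;
        if (!real_malloc)
            real_malloc = (void *(*)(size_t))dlsym(RTLD_NEXT, "malloc");

        void *p = real_malloc(size);
        if (p) {
            if (in_mpi) mpi_bytes += size;
            else        app_bytes += size;
        }
        return p;
    }

    /* One PMPI wrapper shown; a real tool would wrap every MPI entry point. */
    int MPI_Send(const void *buf, int count, MPI_Datatype datatype,
                 int dest, int tag, MPI_Comm comm)
    {
        in_mpi = 1;
        int rc = PMPI_Send(buf, count, datatype, dest, tag, comm);
        in_mpi = 0;
        return rc;
    }

    int MPI_Finalize(void)
    {
        int rank;
        PMPI_Comm_rank(MPI_COMM_WORLD, &rank);
        fprintf(stderr, "[rank %d] application-attributed bytes: %zu, "
                        "MPI-attributed bytes: %zu\n",
                rank, app_bytes, mpi_bytes);
        in_mpi = 1;
        int rc = PMPI_Finalize();
        in_mpi = 0;
        return rc;
    }

To try a sketch like this, one would compile it with an MPI compiler wrapper (for example, mpicc -shared -fPIC attr.c -o libattr.so -ldl) and set LD_PRELOAD to the resulting library when launching the unmodified application. A production profiler of the kind described above would additionally track free, calloc, realloc, and mmap, record usage over time rather than only totals, and wrap the full set of MPI entry points.
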
Research Organization:
Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
89233218CNA000001; AC52-06NA25396
OSTI ID:
1495167
Alternate ID(s):
OSTI ID: 1493495
Report Number(s):
LA-UR-18-30292
Journal Information:
Concurrency and Computation: Practice and Experience, Vol. 32, Issue 3; ISSN 1532-0626
Publisher:
Wiley
Country of Publication:
United States
Language:
English

