OSTI.GOV · U.S. Department of Energy
Office of Scientific and Technical Information

Title: LDRD final report: Managing shared memory data distribution in hybrid HPC applications.

Technical Report · DOI: https://doi.org/10.2172/1007320 · OSTI ID: 1007320

MPI is the dominant programming model for distributed-memory parallel computers, and it is often used as the intra-node programming model on multi-core compute nodes as well. However, application developers are increasingly turning to hybrid models that use threading within a node and MPI between nodes. In contrast to MPI, most current threaded models do not require application developers to deal explicitly with data locality. With the increasing core counts and deeper NUMA hierarchies of the upcoming LANL/SNL 'Cielo' capability supercomputer, data distribution places an upper bound on the intra-node scalability of threaded applications. Data locality therefore has to be either inferred at runtime through static memory allocation policies such as first-touch or next-touch, or specified by the application user at launch time. We evaluate several existing techniques for managing data distribution using micro-benchmarks on an AMD 'Magny-Cours' system with 24 cores spread across 4 NUMA domains, and we argue for the adoption of a dynamic runtime system implemented at the kernel level, employing a novel page table replication scheme to gather per-NUMA-domain memory access traces.
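
For illustration (this sketch is not from the report itself), the first-touch policy mentioned in the abstract works as follows on Linux: a virtual page is physically backed on the NUMA domain of the thread that first writes to it. A minimal C/OpenMP example, assuming a Linux NUMA system and threads pinned to cores:

/* Minimal first-touch sketch; compile with e.g. gcc -O2 -fopenmp first_touch.c */
#include <stdlib.h>
#include <omp.h>

int main(void) {
    const size_t n = (size_t)1 << 24;     /* 16 Mi doubles, 128 MiB */
    double *a = malloc(n * sizeof *a);    /* virtual pages only; not yet placed */
    if (!a) return 1;

    /* First touch: each thread initializes its own static chunk, so the
     * kernel backs those pages on that thread's NUMA domain. */
    #pragma omp parallel for schedule(static)
    for (size_t i = 0; i < n; i++)
        a[i] = 0.0;

    /* Later loops with the same static schedule then access mostly
     * domain-local memory, the behavior NUMA micro-benchmarks measure. */
    #pragma omp parallel for schedule(static)
    for (size_t i = 0; i < n; i++)
        a[i] += 1.0;

    free(a);
    return 0;
}

The launch-time alternative the abstract refers to corresponds to tools such as numactl; for example, 'numactl --interleave=all ./app' spreads pages round-robin across all NUMA domains regardless of which thread touches them first.
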

Research Organization: Sandia National Laboratories (SNL), Albuquerque, NM, and Livermore, CA (United States)
Sponsoring Organization: USDOE
DOE Contract Number: AC04-94AL85000
Report Number(s): SAND2010-6262; TRN: US201106%%897
Country of Publication: United States
Language: English