U.S. Department of Energy
Office of Scientific and Technical Information

Contention Modeling for Multithreaded Distributed Shared Memory Machines: The Cray XMT

Conference

Distributed Shared Memory (DSM) machines are a wide class of multiprocessor computing systems in which a large virtually shared address space is mapped onto a network of physically distributed memories. High memory latency and network contention are two of the main factors that limit the performance scaling of such architectures. Modern high-performance DSM systems have evolved toward massive hardware multithreading and fine-grained memory hashing to tolerate irregular latencies, avoid network hot spots, and enable high scaling. For modeling the performance of such large-scale machines, parallel simulation has proven to be a promising approach that achieves good accuracy in reasonable time. One of the most critical factors in resolving the simulation speed-accuracy trade-off is network modeling. The Cray XMT is a massively multithreaded supercomputing architecture that belongs to the DSM class, since it implements a globally shared address space abstraction on top of a physically distributed memory substrate. In this paper, we discuss the development of a contention-aware network model intended to be integrated into a full-system XMT simulator. We start by measuring the effects of network contention on a 128-processor XMT machine and then investigate the trade-off between simulation accuracy and speed by comparing three network models that operate at different levels of accuracy. The comparison and model validation are performed by executing a string-matching algorithm on the full-system simulator and on the XMT, using three datasets that generate noticeably different contention patterns.
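
The abstract's point that fine-grained memory hashing helps avoid network hot spots can be illustrated with a toy sketch. It is not the XMT's actual address-scrambling function, which this record does not describe. The node count of 128 is borrowed from the machine size quoted above; the function names, the block size, and the multiplicative hash constant are illustrative assumptions. The sketch compares how many requests the busiest memory node receives during a dense address scan under a contiguous block mapping and under a hashed mapping.

```python
from collections import Counter

NUM_NODES = 128          # assumption: one memory node per processor, as in the 128-processor machine
WORDS_PER_NODE = 1 << 20  # hypothetical block size for the contiguous mapping


def block_node(addr):
    """Contiguous mapping: neighbouring addresses live on the same node."""
    return (addr // WORDS_PER_NODE) % NUM_NODES


def hashed_node(addr):
    """Illustrative fine-grained hash (not the real XMT function).

    Multiplies by a 64-bit odd constant and keeps the top 7 bits,
    which selects one of 2**7 = 128 nodes.
    """
    x = (addr * 0x9E3779B97F4A7C15) & 0xFFFFFFFFFFFFFFFF
    return x >> 57


def max_load(mapping, addresses):
    """Number of requests received by the busiest node."""
    counts = Counter(mapping(a) for a in addresses)
    return max(counts.values())


scan = range(100_000)  # a dense scan over consecutive word addresses
block_hot = max_load(block_node, scan)
hashed_hot = max_load(hashed_node, scan)
print("busiest node, block mapping: ", block_hot)
print("busiest node, hashed mapping:", hashed_hot)
```

Under the block mapping, every request in the scan targets a single node, while the hashed mapping spreads the same requests across all nodes. A contention-aware network model has to account for exactly this kind of difference in traffic concentration.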

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (US)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1023734
Report Number(s):
PNNL-SA-76834; 400470000
Country of Publication:
United States
Language:
English

Similar Records

Fast and Accurate Simulation of the Cray XMT Multithreaded Supercomputer
Journal Article · Dec 30, 2012 · IEEE Transactions on Parallel and Distributed Systems, 23(12):2266-2279 · OSTI ID:1088646

Implementing and Evaluating Multithreaded Triad Census Algorithms on the Cray XMT
Conference · May 29, 2009 · OSTI ID:973732

Designing Next Generation Massively Multithreaded Architectures for Irregular Applications
Journal Article · Aug 31, 2012 · Computer, 45(8):53-61 · OSTI ID:1088645