Fast and Accurate Simulation of the Cray XMT Multithreaded Supercomputer
Journal Article
·
· IEEE Transactions on Parallel and Distributed Systems, 23(12):2266-2279
Irregular applications, such as data mining and analysis or graph-based computations, show unpredictable memory/network access patterns and control structures. Highly multithreaded architectures with large processor counts, like the Cray MTA-1, MTA-2 and XMT, appear to address their requirements better than commodity clusters. However, the research on highly multithreaded systems is currently limited by the lack of adequate architectural simulation infrastructures due to issues such as size of the machines, memory footprint, simulation speed, accuracy and customization. At the same time, Shared-memory MultiProcessors (SMPs) with multi-core processors have become an attractive platform to simulate large scale machines. In this paper, we introduce a cycle-level simulator of the highly multithreaded Cray XMT supercomputer. The simulator runs unmodified XMT applications. We discuss how we tackled the challenges posed by its development, detailing the techniques introduced to make the simulation as fast as possible while maintaining a high accuracy. By mapping XMT processors (ThreadStorm with 128 hardware threads) to host computing cores, the simulation speed remains constant as the number of simulated processors increases, up to the number of available host cores. The simulator supports zero-overhead switching among different accuracy levels at run-time and includes a network model that takes into account contention. On a modern 48-core SMP host, our infrastructure simulates a large set of irregular applications 500 to 2000 times slower than real time when compared to a 128-processor XMT, while remaining within 10\% of accuracy. Emulation is only from 25 to 200 times slower than real time.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (US)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1088646
- Report Number(s):
- PNNL-SA-76835; 400470000
- Journal Information:
- IEEE Transactions on Parallel and Distributed Systems, 23(12):2266-2279, Journal Name: IEEE Transactions on Parallel and Distributed Systems, 23(12):2266-2279
- Country of Publication:
- United States
- Language:
- English
Similar Records
Contention Modeling for Multithreaded Distributed Shared Memory Machines: The Cray XMT
Implementing and Evaluating Multithreaded Triad Census Algorithms on the Cray XMT
Early Experience with Out-of-Core Applications on the Cray XMT
Conference
·
Wed Jul 27 00:00:00 EDT 2011
·
OSTI ID:1023734
Implementing and Evaluating Multithreaded Triad Census Algorithms on the Cray XMT
Conference
·
Fri May 29 00:00:00 EDT 2009
·
OSTI ID:973732
Early Experience with Out-of-Core Applications on the Cray XMT
Conference
·
Fri Apr 18 00:00:00 EDT 2008
·
OSTI ID:949128