Development and Validation of a Hierarchical Memory Model Incorporating CPU- and Memory-Operation Overlap
Conference
·
OSTI ID:621718
Distributed shared memory architectures (DSM`s) such as the Origin 2000 are being implemented which extend the concept of single-processor cache hierarchies across an entire physically-distributed multiprocessor machine. The scalability of a DSM machine is inherently tied to memory hierarchy performance, including such issues as latency hiding techniques in the architecture, global cache-coherence protocols, memory consistency models and, of course, the inherent locality of reference in algorithms of interest. In this paper, we characterize application performance with a {open_quotes}memory-centric{close_quotes} view. Using a simple mean value analysis (MVA) strategy and empirical performance data, we infer the contribution of each level in the memory system to the application`s overall cycles per instruction (cpi). We account for the overlap of processor execution with memory accesses - a key parameter which is not directly measurable on the Origin systems. We infer the separate contributions of three major architecture features in the memory subsystem of the Origin 2000: cache size, outstanding loads-under-miss, and memory latency.
- Research Organization:
- Los Alamos National Lab., NM (United States)
- Sponsoring Organization:
- USDOE, Washington, DC (United States)
- DOE Contract Number:
- W-7405-ENG-36
- OSTI ID:
- 621718
- Report Number(s):
- LA-UR--97-3462; CONF-980214--; ON: DE98000349
- Country of Publication:
- United States
- Language:
- English
Similar Records
An empirical hierarchical memory model based on hardware performance counters
Performance characterization and validation of ASCI applications: A memory centric view
A mean value analysis multiprocessor model incorporating superscalar processors and latency tolerating techniques
Conference
·
Tue Sep 01 00:00:00 EDT 1998
·
OSTI ID:674716
Performance characterization and validation of ASCI applications: A memory centric view
Conference
·
Wed Oct 01 00:00:00 EDT 1997
·
OSTI ID:532536
A mean value analysis multiprocessor model incorporating superscalar processors and latency tolerating techniques
Journal Article
·
Sat Jun 01 00:00:00 EDT 1996
· International Journal of Parallel Programming
·
OSTI ID:273923