Cache performance in vector supercomputers
Book
·
OSTI ID:87629
- Univ. of Rochester, NY (United States). Computer Science Dept.
- Cray Research Inc., Chippewa Falls, WI (United States)
- Univ. of Wisconsin, Madison, WI (United States). Dept. of Electrical and Computer Engineering
Traditional supercomputers use a flat multi-bank SRAM memory organization to supply high bandwidth at low latency. Most other computers use a hierarchical organization with a small SRAM cache and slower, cheaper DRAM for main memory. Such systems rely heavily on data locality for achieving optimum performance. This paper evaluates cache-based memory systems for vector supercomputers. The authors develop a simulation model for a cache-based version of the Cray Research C90 and use the NAS parallel benchmarks to provide a large scale workload. They show that while catches reduce memory traffic and improve the performance of plain DRAM memory, they still lag behind cacheless SRAM. They identify the performance bottle-necks in DRAM-based memory systems and quantify their contribution to program performance degradation. They find the data fetch strategy to be a significant parameter affecting performance, evaluate the performance of several fetch policies, and show that small fetch sizes improve performance by maximizing the use of available memory bandwidth.
- OSTI ID:
- 87629
- Report Number(s):
- CONF-941118--; ISBN 0-8186-6605-6
- Country of Publication:
- United States
- Language:
- English
Similar Records
Data Movement Dominates: Final Report
The Wisconsin Multicube: A new large-scale cache-coherent multiprocessor
ON THE ACCELERATION OF SHORTEST PATH CALCULATIONS IN TRANSPORTATION NETWORKS
Technical Report
·
Wed Jul 01 00:00:00 EDT 2015
·
OSTI ID:1168664
The Wisconsin Multicube: A new large-scale cache-coherent multiprocessor
Book
·
Thu Dec 31 23:00:00 EST 1987
·
OSTI ID:6933059
ON THE ACCELERATION OF SHORTEST PATH CALCULATIONS IN TRANSPORTATION NETWORKS
Conference
·
Sun Jan 07 23:00:00 EST 2007
·
OSTI ID:1000496