Improving Memory Subsystem Performance Using ViVA: Virtual Vector Architecture

Gebis, Joseph; Oliker, Leonid; Shalf, John; Williams, Samuel; Yelick, Katherine

Improving Memory Subsystem Performance Using ViVA: Virtual Vector Architecture

Conference · Sun Jan 11 23:00:00 EST 2009

OSTI ID:963537

Gebis, Joseph; Oliker, Leonid; Shalf, John; Williams, Samuel; Yelick, Katherine

The disparity between microprocessor clock frequencies and memory latency is a primary reason why many demanding applications run well below peak achievable performance. Software controlled scratchpad memories, such as the Cell local store, attempt to ameliorate this discrepancy by enabling precise control over memory movement; however, scratchpad technology confronts the programmer and compiler with an unfamiliar and difficult programming model. In this work, we present the Virtual Vector Architecture (ViVA), which combines the memory semantics of vector computers with a software-controlled scratchpad memory in order to provide a more effective and practical approach to latency hiding. ViVA requires minimal changes to the core design and could thus be easily integrated with conventional processor cores. To validate our approach, we implemented ViVA on the Mambo cycle-accurate full system simulator, which was carefully calibrated to match the performance on our underlying PowerPC Apple G5 architecture. Results show that ViVA is able to deliver significant performance benefits over scalar techniques for a variety of memory access patterns as well as two important memory-bound compact kernels, corner turn and sparse matrix-vector multiplication -- achieving 2x-13x improvement compared the scalar version. Overall, our preliminary ViVA exploration points to a promising approach for improving application performance on leading microprocessors with minimal design and complexity costs, in a power efficient manner.

Research Organization:: Ernest Orlando Lawrence Berkeley National Laboratory, Berkeley, CA (US)

Sponsoring Organization:: Computational Research Division

DOE Contract Number:: AC02-05CH11231

OSTI ID:: 963537

Report Number(s):: LBNL-2139E

Country of Publication:: United States

Language:: English

Similar Records

Dense and Sparse Matrix Operations on the Cell Processor

Conference · Sun May 01 00:00:00 EDT 2005 · OSTI ID:891029

Collective Memory Transfers for Multi-Core Chips

Technical Report · Tue Nov 12 23:00:00 EST 2013 · OSTI ID:1164908

Loop parallelism on Tera MTA using SISAL

Conference · Tue Oct 31 23:00:00 EST 1995 · OSTI ID:224967

Related Subjects

97 MATHEMATICS AND COMPUTING
ARCHITECTURE
COMPUTERS
DESIGN
EXPLORATION
KERNELS
MICROPROCESSORS
PERFORMANCE
PROGRAMMING
SCALARS
VECTORS

Improving Memory Subsystem Performance Using ViVA: Virtual Vector Architecture

Citation Formats

Similar Records

Related Subjects