OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Light-weight cache coherence for data processors with limited data sharing

Patent · OSTI ID: 1469175

A data processing system includes a plurality of processors, local memories each associated with a corresponding processor, and at least one inter-processor link. In response to a first processor performing a load or store operation on an address of its corresponding local memory that is not currently in its local cache, the local cache allocates a first cache line and encodes a local state with the first cache line. In response to a load operation from an address of a remote memory that is not currently in the local cache, the local cache allocates a second cache line and encodes a remote state with the second cache line. The first processor performs subsequent loads and stores on the first cache line in the local cache in response to the local state, and subsequent loads from the second cache line in the local cache in response to the remote state.
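The allocation rule in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the patented hardware design: the names `LineState`, `CacheLine`, and `LocalCache` are hypothetical, memories are modeled as Python dictionaries, one address stands for one cache line, and write-back and eviction are not modeled. The abstract does not say how stores to remote addresses are handled, so the model assumes they bypass the cache.

```python
from dataclasses import dataclass
from enum import Enum


class LineState(Enum):
    LOCAL = "local"    # line backed by this processor's own local memory
    REMOTE = "remote"  # line holding data loaded from a remote memory


@dataclass
class CacheLine:
    state: LineState
    value: int


class LocalCache:
    """Toy model of one processor's local cache (hypothetical names)."""

    def __init__(self, local_addrs, local_memory, remote_memory):
        self.local_addrs = local_addrs      # addresses owned by this processor
        self.local_memory = local_memory    # dict: address -> value
        self.remote_memory = remote_memory  # dict: address -> value
        self.lines = {}                     # address -> CacheLine

    def _allocate(self, addr):
        # On a miss, allocate a line and encode its state from the address.
        if addr in self.local_addrs:
            line = CacheLine(LineState.LOCAL, self.local_memory.get(addr, 0))
        else:
            line = CacheLine(LineState.REMOTE, self.remote_memory.get(addr, 0))
        self.lines[addr] = line
        return line

    def load(self, addr):
        # Hits in either state are served from the local cache.
        line = self.lines.get(addr)
        if line is None:
            line = self._allocate(addr)
        return line.value

    def store(self, addr, value):
        if addr not in self.local_addrs:
            # ASSUMPTION (not in the abstract): a remote store goes straight
            # to the remote memory and drops any cached remote-state copy.
            self.remote_memory[addr] = value
            self.lines.pop(addr, None)
            return
        # Local stores allocate on a miss and then update the cached line.
        line = self.lines.get(addr)
        if line is None:
            line = self._allocate(addr)
        line.value = value


local_mem = {0: 10}
remote_mem = {100: 7}
cache = LocalCache(range(0, 64), local_mem, remote_mem)

cache.store(0, 11)   # local miss: allocates a LOCAL line, then writes it
cache.load(100)      # remote miss: allocates a REMOTE line
print(cache.lines[0].state, cache.lines[100].state)
```

In this model, a later `cache.load(100)` is answered from the cached remote-state line without consulting `remote_mem` again, which mirrors the abstract's "subsequent loads from the second cache line in the local cache".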

Research Organization: Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization: USDOE
DOE Contract Number: AC52-07NA27344
Assignee: Advanced Micro Devices, Inc. (Santa Clara, CA)
Patent Number(s): 10,042,762
Application Number: 15/264,804
OSTI ID: 1469175
Resource Relation: Patent File Date: 2016 Sep 14
Country of Publication: United States
Language: English
