Home

About

Advanced Search

Browse by Discipline

Scientific Societies

E-print Alerts

Add E-prints

E-print Network
FAQHELPSITE MAPCONTACT US


  Advanced Search  

 
Extending the TOKENCMP Cache Coherence Protocol for Low Overhead Fault Tolerance
 

Summary: Extending the TOKENCMP Cache Coherence
Protocol for Low Overhead Fault Tolerance
in CMP Architectures
Ricardo Ferna´ndez-Pascual, Jose´ M. Garci´a, Member, IEEE,
Manuel E. Acacio, and Jose´ Duato, Member, IEEE
Abstract--It is widely accepted that transient failures will appear more frequently in chips designed in the near future due to several
factors such as the increased integration scale. On the other hand, chip multiprocessors (CMPs) that integrate several processor cores
in a single chip are nowadays the best alternative to more efficient use of the increasing number of transistors that can be placed in a
single die. Hence, it is necessary to design new techniques to deal with these faults to be able to build sufficiently reliable CMPs. In this
work, we present a coherence protocol aimed at dealing with transient failures that affect the interconnection network of a CMP, thus
assuming that the network is no longer reliable. In particular, our proposal extends a token-based cache coherence protocol so that no
data can be lost and no deadlock can occur due to any dropped message. Using the GEMS full-system simulator, we compare our
proposal against a similar protocol without fault tolerance (TOKENCMP). We show that in the absence of failures, our proposal does not
introduce overhead in terms of increased execution time over TOKENCMP. Additionally, our protocol can tolerate message loss rates
much higher than those likely to be found in the real world, without increasing the execution time by more than 15 percent.
Index Terms--Fault tolerance, cache coherence, CMP, transient failures, TOKENCMP.
Ç
1 INTRODUCTION
CHIP multiprocessors (CMPs) [3], [6] are currently
accepted as the best way to take advantage of the

  

Source: Acacio, Manuel - Departamento de Ingenieria y Tecnologia de Computadores, Universidad de Murcia
Fernández Pascual, Ricardo - Departamento de Ingenieria y Tecnologia de Computadores, Universidad de Murcia

 

Collections: Computer Technologies and Information Sciences