DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Methodology for fast detection of false sharing in threaded scientific codes

Abstract

A profiling tool identifies a code region with a false sharing potential. A static analysis tool classifies variables and arrays in the identified code region. A mapping detection library correlates memory access instructions in the identified code region with variables and arrays in the identified code region while a processor is running the identified code region. The mapping detection library identifies one or more instructions at risk, in the identified code region, which are subject to an analysis by a false sharing detection library. A false sharing detection library performs a run-time analysis of the one or more instructions at risk while the processor is re-running the identified code region. The false sharing detection library determines, based on the performed run-time analysis, whether two different portions of the cache memory line are accessed by the generated binary code.

Inventors:
; ; ; ;
Issue Date:
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1164340
Patent Number(s):
8898648
Application Number:
13/689,927
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
B554331
Resource Type:
Patent
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Chung, I-Hsin, Cong, Guojing, Murata, Hiroki, Negishi, Yasushi, and Wen, Hui-Fang. Methodology for fast detection of false sharing in threaded scientific codes. United States: N. p., 2014. Web.
Chung, I-Hsin, Cong, Guojing, Murata, Hiroki, Negishi, Yasushi, & Wen, Hui-Fang. Methodology for fast detection of false sharing in threaded scientific codes. United States.
Chung, I-Hsin, Cong, Guojing, Murata, Hiroki, Negishi, Yasushi, and Wen, Hui-Fang. Tue . "Methodology for fast detection of false sharing in threaded scientific codes". United States. https://www.osti.gov/servlets/purl/1164340.
@article{osti_1164340,
title = {Methodology for fast detection of false sharing in threaded scientific codes},
author = {Chung, I-Hsin and Cong, Guojing and Murata, Hiroki and Negishi, Yasushi and Wen, Hui-Fang},
abstractNote = {A profiling tool identifies a code region with a false sharing potential. A static analysis tool classifies variables and arrays in the identified code region. A mapping detection library correlates memory access instructions in the identified code region with variables and arrays in the identified code region while a processor is running the identified code region. The mapping detection library identifies one or more instructions at risk, in the identified code region, which are subject to an analysis by a false sharing detection library. A false sharing detection library performs a run-time analysis of the one or more instructions at risk while the processor is re-running the identified code region. The false sharing detection library determines, based on the performed run-time analysis, whether two different portions of the cache memory line are accessed by the generated binary code.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2014},
month = {11}
}

Works referenced in this record:

Method and system for reducing the computing overhead associated with thread local objects
patent, June 2004


Method and apparatus for autonomically moving cache entries to dedicated storage when false cache line sharing is detected
patent, September 2006


Memory access assignment for parallel processing architectures
patent, May 2012


Compiling code for parallel processing architectures based on control flow
patent, August 2012


Distributing parallelism for parallel processing architectures
patent, August 2012


Communication scheduling for parallel processing architectures
patent, October 2012


Efficient Object Profiling for Optimizing Object Locality
patent-application, June 2009


Method and system for optimizing code for a multi-threaded application
patent-application, August 2011


Multi-Petascale Highly Efficient Parallel Supercomputer
patent-application, September 2011


Detecting and Optimizing False Sharing
patent-application, November 2011


Architecture-Aware Field Affinity Estimation
patent-application, December 2011


System and Method for Analyzing Dynamic Performance of Complex Applications
patent-application, November 2012


SHERIFF: precise detection and automatic mitigation of false sharing
journal, October 2011


Anywhere, any-time binary instrumentation
conference, January 2011


Gprof: A call graph execution profiler
journal, June 1982