Methodology for fast detection of false sharing in threaded scientific codes
Abstract
A profiling tool identifies a code region with a false sharing potential. A static analysis tool classifies variables and arrays in the identified code region. A mapping detection library correlates memory access instructions in the identified code region with variables and arrays in the identified code region while a processor is running the identified code region. The mapping detection library identifies one or more instructions at risk, in the identified code region, which are subject to an analysis by a false sharing detection library. A false sharing detection library performs a run-time analysis of the one or more instructions at risk while the processor is re-running the identified code region. The false sharing detection library determines, based on the performed run-time analysis, whether two different portions of the cache memory line are accessed by the generated binary code.
- Inventors:
- Issue Date:
- Research Org.:
- International Business Machines Corp., Armonk, NY (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1164340
- Patent Number(s):
- 8898648
- Application Number:
- 13/689,927
- Assignee:
- International Business Machines Corporation (Armonk, NY)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- B554331
- Resource Type:
- Patent
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Chung, I-Hsin, Cong, Guojing, Murata, Hiroki, Negishi, Yasushi, and Wen, Hui-Fang. Methodology for fast detection of false sharing in threaded scientific codes. United States: N. p., 2014.
Web.
Chung, I-Hsin, Cong, Guojing, Murata, Hiroki, Negishi, Yasushi, & Wen, Hui-Fang. Methodology for fast detection of false sharing in threaded scientific codes. United States.
Chung, I-Hsin, Cong, Guojing, Murata, Hiroki, Negishi, Yasushi, and Wen, Hui-Fang. Tue .
"Methodology for fast detection of false sharing in threaded scientific codes". United States. https://www.osti.gov/servlets/purl/1164340.
@article{osti_1164340,
title = {Methodology for fast detection of false sharing in threaded scientific codes},
author = {Chung, I-Hsin and Cong, Guojing and Murata, Hiroki and Negishi, Yasushi and Wen, Hui-Fang},
abstractNote = {A profiling tool identifies a code region with a false sharing potential. A static analysis tool classifies variables and arrays in the identified code region. A mapping detection library correlates memory access instructions in the identified code region with variables and arrays in the identified code region while a processor is running the identified code region. The mapping detection library identifies one or more instructions at risk, in the identified code region, which are subject to an analysis by a false sharing detection library. A false sharing detection library performs a run-time analysis of the one or more instructions at risk while the processor is re-running the identified code region. The false sharing detection library determines, based on the performed run-time analysis, whether two different portions of the cache memory line are accessed by the generated binary code.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2014},
month = {11}
}
Works referenced in this record:
Validation checking of shared memory accesses
patent, June 1998
- Scales, Daniel J.
- US Patent Document 5,761,729
Method and system for reducing the computing overhead associated with thread local objects
patent, June 2004
- Azagury, Alain; Kolodner, Elliott Karl; Factor, Michael E.
- US Patent Document 6,757,891
Method and apparatus for autonomically moving cache entries to dedicated storage when false cache line sharing is detected
patent, September 2006
- DeWitt, Jr., Jimmie Earl; Levine, Frank Eliot; Richardson, Christopher Michael
- US Patent Document 7,114,036
Memory access assignment for parallel processing architectures
patent, May 2012
- Lee, Walter; Gottlieb, Robert A.; Soni, Vineet
- US Patent Document 8,181,168
Compiling code for parallel processing architectures based on control flow
patent, August 2012
- Lee, Walter; Gottlieb, Robert A.; Soni, Vineet
- US Patent Document 8,250,555
Distributing parallelism for parallel processing architectures
patent, August 2012
- Lee, Walter; Gottlieb, Robert A.; Soni, Vineet
- US Patent Document 8,250,556
Communication scheduling for parallel processing architectures
patent, October 2012
- Lee, Walter; Gottlieb, Robert A.; Soni, Vineet
- US Patent Document 8,291,400
Efficient generation of SIMD code in presence of multi-threading and other false sharing conditions and in machines having memory protection support
patent-application, September 2007
- Eichenberger, Alexandre E.; Wang, Kasi-Ting Amy; Wu, Peng
- US Patent Application 11/358372; 20070226723
System for and Method of Capturing Performance Characteristics Data From A Computer System and Modeling Target System Performance
patent-application, October 2008
- Berg, Erik; Hagersten, Erik; Zeffer, Hakan
- US Patent Application 12/055996; 20080244533
Efficient Object Profiling for Optimizing Object Locality
patent-application, June 2009
- Maier, Daryl James; Groevski, Nikola; Siegwart, David Kevin
- US Patent Application 11/950962; 20090150875
Method and system for optimizing code for a multi-threaded application
patent-application, August 2011
- Dice, David; Marathe, Virendra J.; Moir, Mark S.
- US Patent Application 12/708014; 20110202907
Multi-Petascale Highly Efficient Parallel Supercomputer
patent-application, September 2011
- Asaad, Sameh; Bellofatto, Ralph E.; Blocksome, Michael A.
- US Patent Document 13/004007; 20110219208
Detecting and Optimizing False Sharing
patent-application, November 2011
- Citron, Daniel; Klausner, Moshe; Kupershtok, Aharon
- US Patent Application 12/780904; 20110283152
Architecture-Aware Field Affinity Estimation
patent-application, December 2011
- Dayan, Alon; Edelson, David Joel; Golovanevsky, Olga
- US Patent Application 12/795769; 20110302561
System and Method for Analyzing Dynamic Performance of Complex Applications
patent-application, November 2012
- Jalan, Rohit; Kejariwal, Arun
- US Patent Application 13/097446; 20120278793
SHERIFF: precise detection and automatic mitigation of false sharing
journal, October 2011
- Liu, Tongping; Berger, Emery D.
- ACM SIGPLAN Notices, Vol. 46, Issue 10
Anywhere, any-time binary instrumentation
conference, January 2011
- Bernat, Andrew R.; Miller, Barton P.
- Proceedings of the 10th ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools - PASTE '11
Gprof: A call graph execution profiler
journal, June 1982
- Graham, Susan L.; Kessler, Peter B.; Mckusick, Marshall K.
- ACM SIGPLAN Notices, Vol. 17, Issue 6