System, method, and computer program product for bulk synchronous binary program translation and optimization
Abstract
A system, method, and computer program product are provided for. The method includes the steps of executing a block of translated binary instructions by multiple threads and gathering profiling data during execution of the block of translated binary instructions. The multiple threads are then synchronized at a barrier instruction associated with the block of translated binary instructions and the block of translated binary instructions is replaced with optimized binary instructions, where the optimized binary instructions are produced based on the profiling data.
- Inventors:
- Issue Date:
- Research Org.:
- NVIDIA Corp., Santa Clara, CA (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1532140
- Patent Number(s):
- 9207919
- Application Number:
- 14/158,749
- Assignee:
- NVIDIA Corporation (Santa Clara, CA)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- B599861; HR0011-13-3-0001
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 2014-01-17
- Country of Publication:
- United States
- Language:
- English
Citation Formats
Diamos, Gregory Frederick. System, method, and computer program product for bulk synchronous binary program translation and optimization. United States: N. p., 2015.
Web.
Diamos, Gregory Frederick. System, method, and computer program product for bulk synchronous binary program translation and optimization. United States.
Diamos, Gregory Frederick. Tue .
"System, method, and computer program product for bulk synchronous binary program translation and optimization". United States. https://www.osti.gov/servlets/purl/1532140.
@article{osti_1532140,
title = {System, method, and computer program product for bulk synchronous binary program translation and optimization},
author = {Diamos, Gregory Frederick},
abstractNote = {A system, method, and computer program product are provided for. The method includes the steps of executing a block of translated binary instructions by multiple threads and gathering profiling data during execution of the block of translated binary instructions. The multiple threads are then synchronized at a barrier instruction associated with the block of translated binary instructions and the block of translated binary instructions is replaced with optimized binary instructions, where the optimized binary instructions are produced based on the profiling data.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Tue Dec 08 00:00:00 EST 2015},
month = {Tue Dec 08 00:00:00 EST 2015}
}
Save to My Library
You must Sign In or Create an Account in order to save documents to your library.
Works referenced in this record:
Using a transactional execution mechanism to free up processor resources used by a busy-waiting thread
patent, July 2008
- Caprioli, Paul; Mesard, Wayne
- US Patent Document 7,395,418
Monitoring System Noises in Parallel Computer Systems
patent-application, January 2015
- Yim, Keun Soo; Seelam, Seetharami R.; Fong, Liana L.
- US Patent Application 13/945559; 20150026687
Initiation of Cache Flushes and Invalidations on Graphics Processors
patent-application, December 2014
- Gupta, Niraj; Jiang, Hong
- US Patent Application 13/926328; 20140379993
Method and system for transparent dynamic optimization in a multiprocessing environment
patent-application, March 2004
- Nair, Ravi; O'Brien, John Kevin; O'Brien, Kathryn Mary
- US Patent Application 10/244564; 20040054992
Relationships Derived from Trace Data
patent-application, August 2013
- Li, Ying; Gounares, Alexander G.; Garrett, Charles D.
- US Patent Application 13/853816; 20130219057
System and Method for Optimizing a Code Section by Forcing a Code Section to be Executed Atomically
patent-application, October 2012
- Moir, Mark S.; Dice, David; Tirthapura, Srikanta N.
- US Patent Application 13/077793; 20120254846
Method and system for converting a single-threaded software program into an application-specific supercomputer
patent-application, May 2013
- Ebcioglu, Kemal; Kultursay, Emre; Kandemir, Mahmut Taylan
- US Patent Application 13/296232; 20130125097
