Dynamic wavefront creation for processing units using a hybrid compactor
Abstract
A method, a non-transitory computer readable medium, and a processor for repacking dynamic wavefronts during program code execution on a processing unit, each dynamic wavefront including multiple threads are presented. If a branch instruction is detected, a determination is made whether all wavefronts following a same control path in the program code have reached a compaction point, which is the branch instruction. If no branch instruction is detected in executing the program code, a determination is made whether all wavefronts following the same control path have reached a reconvergence point, which is a beginning of a program code segment to be executed by both a taken branch and a not taken branch from a previous branch instruction. The dynamic wavefronts are repacked with all threads that follow the same control path, if all wavefronts following the same control path have reached the branch instruction or the reconvergence point.
- Inventors:
- Issue Date:
- Research Org.:
- Advanced Micro Devices, Inc., Sunnyvale, CA (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1422752
- Patent Number(s):
- 9898287
- Application Number:
- 14/682,971
- Assignee:
- Advanced Micro Devices, Inc. (Sunnyvale, CA)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- AC52-07NA27344
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 2015 Apr 09
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Puthoor, Sooraj, Beckmann, Bradford M., and Yudanov, Dmitri. Dynamic wavefront creation for processing units using a hybrid compactor. United States: N. p., 2018.
Web.
Puthoor, Sooraj, Beckmann, Bradford M., & Yudanov, Dmitri. Dynamic wavefront creation for processing units using a hybrid compactor. United States.
Puthoor, Sooraj, Beckmann, Bradford M., and Yudanov, Dmitri. Tue .
"Dynamic wavefront creation for processing units using a hybrid compactor". United States. https://www.osti.gov/servlets/purl/1422752.
@article{osti_1422752,
title = {Dynamic wavefront creation for processing units using a hybrid compactor},
author = {Puthoor, Sooraj and Beckmann, Bradford M. and Yudanov, Dmitri},
abstractNote = {A method, a non-transitory computer readable medium, and a processor for repacking dynamic wavefronts during program code execution on a processing unit, each dynamic wavefront including multiple threads are presented. If a branch instruction is detected, a determination is made whether all wavefronts following a same control path in the program code have reached a compaction point, which is the branch instruction. If no branch instruction is detected in executing the program code, a determination is made whether all wavefronts following the same control path have reached a reconvergence point, which is a beginning of a program code segment to be executed by both a taken branch and a not taken branch from a previous branch instruction. The dynamic wavefronts are repacked with all threads that follow the same control path, if all wavefronts following the same control path have reached the branch instruction or the reconvergence point.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2018},
month = {2}
}
Works referenced in this record:
Creating and Managing Links to Deduplication Information
patent-application, January 2011
- Spackman, Stephen Philip
- US Patent Application 12/506400; 20110022825
Dynamic warp subdivision for integrated branch and memory latency divergence tolerance
patent-application, September 2011
- Skadron, Kevin; Meng, Jiayuan; Tarjan, David
- US Patent Application 13/040045; 20110219221
Techniques for Handling Divergent Threads in a Multi-Threaded Processing System
patent-application, March 2013
- Chen, Lin; Garcia Garcia, David Rigel; Gruber, Andrew E.
- US Patent Application 13/227274; 20130061027
System And Method For Synchronizing Threads In A Divergent Region Of Code
patent-application, March 2014
- Jones, Stephen
- US Patent Application 13/608912; 20140075160
Compiler-Controlled Region Scheduling for SIMD Execution of Threads
patent-application, June 2014
- Diamos, Gregory; Mehrara, Mojtaba
- US Patent Application 13/710279; 20140165049
Microarchitectural Design Space Exploration Using an Architecture-Centric Approach
conference, December 2007
- Dubach, Christophe; Jones, Timothy; O'Boyle, Michael
- 40th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2007)
Dynamic warp formation: Efficient MIMD control flow on SIMD graphics hardware
journal, June 2009
- Fung, Wilson W. L.; Sham, Ivan; Yuan, George
- ACM Transactions on Architecture and Code Optimization, Vol. 6, Issue 2, p. 1-37
Thread block compaction for efficient SIMT control flow
conference, February 2011
- Fung, Wilson W. L.; Aamodt, Tor M.
- 2011 IEEE 17th International Symposium on High Performance Computer Architecture
SIMD divergence optimization through intra-warp compaction
journal, July 2013
- Vaidya, Aniruddha S.; Shayesteh, Anahita; Woo, Dong Hyuk
- ACM SIGARCH Computer Architecture News, Vol. 41, Issue 3, p. 368-379