Hardware accelerated dynamic work creation on a graphics processing unit
Abstract
A processor core is configured to execute a parent task that is described by a data structure stored in a memory. A coprocessor is configured to dispatch a child task to the at least one processor core in response to the coprocessor receiving a request from the parent task concurrently with the parent task executing on the at least one processor core. In some cases, the parent task registers the child task in a task pool and the child task is a future task that is configured to monitor a completion object and enqueue another task associated with the future task in response to detecting the completion object. The future task is configured to self-enqueue by adding a continuation future task to a continuation queue for subsequent execution in response to the future task failing to detect the completion object.
- Inventors:
- Issue Date:
- Research Org.:
- Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States); Advanced Micro Devices, Inc., Santa Clara, CA (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1987029
- Patent Number(s):
- 11550627
- Application Number:
- 17/215,171
- Assignee:
- Advanced Micro Devices, Inc. (Santa Clara, CA)
- DOE Contract Number:
- AC52-07NA27344; B620717
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 03/29/2021
- Country of Publication:
- United States
- Language:
- English
Citation Formats
Gutierrez, Anthony T., and Puthoor, Sooraj. Hardware accelerated dynamic work creation on a graphics processing unit. United States: N. p., 2023.
Web.
Gutierrez, Anthony T., & Puthoor, Sooraj. Hardware accelerated dynamic work creation on a graphics processing unit. United States.
Gutierrez, Anthony T., and Puthoor, Sooraj. Tue .
"Hardware accelerated dynamic work creation on a graphics processing unit". United States. https://www.osti.gov/servlets/purl/1987029.
@article{osti_1987029,
title = {Hardware accelerated dynamic work creation on a graphics processing unit},
author = {Gutierrez, Anthony T. and Puthoor, Sooraj},
abstractNote = {A processor core is configured to execute a parent task that is described by a data structure stored in a memory. A coprocessor is configured to dispatch a child task to the at least one processor core in response to the coprocessor receiving a request from the parent task concurrently with the parent task executing on the at least one processor core. In some cases, the parent task registers the child task in a task pool and the child task is a future task that is configured to monitor a completion object and enqueue another task associated with the future task in response to detecting the completion object. The future task is configured to self-enqueue by adding a continuation future task to a continuation queue for subsequent execution in response to the future task failing to detect the completion object.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2023},
month = {1}
}
Works referenced in this record:
Adaptive Scheduling for Task Assignment among Heterogenous Processor Cores
patent-application, February 2016
- Barik, Rajkishore; Shpeisman, Tatiana; Lewis, Brain T.
- US Patent Application 14/583247; 20160055612
Control flow in a heterogeneous computer system
patent, November 2017
- Boudier, Pierre
- US Patent Document 9,830,163
Using Asymmetric Single-ISA CMPs to Save Energy on Operating Systems
journal, May 2008
- Mogul, Jeffrey C.; Mudigonda, Jayaram; Binkert, Nathan
- IEEE Micro, Vol. 28, Issue 3
Multi-Core Communications Acceleration Using Hardware Queue Device
patent-application, July 2017
- Wang, Ren; Wang, Yipeng; Herdrich, Andrew J.
- US Patent Application 14/987676; 20170192921
Facilitating Dynamic Parallel Scheduling of Command Packets at Graphics Processing Units on Computing Devices
patent-application, August 2017
- Mrozek, Michael Andrzej
- US Patent Application 15/503201; 20170236246
Method and System of a Hierchical Task Scheduler for a Multi-thread System
patent-application, August 2016
- Kipp, Timothy James
- US Patent Application 15/011127; 20160239346
Robust, Efficient Multiprocessor-coprocessor Interface
patent-application, February 2020
- Babich, Ronald; Burgess, John; Choquette, Jack
- US Patent Application 16/101247; 20200050451
System and Method For Swarm Intelligence Using Dynamically Configurable Proactive Autonomous Agents Cells
patent-application, October 2018
- Iniguez, Alfonso
- US Patent Application 15/852480; 20180293096
Heterogeneous enqueuing and dequeuing mechanism for task scheduling
patent, August 2016
- Sander, Benjamin T.; Houston, Michael; Cheung, Newton
- US Patent Document 9,430,281
Heterogenous Enqueuing and Dequeuing Mechanism for Task Scheduling
patent-application, December 2016
- Sander, Benjamin Thomas; Houston, Michael; Cheung, Newton
- US Patent Application 15/250385; 20160371116