skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Technique for grouping instructions into independent strands

Patent ·
OSTI ID:1531988

A device compiler and linker is configured to group instructions into different strands for execution by different threads based on the dependence of those instructions on other, long-latency instructions. A thread may execute a strand that includes long-latency instructions, and then hardware resources previously allocated for the execution of that thread may be de-allocated from the thread and re-allocated to another thread. The other thread may then execute another strand while the long-latency instructions are in flight. With this approach, the other thread is not required to wait for the long-latency instructions to complete before acquiring hardware resources and initiating execution of the other strand, thereby eliminating at least a portion of the time that the other thread would otherwise spend waiting.

Research Organization:
NVIDIA Corp., Santa Clara, CA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
B599861; HR0011-13-3-0001
Assignee:
NVIDIA Corporation (Santa Clara, CA)
Patent Number(s):
9,645,802
Application Number:
13/961,097
OSTI ID:
1531988
Resource Relation:
Patent File Date: 2013-08-07
Country of Publication:
United States
Language:
English

References (10)

Apparatus and method for speculatively executing instructions in a computer system patent May 1995
Multithreaded data processing method with long latency subinstructions patent April 2001
Local stall control method and structure in a microprocessor patent August 2001
Generation of compiler description from architecture description patent March 2014
Scheduling of instructions patent April 2014
Value speculation on an assist processor to facilitate prefetching for a primary processor patent-application December 2001
Supporting out-of-order issue in an execute-ahead processor patent-application August 2007
Diagnostic apparatus and method patent-application June 2008
Credit-Based Streaming Multiprocessor Warp Scheduling patent-application March 2011
Opcode Counting for Performance Measurement patent-application July 2011

Cited By (2)


Similar Records

System, method, and computer program product for bulk synchronous binary program translation and optimization
Patent · Tue Dec 08 00:00:00 EST 2015 · OSTI ID:1531988

Two fundamental issues in multiprocessing. Technical report
Technical Report · Thu Oct 01 00:00:00 EDT 1987 · OSTI ID:1531988

Single-pass parallel prefix scan with dynamic look back
Patent · Tue Mar 27 00:00:00 EDT 2018 · OSTI ID:1531988

Related Subjects