skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Methods for multitasking among real-time embedded compute tasks running on the GPU: Methods for Multitasking Real-time Embedded GPU Computing Tasks

Journal Article · · Concurrency and Computation. Practice and Experience
DOI:https://doi.org/10.1002/cpe.4118· OSTI ID:1528898
 [1];  [2]
  1. California State Univ., Sacramento, CA (United States)
  2. Univ. of California, Davis, CA (United States)

Here, we provide an extensive survey on wide spectrum of scheduling methods for multitasking among graphics processing unit (GPU) computing tasks. We then design several schedulers and explain in detail the selected methods we have developed to implement our scheduling strategies. Next, we compare the performance of schedulers on various workloads running on Fermi and Kepler architectures and arrive at the following major conclusions: (1) Small kernels benefit from running kernels concurrently. (2) The combination of small kernels, high-priority kernels with longer runtimes, and lower-priority kernels with shorter runtimes benefits from a CPU scheduler that dynamically changes kernel order on the Fermi architecture. (3) Because of limitations of existing GPU architectures, currently CPU schedulers outperform their GPU counterparts. We also provide results and observations obtained from implementing and evaluating our schedulers on the NVIDIA Jetson TX1 system-on-chip architecture. We observe that although TX1 has the newer Maxwell architecture, the mechanism used for scheduler timings behaves differently on TX1 compared to Kepler leading to incorrect timings. In this paper, we describe our methods that allow us to report correct timings for CPU schedulers running on TX1. Lastly, we propose new research directions involving the investigation of additional scheduling strategies.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1528898
Journal Information:
Concurrency and Computation. Practice and Experience, Vol. 29, Issue 15; ISSN 1532-0626
Publisher:
WileyCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 1 work
Citation information provided by
Web of Science

References (20)

Softshell: dynamic scheduling on GPUs journal November 2012
The synchronous languages 12 years later journal January 2003
The ESTEREL language journal January 1991
OptiX: a general purpose ray tracing engine journal July 2010
GRAMPS: A programming model for graphics pipelines journal January 2009
The synchronous data flow programming language LUSTRE journal January 1991
Programming real-time applications with SIGNAL journal January 1991
Out-of-core Data Management for Path Tracing on Hybrid Resources journal April 2009
Real-Time Speed-Limit-Sign Recognition on an Embedded System Using a GPU book January 2011
Multitasking Real-time Embedded GPU Computing Tasks
  • Muyan-Özçelik, Pιnar; Owens, John D.
  • Proceedings of the 7th International Workshop on Programming Models and Applications for Multicores and Manycores - PMAM'16 https://doi.org/10.1145/2883404.2883408
conference January 2016
Fragment-Parallel Composite and Filter journal June 2010
Cooperative Multitasking for GPU-Accelerated Grid Systems conference May 2010
Efficiently Using a CUDA-enabled GPU as Shared Resource
  • Peters, Hagen; Köper, Martin; Luttenberger, Norbert
  • 2010 IEEE 10th International Conference on Computer and Information Technology (CIT), 2010 10th IEEE International Conference on Computer and Information Technology https://doi.org/10.1109/CIT.2010.204
conference June 2010
Understanding the efficiency of ray traversal on GPUs conference January 2009
Message passing on data-parallel architectures conference May 2009
GPU-to-CPU Callbacks book January 2011
Portable and transparent software managed scheduling on accelerators for fair resource sharing conference January 2016
PTask: operating system abstractions to manage GPUs as compute devices conference January 2011
Simultaneous Multikernel GPU: Multi-tasking throughput processors via fine-grained sharing conference March 2016
Analyzing CUDA workloads using a detailed GPU simulator conference April 2009

Similar Records

ODDS: Real-Time Object Detection Using Depth Sensors on Embedded GPUs
Conference · Wed Apr 11 00:00:00 EDT 2018 · 2018 17th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN) · OSTI ID:1528898

A performance model for GPUs with caches
Journal Article · Tue Jun 24 00:00:00 EDT 2014 · IEEE Transactions on Parallel and Distributed Systems · OSTI ID:1528898

Computational Particle Dynamic Simulations on Multicore Processors (CPDMu) Final Report Phase I
Technical Report · Sun Jul 24 00:00:00 EDT 2011 · OSTI ID:1528898