Efficient Parallelization of Irregular Applications on GPU Architectures
Thesis/Dissertation
·
OSTI ID:2349242
- College of William and Mary, Williamsburg, VA (United States)
With the enlarging computation capacity of general Graphics Processing Units (GPUs), leveraging GPUs to accelerate parallel applications has become a critical topic in academia and industry. However, a wide range of irregular applications with the computation-/memory-intensive nature cannot easily achieve high GPU utilization. The challenges mainly involve the following aspects: first, data dependence leads to coarse-grained kernel and inefficient parallelism; second, heavy GPU memory usage may cause frequent memory evictions and extra overhead of I/O; third, specific computation patterns produce memory redundancies; last, workload balance and data reusability conjunctly benefit the overall performance, but there may exist a dynamic trade-off between them. Targeting these challenges, this dissertation proposes multiple optimizations to accelerate two real-world applications: many-body correlation functions to simulate nuclear physics in a large-scale scientific system; the other is the eALS-based matrix factorization recommendation system. To accelerate the calculations of many-body correlation functions, this dissertation presents three frameworks in GPU memory management and multi-GPU scheduling. Firstly, an optimized systematic GPU memory management framework, MemHC, utilizes a series of new memory reduction designs in GPU memory allocation, CPU/GPU communications, and GPU memory oversubscription. Secondly, an enhanced multi-GPU scheduling framework, MICCO, particularly by taking both data dimension (e.g., data reuse and data eviction) and computation dimension into account. MICCO designs a heuristic scheduling algorithm and a machine learning-based regression model to generate the optimal settings of a proposed new concept to manage the trade-off. Thirdly, a locality-aware multi-GPU scheduling framework. This scheduler leverages pipeline batch generation with a looking-ahead strategy by building local dependency graphs for memory transfer reduction and better data reuse, achieving up to 79.92% memory cost reduction and 1.67x speedup. To parallelize the eALS-based recommendation system, this dissertation proposes an efficient CPU/GPU heterogeneous recommendation system, HEALS. HEALS employs newly designed architecture-adaptive data formats to achieve load balance and good data locality on CPU and GPU. To mitigate the data dependence, HEALS presents a CPU/GPU collaboration model for both task parallelism and data parallelism with multiple kernel computation optimizations. In summary, this dissertation efficiently accelerates two typical irregular applications on GPUs by building four frameworks, including CPU/GPU collaboration, GPU memory management, and multi-GPU scheduling.
- Research Organization:
- Thomas Jefferson National Accelerator Facility, Newport News, VA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Nuclear Physics (NP)
- DOE Contract Number:
- AC05-06OR23177
- OSTI ID:
- 2349242
- Report Number(s):
- JLAB-CST-24-4038; DOE/OR/23177-7472
- Country of Publication:
- United States
- Language:
- English
Similar Records
MemHC: An Optimized GPU Memory Management Framework for Accelerating Many-body Correlation
MICCO: An Enhanced Multi-GPU Scheduling Framework for Many-Body Correlation Functions
Efficient Machine Learning Approach for Optimizing Scientific Computing Applications on Emerging HPC Architectures
Journal Article
·
Wed Mar 23 20:00:00 EDT 2022
· ACM Transactions on Architecture and Code Optimization
·
OSTI ID:1867362
MICCO: An Enhanced Multi-GPU Scheduling Framework for Many-Body Correlation Functions
Conference
·
Sun May 01 00:00:00 EDT 2022
· 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
·
OSTI ID:1886910
Efficient Machine Learning Approach for Optimizing Scientific Computing Applications on Emerging HPC Architectures
Thesis/Dissertation
·
Mon May 01 00:00:00 EDT 2017
·
OSTI ID:1422715