High performance graphics processor based computed tomography reconstruction algorithms for nuclear and other large scale applications.
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
The goal of this work is to develop a fast computed tomography (CT) reconstruction algorithm based on graphics processing units (GPUs) that achieves a significant improvement over traditional central processing unit (CPU) based implementations. The main challenge in developing a CT algorithm capable of handling very large datasets is parallelizing the algorithm so that data transfer does not hinder reconstruction performance. General-purpose computing on graphics processing units (GPGPU) is a new technology that the Science and Technology (S&T) community is starting to adopt in many fields where CPU-based computing is the norm. GPGPU programming requires a new approach to algorithm development, one that exploits massively multi-threaded environments. Multi-threaded algorithms are generally difficult to optimize because they encounter performance bottlenecks, such as memory latencies, that do not arise in single-threaded algorithms. If an efficient GPU-based CT reconstruction algorithm can be developed, computational times could be improved by a factor of 20. Additionally, cost benefits will be realized, as commodity graphics hardware could potentially replace expensive supercomputers and high-end workstations. This project will use the CUDA programming environment and attempt to parallelize the task so that multiple slices of the reconstruction volume are computed simultaneously. This work will also exploit GPU memory through asynchronous memory transfers, GPU texture memory, and (when possible) pinned host memory, so that the memory transfer bottleneck inherent to GPGPU is amortized. Finally, this work will take advantage of GPU-specific hardware (i.e., fast texture memory, pixel pipelines, hardware interpolators, and a varied memory hierarchy) that will allow for additional performance improvements.
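The slice-level decomposition described in the abstract can be illustrated with a small CPU stand-in. The sketch below is not the report's CUDA implementation: it assumes a parallel-beam geometry and plain unfiltered backprojection, and the names `interp_linear`, `backproject_slice`, and `reconstruct_volume` are hypothetical. It shows only that each slice depends on its own sinogram and can therefore be computed independently, with a software linear interpolation playing the role that a hardware texture interpolator would play on a GPU. Python threads give no real speedup for this CPU-bound loop; the thread pool marks where the parallelism lies.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def interp_linear(row, t):
    """Linearly interpolate a detector row at fractional index t (zero outside)."""
    if t < 0 or t > len(row) - 1:
        return 0.0
    i = int(math.floor(t))
    if i == len(row) - 1:
        return row[i]
    frac = t - i
    return (1.0 - frac) * row[i] + frac * row[i + 1]


def backproject_slice(sinogram, angles, size):
    """Unfiltered parallel-beam backprojection of one slice (assumed geometry).

    sinogram[a][d] is the detector reading at angle index a, detector bin d.
    Returns a size x size image as a list of rows.
    """
    n_det = len(sinogram[0])
    centre_px = (size - 1) / 2.0
    centre_det = (n_det - 1) / 2.0
    image = [[0.0] * size for _ in range(size)]
    for row, theta in zip(sinogram, angles):
        c, s = math.cos(theta), math.sin(theta)
        for y in range(size):
            for x in range(size):
                # Detector coordinate hit by the ray through pixel (x, y).
                t = (x - centre_px) * c + (y - centre_px) * s + centre_det
                image[y][x] += interp_linear(row, t)
    return image


def reconstruct_volume(sinograms, angles, size, workers=4):
    """Reconstruct every slice independently; slices share no mutable state."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda s: backproject_slice(s, angles, size), sinograms))
```

On a GPU, the per-pixel loop body would typically map to one thread per voxel and the per-slice work to separate batches or streams, which is where asynchronous transfers and pinned host memory would come into play; those details are outside what this sketch covers.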
- Research Organization:
- Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA)
- DOE Contract Number:
- AC04-94AL85000
- OSTI ID:
- 1121931
- Report Number(s):
- SAND2013--8059; 482802
- Country of Publication:
- United States
- Language:
- English
Similar Records
Development of a GPU-based high-performance radiative transfer model for the Infrared Atmospheric Sounding Interferometer (IASI)
- Journal Article · Sun Mar 20 00:00:00 EDT 2011 · Journal of Computational Physics · OSTI ID: 21499779
Optimizing High Performance Markov Clustering for Pre-Exascale Architectures
- Journal Article · Thu Apr 30 20:00:00 EDT 2020 · Proceedings - IEEE International Parallel and Distributed Processing Symposium (IPDPS) · OSTI ID: 1650092
RACB: Resource Aware Cache Bypass on GPUs
- Conference · Wed Oct 01 00:00:00 EDT 2014 · 2014 International Symposium on Computer Architecture and High Performance Computing Workshop; 22-24 Oct. 2014; Paris, France · OSTI ID: 1567596