Hands-on Performance Tuning of 3D Finite Difference Earthquake Simulation on GPU Fermi Chipset
- Univ. of California, San Diego, CA (United States)
3D simulation of earthquake ground motion is one of the most challenging computational problems in science. The emergence of graphic processing units (GPU) as an effective alternative to traditional general purpose processors has become increasingly capable in terms of accelerating scientific computing research. In this paper, we describe our experiences in porting AWP-ODC, a 3D finite difference seismic wave propagation code, to the latest GPU Fermi chipset. We completely rewrote this Fortran-based 13-point asymmetric stencil computation code in C and MPI-CUDA in order to take advantage of the powerful GPU computing capabilities. Our new CUDA code implemented the asymmetric 3D stencil on Fermi to make the best use of GPU on-chip memory for an aggressive parallel efficiency. Benchmark on NVIDIA Tesla M2090 demonstrated 10x speedup versus the original fully optimized AWP-ODC FORTRAN MPI code running on a single Intel Nehalem 2.4 GHz CPU socket (4 cores/CPU), and 15x speedup versus the same MPI code running on a single AMD Istanbul 2.6 GHz CPU socket (6 cores/CPU). Sustained single-GPU performance of 143.8 GFLOPS in single precision is benchmarked for the testing case of 128x128x960 mesh size.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
- Sponsoring Organization:
- USDOE Office of Science (SC)
- OSTI ID:
- 1567289
- Journal Information:
- Procedia Computer Science, Vol. 9, Issue C; ISSN 1877-0509
- Publisher:
- ElsevierCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Web of Science
Improving Performance and Energy Efficiency of Geophysics Applications on GPU Architectures
|
book | January 2019 |
Geometric validation of a computer simulator used in radiography education
|
journal | November 2020 |
Multi-GPU Implementation of a 3D Finite Difference Time Domain Earthquake Code on Heterogeneous Supercomputers
|
journal | January 2013 |
Towards GPU Acceleration of Phonon Computation with ShengBTE
|
conference | January 2020 |
Similar Records
Quantum Monte Carlo Endstation for Petascale Computing
Gyrokinetic toroidal simulations on leading multi- and manycore HPC systems