Analysis and optimization of gyrokinetic toroidal simulations on homogenous and heterogenous platforms
Journal Article
·
· International Journal of High Performance Computing Applications
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- Pennsylvania State Univ., University Park, PA (United States)
- Princeton Univ., NJ (United States). Princeton Inst. of Computational Science and Engineering
- Princeton Plasma Physics Lab. (PPPL), Princeton, NJ (United States)
The Gyrokinetic Toroidal Code (GTC) uses the particle-in-cell method to efficiently simulate plasma microturbulence. This paper presents novel analysis and optimization techniques to enhance the performance of GTC on large-scale machines. We introduce cell access analysis to better manage locality vs. synchronization tradeoffs on CPU and GPU-based architectures. Finally, our optimized hybrid parallel implementation of GTC uses MPI, OpenMP, and NVIDIA CUDA, achieves up to a 2× speedup over the reference Fortran version on multiple parallel systems, and scales efficiently to tens of thousands of cores.
- Research Organization:
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Grant/Contract Number:
- AC02-05CH11231; AC02-09CH11466; AC02-06CH11357; AC05-00OR22725
- OSTI ID:
- 1407202
- Journal Information:
- International Journal of High Performance Computing Applications, Vol. 27, Issue 4; ISSN 1094-3420
- Publisher:
- SAGECopyright Statement
- Country of Publication:
- United States
- Language:
- English
Cited by: 9 works
Citation information provided by
Web of Science
Web of Science
Similar Records
Gyrokinetic toroidal simulations on leading multi- and manycore HPC systems
Gyrokinetic particle-in-cell optimization on emerging multi- and manycore platforms
Quantum Monte Carlo Endstation for Petascale Computing
Conference
·
Sat Jan 01 00:00:00 EST 2011
·
OSTI ID:1407202
+4 more
Gyrokinetic particle-in-cell optimization on emerging multi- and manycore platforms
Journal Article
·
Wed Mar 02 00:00:00 EST 2011
· Parallel Computing
·
OSTI ID:1407202
+3 more
Quantum Monte Carlo Endstation for Petascale Computing
Technical Report
·
Wed Mar 02 00:00:00 EST 2011
·
OSTI ID:1407202