skip to main content

DOE PAGESDOE PAGES

Title: Analysis and optimization of gyrokinetic toroidal simulations on homogenous and heterogenous platforms

The Gyrokinetic Toroidal Code (GTC) uses the particle-in-cell method to efficiently simulate plasma microturbulence. This paper presents novel analysis and optimization techniques to enhance the performance of GTC on large-scale machines. We introduce cell access analysis to better manage locality vs. synchronization tradeoffs on CPU and GPU-based architectures. Finally, our optimized hybrid parallel implementation of GTC uses MPI, OpenMP, and NVIDIA CUDA, achieves up to a 2× speedup over the reference Fortran version on multiple parallel systems, and scales efficiently to tens of thousands of cores.
Authors:
 [1] ;  [2] ;  [1] ;  [3] ;  [4] ;  [1]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  2. Pennsylvania State Univ., University Park, PA (United States)
  3. Princeton Univ., NJ (United States). Princeton Inst. of Computational Science and Engineering
  4. Princeton Plasma Physics Lab. (PPPL), Princeton, NJ (United States)
Publication Date:
Grant/Contract Number:
AC02-05CH11231; AC02-09CH11466; AC02-06CH11357; AC05-00OR22725
Type:
Accepted Manuscript
Journal Name:
International Journal of High Performance Computing Applications
Additional Journal Information:
Journal Volume: 27; Journal Issue: 4; Journal ID: ISSN 1094-3420
Publisher:
SAGE
Research Org:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Princeton Plasma Physics Lab. (PPPL), Princeton, NJ (United States)
Sponsoring Org:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING
OSTI Identifier:
1407202

Ibrahim, Khaled Z., Madduri, Kamesh, Williams, Samuel, Wang, Bei, Ethier, Stephane, and Oliker, Leonid. Analysis and optimization of gyrokinetic toroidal simulations on homogenous and heterogenous platforms. United States: N. p., Web. doi:10.1177/1094342013492446.
Ibrahim, Khaled Z., Madduri, Kamesh, Williams, Samuel, Wang, Bei, Ethier, Stephane, & Oliker, Leonid. Analysis and optimization of gyrokinetic toroidal simulations on homogenous and heterogenous platforms. United States. doi:10.1177/1094342013492446.
Ibrahim, Khaled Z., Madduri, Kamesh, Williams, Samuel, Wang, Bei, Ethier, Stephane, and Oliker, Leonid. 2013. "Analysis and optimization of gyrokinetic toroidal simulations on homogenous and heterogenous platforms". United States. doi:10.1177/1094342013492446. https://www.osti.gov/servlets/purl/1407202.
@article{osti_1407202,
title = {Analysis and optimization of gyrokinetic toroidal simulations on homogenous and heterogenous platforms},
author = {Ibrahim, Khaled Z. and Madduri, Kamesh and Williams, Samuel and Wang, Bei and Ethier, Stephane and Oliker, Leonid},
abstractNote = {The Gyrokinetic Toroidal Code (GTC) uses the particle-in-cell method to efficiently simulate plasma microturbulence. This paper presents novel analysis and optimization techniques to enhance the performance of GTC on large-scale machines. We introduce cell access analysis to better manage locality vs. synchronization tradeoffs on CPU and GPU-based architectures. Finally, our optimized hybrid parallel implementation of GTC uses MPI, OpenMP, and NVIDIA CUDA, achieves up to a 2× speedup over the reference Fortran version on multiple parallel systems, and scales efficiently to tens of thousands of cores.},
doi = {10.1177/1094342013492446},
journal = {International Journal of High Performance Computing Applications},
number = 4,
volume = 27,
place = {United States},
year = {2013},
month = {7}
}