A High-Performance Design for Hierarchical Parallelism in the QMCPACK Monte Carlo code
- Argonne National Laboratory
- ORNL
We introduce a new high-performance design for parallelism within the Quantum Monte Carlo code QMCPACK. We demonstrate that the new design is better able to exploit the hierarchical parallelism of heterogeneous architectures compared to the previous GPU implementation. The new version is able to achieve higher GPU occupancy via the new concept of crowds of Monte Carlo walkers, and by enabling more host CPU threads to effectively offload to the GPU. The higher performance is expected to be achieved independent of the underlying hardware, significantly improving developer productivity and reducing code maintenance costs. Scientific productivity is also improved with full support for fallback to CPU execution when GPU implementations are not available or CPU execution is more optimal.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE; USDOE Office of Science (SC)
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1963152
- Country of Publication:
- United States
- Language:
- English
Similar Records
Quantum Monte Carlo Endstation for Petascale Computing
The Metropolis Monte Carlo method with CUDA enabled Graphic Processing Units
Development of MGMC: A proxy Multi-Group Monte Carlo Particle Transport Application
Technical Report
·
Tue Mar 01 23:00:00 EST 2011
·
OSTI ID:1007216
The Metropolis Monte Carlo method with CUDA enabled Graphic Processing Units
Journal Article
·
Fri Jan 31 23:00:00 EST 2014
· Journal of Computational Physics
·
OSTI ID:22230865
Development of MGMC: A proxy Multi-Group Monte Carlo Particle Transport Application
Technical Report
·
Wed Jan 29 23:00:00 EST 2020
·
OSTI ID:1599007