OpenSHMEM Implementation of HPCG Benchmark
- ORNL
We describe the effort to implement the HPCG benchmark using OpenSHMEM and MPI one-sided communication. Unlike the High Performance LINPACK (HPL) benchmark that places em- phasis on large dense matrix computations, the HPCG benchmark is dominated by sparse operations such as sparse matrix-vector product, sparse matrix triangular solve, and long vector operations. The MPI one-sided implementation is developed using the one-sided OpenSHMEM implementation. Pre- liminary results comparing the original MPI, OpenSHMEM, and MPI one-sided implementations on an SGI cluster, Cray XK7 and Cray XC30 are presented. The results suggest the MPI, OpenSHMEM, and MPI one-sided implementations all obtain similar overall performance but the MPI one-sided im- plementation seems to slightly increase the run time for multigrid preconditioning in HPCG on the Cray XK7 and Cray XC30.
- Research Organization:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
- Sponsoring Organization:
- Work for Others (WFO)
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1346653
- Resource Relation:
- Conference: OpenSHMEM 2016: Third workshop on OpenSHMEM and Related Technologies., Baltimore, MD, USA, 20160802, 20160804
- Country of Publication:
- United States
- Language:
- English
Similar Records
OpenSHMEM Implementation of IOR Benchmark
Graph 500 in OpenSHMEM, In: OpenSHMEM 2014: OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies