skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Performance Analysis of a High-Level Abstractions-Based Hydrocode on Future Computing Systems

Journal Article · · Lecture Notes in Computer Science

In this paper we present research on applying a domain specific high-level abstractions (HLA) development strategy with the aim to “future-proof” a key class of high performance computing (HPC) applications that simulate hydrodynamics computations at AWE plc. We build on an existing high-level abstraction framework, OPS, that is being developed for the solution of multi-block structured mesh-based applications at the University of Oxford. OPS uses an “active library” approach where a single application code written using the OPS API can be transformed into different highly optimized parallel implementations which can then be linked against the appropriate parallel library enabling execution on different back-end hardware platforms. The target application in this work is the CloverLeaf mini-app from Sandia National Laboratory’s Mantevo suite of codes that consists of algorithms of interest from hydrodynamics workloads. Specifically, we present (1) the lessons learnt in re-engineering an industrial representative hydro-dynamics application to utilize the OPS high-level framework and subsequent code generation to obtain a range of parallel implementations, and (2) the performance of the auto-generated OPS versions of CloverLeaf compared to that of the performance of the hand-coded original CloverLeaf implementations on a range of platforms. Benchmarked systems include Intel multi-core CPUs and NVIDIA GPUs, the Archer (Cray XC30) CPU cluster and the Titan (Cray XK7) GPU cluster with different parallelizations (OpenMP, OpenACC, CUDA, OpenCL and MPI). Our results show that the development of parallel HPC applications using a high-level framework such as OPS is no more time consuming nor difficult than writing a one-off parallel program targeting only a single parallel implementation. However the OPS strategy pays off with a highly maintainable single application source, through which multiple parallelizations can be realized, without compromising performance portability on a range of parallel systems.

Research Organization:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1567379
Journal Information:
Lecture Notes in Computer Science, Vol. 8966
Country of Publication:
United States
Language:
English

References (11)

SBLOCK: A Framework for Efficient Stencil-Based PDE Solvers on Multi-core Platforms
  • Brandvik, Tobias; Pullan, Graham
  • 2010 IEEE 10th International Conference on Computer and Information Technology (CIT), 2010 10th IEEE International Conference on Computer and Information Technology https://doi.org/10.1109/CIT.2010.214
conference June 2010
Generative Programming and Active Libraries book September 2001
Liszt: a domain specific language for building portable mesh-based PDE solvers
  • DeVito, Zachary; Duraisamy, Karthik; Darve, Eric
  • Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '11 https://doi.org/10.1145/2063384.2063396
conference January 2011
A set of level 3 basic linear algebra subprograms journal March 1990
Deriving Efficient Data Movement from Decoupled Access/Execute Specifications
  • Howes, Lee W.; Lokhmotov, Anton; Donaldson, Alastair F.
  • High Performance Embedded Architectures and Compilers: Fourth International Conference, HiPEAC 2009, Paphos, Cyprus, January 25-28, 2009. Proceedings, p. 168-182 https://doi.org/10.1007/978-3-540-92990-1_14
book January 2009
Beyond Traditional Microprocessors for Geoscience High-Performance Computing Applications journal March 2011
Finite element assembly strategies on multi-core and many-core architectures: FINITE ELEMENT ASSEMBLY ON MULTI-CORE AND MANY-CORE ARCHITECTURES journal January 2012
Design and initial performance of a high-level unstructured mesh framework on heterogeneous parallel systems journal November 2013
Paraiso: an automated tuning framework for explicit solvers of partial differential equations journal January 2012
Ypnos: declarative, parallel structured grid programming
  • Orchard, Dominic A.; Bolingbroke, Max; Mycroft, Alan
  • Proceedings of the 5th ACM SIGPLAN workshop on Declarative aspects of multicore programming - DAMP '10 https://doi.org/10.1145/1708046.1708053
conference January 2010
The pochoir stencil compiler conference January 2011

Similar Records

Improving resilience of scientific software through a domain-specific approach
Journal Article · Sat Jun 01 00:00:00 EDT 2019 · Journal of Parallel and Distributed Computing · OSTI ID:1567379

Experiences in porting mini-applications to OpenACC and OpenMP on heterogeneous systems
Journal Article · Fri Apr 24 00:00:00 EDT 2020 · Concurrency and Computation. Practice and Experience · OSTI ID:1567379

Parallelization and checkpointing of GPU applications through program transformation
Thesis/Dissertation · Sun Jan 01 00:00:00 EST 2012 · OSTI ID:1567379

Related Subjects