Performance Analysis of a High-Level Abstractions-Based Hydrocode on Future Computing Systems

None, None

doi:10.1007/978-3-319-17248-4_5

Title: Performance Analysis of a High-Level Abstractions-Based Hydrocode on Future Computing Systems

Journal Article · · Lecture Notes in Computer Science

DOI:https://doi.org/10.1007/978-3-319-17248-4_5· OSTI ID:1567379

In this paper we present research on applying a domain specific high-level abstractions (HLA) development strategy with the aim to “future-proof” a key class of high performance computing (HPC) applications that simulate hydrodynamics computations at AWE plc. We build on an existing high-level abstraction framework, OPS, that is being developed for the solution of multi-block structured mesh-based applications at the University of Oxford. OPS uses an “active library” approach where a single application code written using the OPS API can be transformed into different highly optimized parallel implementations which can then be linked against the appropriate parallel library enabling execution on different back-end hardware platforms. The target application in this work is the CloverLeaf mini-app from Sandia National Laboratory’s Mantevo suite of codes that consists of algorithms of interest from hydrodynamics workloads. Specifically, we present (1) the lessons learnt in re-engineering an industrial representative hydro-dynamics application to utilize the OPS high-level framework and subsequent code generation to obtain a range of parallel implementations, and (2) the performance of the auto-generated OPS versions of CloverLeaf compared to that of the performance of the hand-coded original CloverLeaf implementations on a range of platforms. Benchmarked systems include Intel multi-core CPUs and NVIDIA GPUs, the Archer (Cray XC30) CPU cluster and the Titan (Cray XK7) GPU cluster with different parallelizations (OpenMP, OpenACC, CUDA, OpenCL and MPI). Our results show that the development of parallel HPC applications using a high-level framework such as OPS is no more time consuming nor difficult than writing a one-off parallel program targeting only a single parallel implementation. However the OPS strategy pays off with a highly maintainable single application source, through which multiple parallelizations can be realized, without compromising performance portability on a range of parallel systems.

Cite

Export

Save

Research Organization:: Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)

Sponsoring Organization:: USDOE Office of Science (SC)

DOE Contract Number:: AC05-00OR22725

OSTI ID:: 1567379

Journal Information:: Lecture Notes in Computer Science, Vol. 8966

Country of Publication:: United States

Language:: English

References (11)

SBLOCK: A Framework for Efficient Stencil-Based PDE Solvers on Multi-core Platforms Brandvik, Tobias; Pullan, Graham 2010 IEEE 10th International Conference on Computer and Information Technology (CIT), 2010 10th IEEE International Conference on Computer and Information Technology https://doi.org/10.1109/CIT.2010.214	conference	June 2010
Generative Programming and Active Libraries Czarnecki, Krzysztof; Eisenecker, Ulrich; Glück, Robert Generic Programming, p. 25-39 https://doi.org/10.1007/3-540-39953-4_3	book	September 2001
Liszt: a domain specific language for building portable mesh-based PDE solvers DeVito, Zachary; Duraisamy, Karthik; Darve, Eric Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '11 https://doi.org/10.1145/2063384.2063396	conference	January 2011
A set of level 3 basic linear algebra subprograms Dongarra, J. J.; Du Croz, Jeremy; Hammarling, Sven ACM Transactions on Mathematical Software, Vol. 16, Issue 1 https://doi.org/10.1145/77626.79170	journal	March 1990
Deriving Efficient Data Movement from Decoupled Access/Execute Specifications Howes, Lee W.; Lokhmotov, Anton; Donaldson, Alastair F. High Performance Embedded Architectures and Compilers: Fourth International Conference, HiPEAC 2009, Paphos, Cyprus, January 25-28, 2009. Proceedings, p. 168-182 https://doi.org/10.1007/978-3-540-92990-1_14	book	January 2009
Beyond Traditional Microprocessors for Geoscience High-Performance Computing Applications Lindtjorn, Olav; Clapp, Robert; Pell, Oliver IEEE Micro, Vol. 31, Issue 2 https://doi.org/10.1109/MM.2011.17	journal	March 2011
Finite element assembly strategies on multi-core and many-core architectures: FINITE ELEMENT ASSEMBLY ON MULTI-CORE AND MANY-CORE ARCHITECTURES Markall, G. R.; Slemmer, A.; Ham, D. A. International Journal for Numerical Methods in Fluids, Vol. 71, Issue 1 https://doi.org/10.1002/fld.3648	journal	January 2012
Design and initial performance of a high-level unstructured mesh framework on heterogeneous parallel systems Mudalige, G. R.; Giles, M. B.; Thiyagalingam, J. Parallel Computing, Vol. 39, Issue 11 https://doi.org/10.1016/j.parco.2013.09.004	journal	November 2013
Paraiso: an automated tuning framework for explicit solvers of partial differential equations Muranushi, Takayuki Computational Science & Discovery, Vol. 5, Issue 1 https://doi.org/10.1088/1749-4699/5/1/015003	journal	January 2012
Ypnos: declarative, parallel structured grid programming Orchard, Dominic A.; Bolingbroke, Max; Mycroft, Alan Proceedings of the 5th ACM SIGPLAN workshop on Declarative aspects of multicore programming - DAMP '10 https://doi.org/10.1145/1708046.1708053	conference	January 2010
The pochoir stencil compiler Tang, Yuan; Chowdhury, Rezaul Alam; Kuszmaul, Bradley C. Proceedings of the 23rd ACM symposium on Parallelism in algorithms and architectures - SPAA '11 https://doi.org/10.1145/1989493.1989508	conference	January 2011

Similar Records

Improving resilience of scientific software through a domain-specific approach

Journal Article · Sat Jun 01 00:00:00 EDT 2019 · Journal of Parallel and Distributed Computing · OSTI ID:1567379

Reguly, I. Z.; Mudalige, G. R.; Giles, M. B.; +1 more

Experiences in porting mini-applications to OpenACC and OpenMP on heterogeneous systems

Journal Article · Fri Apr 24 00:00:00 EDT 2020 · Concurrency and Computation. Practice and Experience · OSTI ID:1567379

Vergara Larrea, Verónica G.; Budiardja, Reuben D.; Gayatri, Rahulkumar; +3 more

Parallelization and checkpointing of GPU applications through program transformation

Thesis/Dissertation · Sun Jan 01 00:00:00 EST 2012 · OSTI ID:1567379

Solano-Quinde, Lizandro Damian

Related Subjects

Computer Science

Title: Performance Analysis of a High-Level Abstractions-Based Hydrocode on Future Computing Systems

Citation Formats

References (11)

Similar Records

Related Subjects