
Title: Experiences in extending Parallware to support OpenACC. In: WACCPD '15 Proceedings of the Second Workshop on Accelerator Programming using Directives, Article No. 4

Conference
Authors: [1]; [2]; [3]
  1. Appentra Solutions, A Coruña (Spain)
  2. Appentra Solutions, A Coruña (Spain); Univ. of A Coruña (Spain)
  3. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)

Porting scientific codes to accelerator-based computers using OpenACC and OpenMP is an important topic for the HPC community. Programmability, performance portability, and developer productivity are key issues for the widespread use of these systems. In the scope of general-purpose parallel computing, Parallware is a new commercial source-to-source compiler that automatically adds OpenMP directives to scientific programs. Extending Parallware with OpenACC or OpenMP 4.x support would therefore help to improve programmability and developer productivity; however, the performance portability of such an approach still needs to be demonstrated in practice. This paper presents a preliminary study of extending Parallware with OpenACC support for GPU devices. A simple benchmark suite has been designed to mimic important features and computational patterns of real scientific applications. Hand-coded OpenACC versions are compared with OpenMP versions automatically generated by Parallware. Performance is evaluated with the PGI OpenACC compiler on systems accelerated with NVIDIA GPUs.
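
As an illustration of the two directive models compared in the paper, the sketch below annotates a simple SAXPY loop (a hypothetical example, not taken from the paper's benchmark suite) first with the kind of OpenMP pragma a source-to-source tool such as Parallware can emit for multicore CPUs, and then with a hand-written OpenACC pragma for GPU offloading.

  #include <stddef.h>

  /* Multicore version: the OpenMP directive distributes the
     independent iterations across CPU threads. */
  void saxpy_omp(size_t n, float a, const float *x, float *y) {
      #pragma omp parallel for
      for (size_t i = 0; i < n; ++i)
          y[i] = a * x[i] + y[i];
  }

  /* GPU version: the OpenACC directive offloads the loop, and the
     data clauses describe host-device transfers. */
  void saxpy_acc(size_t n, float a, const float *x, float *y) {
      #pragma acc parallel loop copyin(x[0:n]) copy(y[0:n])
      for (size_t i = 0; i < n; ++i)
          y[i] = a * x[i] + y[i];
  }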

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC)
OSTI ID:
1567642
Resource Relation:
Conference: Second Workshop on Accelerator Programming using Directives (WACCPD '15), Austin, Texas, November 15, 2015
Country of Publication:
United States
Language:
English

