Title: Parallel Performance Optimizations on Unstructured Mesh-based Simulations

Conference · Procedia Computer Science

© The Authors. Published by Elsevier B.V. This paper addresses two key parallelization challenges in the unstructured mesh-based ocean modeling code MPAS-Ocean, which uses a mesh based on Voronoi tessellations: (1) load imbalance across processes, and (2) unstructured data access patterns that inhibit intra- and inter-node performance. Our work analyzes the load imbalance caused by naive partitioning of the mesh and develops methods to generate mesh partitions with better load balance and reduced communication. Furthermore, we present methods that minimize both inter- and intra-node data movement and maximize data reuse. Our techniques include predictive ordering of data elements for higher cache efficiency, as well as communication reduction approaches. We present detailed performance data from runs on thousands of cores of the Cray XC30 supercomputer and show that our optimization strategies can exceed the original performance by over 2×. Additionally, many of these solutions can be broadly applied to a wide variety of unstructured grid-based computations.
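
The two ideas named in the abstract, load balance across partitions and locality-aware ordering of mesh elements, can be illustrated with a minimal Python sketch. This is not the authors' method and not MPAS-Ocean code. It assumes a flat list mapping each cell to a partition, optional per-cell work weights, and planar (x, y) cell centers, whereas the real mesh lies on a sphere. It also swaps in a plain Z-order (Morton) curve as a generic stand-in for the paper's ordering technique. All function names are hypothetical.

```python
def imbalance_factor(part_of_cell, num_parts, weights=None):
    """Return max partition load / mean partition load (1.0 means perfectly balanced).

    part_of_cell[i] is the partition id of cell i (assumed layout, for illustration).
    weights[i] is an optional work estimate for cell i; defaults to 1 per cell.
    """
    loads = [0.0] * num_parts
    for cell, part in enumerate(part_of_cell):
        loads[part] += 1.0 if weights is None else weights[cell]
    mean_load = sum(loads) / num_parts
    return max(loads) / mean_load


def morton_key(ix, iy, bits=16):
    """Interleave the low `bits` bits of two non-negative integers (Z-order key)."""
    key = 0
    for b in range(bits):
        key |= ((ix >> b) & 1) << (2 * b)
        key |= ((iy >> b) & 1) << (2 * b + 1)
    return key


def morton_order(points, bits=16):
    """Return cell indices sorted along a Z-order curve over planar (x, y) points.

    Coordinates are quantized onto a 2^bits by 2^bits grid spanning the bounding box,
    so cells that are close in space tend to end up close in the returned order.
    """
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    x0, y0 = min(xs), min(ys)
    span_x = (max(xs) - x0) or 1.0  # avoid division by zero for degenerate input
    span_y = (max(ys) - y0) or 1.0
    top = (1 << bits) - 1

    def key(i):
        ix = int((xs[i] - x0) / span_x * top)
        iy = int((ys[i] - y0) / span_y * top)
        return morton_key(ix, iy, bits)

    return sorted(range(len(points)), key=key)
```

In this sketch, an imbalance factor well above 1.0 flags a partition whose busiest process does far more work than the average. Permuting per-cell arrays by the index list from `morton_order` is one simple way to keep spatial neighbors near each other in memory.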

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
DOE Contract Number:
AC02-05CH11231
OSTI ID:
1407282
Journal Information:
Procedia Computer Science, Vol. 51, Issue C; Conference: International Conference On Computational Science (ICCS 2015), Reykjavík (Iceland), 1-3 Jun 2015; ISSN 1877-0509
Publisher:
Elsevier
Country of Publication:
United States
Language:
English
