Generating Performance Models for Irregular Applications
Many applications have irregular behavior --- non-uniform input data, input-dependent solvers, irregular memory accesses, unbiased branches --- that cannot be captured using today's automated performance modeling techniques. We describe new hierarchical critical path analyses for the \Palm model generation tool. To create a model's structure, we capture tasks along representative MPI critical paths. We create a histogram of critical tasks with parameterized task arguments and instance counts. To model each task, we identify hot instruction-level sub-paths and model each sub-path based on data flow, instruction scheduling, and data locality. We describe application models that generate accurate predictions for strong scaling when varying CPU speed, cache speed, memory speed, and architecture. We present results for the Sweep3D neutron transport benchmark; Page Rank on multiple graphs; Support Vector Machine with pruning; and PFLOTRAN's reactive flow/transport solver with domain-induced load imbalance.
- Research Organization:
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1373859
- Report Number(s):
- PNNL-SA-123945; KJ0404000
- Resource Relation:
- Conference: IEEE International Parallel and Distributed Processing Symposium (IPDPS 2017), May 29-June 2, 2017, Orlando, Florida, 317-326
- Country of Publication:
- United States
- Language:
- English
Similar Records
Center for Technology for Advanced Scientific Componet Software (TASCS)
Quantum Monte Carlo Endstation for Petascale Computing