Improved Accuracy for Automated Communication Pattern Characterization Using Communication Graphs and Aggressive Search Space Pruning
- ORNL
An understanding of a parallel application’s communication behavior is useful for a range of activities including debugging and optimization, job scheduling, target system selection, and system design. Because it can be challenging to understand communication behavior, especially for those who lack expertise or who are not familiar with the application, I and two colleagues recently developed an automated, search-based approach for recognizing and parameterizing application communication behavior using a library of common communication patterns. This initial approach was effective for characterizing the behavior of many workloads, but I identified some combinations of communication patterns for which the method was inefficient or would fail. In this paper, I discuss one such troublesome pattern combination and propose modifications to the recognition method to handle it. Specifically, I propose an alternative approach that uses communication graphs instead of traditional communication matrices to improve recognition accuracy for collective communication operations, and that uses a non-greedy recognition technique to avoid search space dead-ends that trap the original greedy recognition approach. My modified approach uses aggressive search space pruning and heuristics to control the potential for state explosion caused by its non-greedy pattern recognition method. I demonstrate the improved recognition accuracy and pruning efficacy of the modified approach using several synthetic and real-world communication pattern combinations.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1509589
- Resource Relation:
- Journal Volume: 11027; Conference: 6TH WORKSHOP ON EXTREME-SCALE PROGRAMMING TOOLS - Denver, Colorado, United States of America - 11/12/2017 5:00:00 AM-
- Country of Publication:
- United States
- Language:
- English
The Paradyn parallel performance measurement tool
|
journal | January 1995 |
Automatic performance analysis with periscope
|
journal | January 2009 |
Fast Parallel Algorithms for Short-Range Molecular Dynamics
|
journal | March 1995 |
Automated Characterization of Parallel Application Communication Patterns
|
conference | June 2015 |
Statistical scalability analysis of communication operations in distributed applications
|
journal | June 2001 |
Similar Records
Sixth International Conference on Systems Biology (ICSB 2005)
Genome‐enabled exploration of microbial ecology and evolution in the sea: a rising tide lifts all boats