skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Improved Accuracy for Automated Communication Pattern Characterization Using Communication Graphs and Aggressive Search Space Pruning

Conference ·

An understanding of a parallel application’s communication behavior is useful for a range of activities including debugging and optimization, job scheduling, target system selection, and system design. Because it can be challenging to understand communication behavior, especially for those who lack expertise or who are not familiar with the application, I and two colleagues recently developed an automated, search-based approach for recognizing and parameterizing application communication behavior using a library of common communication patterns. This initial approach was effective for characterizing the behavior of many workloads, but I identified some combinations of communication patterns for which the method was inefficient or would fail. In this paper, I discuss one such troublesome pattern combination and propose modifications to the recognition method to handle it. Specifically, I propose an alternative approach that uses communication graphs instead of traditional communication matrices to improve recognition accuracy for collective communication operations, and that uses a non-greedy recognition technique to avoid search space dead-ends that trap the original greedy recognition approach. My modified approach uses aggressive search space pruning and heuristics to control the potential for state explosion caused by its non-greedy pattern recognition method. I demonstrate the improved recognition accuracy and pruning efficacy of the modified approach using several synthetic and real-world communication pattern combinations.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1509589
Resource Relation:
Journal Volume: 11027; Conference: 6TH WORKSHOP ON EXTREME-SCALE PROGRAMMING TOOLS - Denver, Colorado, United States of America - 11/12/2017 5:00:00 AM-
Country of Publication:
United States
Language:
English

References (5)

The Paradyn parallel performance measurement tool journal January 1995
Automatic performance analysis with periscope journal January 2009
Fast Parallel Algorithms for Short-Range Molecular Dynamics journal March 1995
Automated Characterization of Parallel Application Communication Patterns
  • Roth, Philip C.; Meredith, Jeremy S.; Vetter, Jeffrey S.
  • Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing https://doi.org/10.1145/2749246.2749278
conference June 2015
Statistical scalability analysis of communication operations in distributed applications journal June 2001

Similar Records

Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges
Conference · Mon Apr 01 00:00:00 EDT 2019 · OSTI ID:1509589

Sixth International Conference on Systems Biology (ICSB 2005)
Technical Report · Sat Oct 22 00:00:00 EDT 2005 · OSTI ID:1509589

Genome‐enabled exploration of microbial ecology and evolution in the sea: a rising tide lifts all boats
Journal Article · Tue Feb 02 00:00:00 EST 2021 · Environmental Microbiology · OSTI ID:1509589

Related Subjects