Performance Analysis of Distributed Applications using Automatic Classification of Communication Inefficiencies
We present a performance analysis technique that helps users understand the communication behavior of their message-passing applications. Our method automatically classifies individual communication operations and reveals the causes of communication inefficiencies in the application. This classification lets the developer focus quickly on the culprits of truly inefficient behavior rather than manually foraging through massive amounts of performance data. Specifically, we trace the message operations of MPI applications and then classify each communication event using decision tree classification, a supervised learning technique. We train the decision tree on microbenchmarks that demonstrate both efficient and inefficient communication. Because the technique adapts to the target system's configuration through these microbenchmarks, we can simultaneously automate the performance analysis process and improve classification accuracy. Our experiments on four applications demonstrate that the technique can improve the accuracy of performance analysis and dramatically reduce the amount of data that users must examine.
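The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each traced communication event has been reduced to two hypothetical features (the fraction of the operation's time spent waiting on the sender side and on the receiver side) and that microbenchmark runs supply labeled examples. The class names `normal`, `late_send`, and `late_recv`, the feature values, and all function names are assumptions made for illustration. The tree is a small CART-style learner using Gini impurity.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels):
    """Return (weighted impurity, feature index, threshold) or None."""
    best = None
    for f in range(len(rows[0])):
        values = sorted(set(r[f] for r in rows))
        # Candidate thresholds lie midway between adjacent distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            left = [l for r, l in zip(rows, labels) if r[f] <= t]
            right = [l for r, l in zip(rows, labels) if r[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
            if best is None or score < best[0]:
                best = (score, f, t)
    return best

def build_tree(rows, labels):
    """Grow a tree: a leaf is a label, an inner node is (feature, threshold, left, right)."""
    if len(set(labels)) == 1:
        return labels[0]
    split = best_split(rows, labels)
    if split is None:  # identical feature vectors with mixed labels
        return Counter(labels).most_common(1)[0][0]
    _, f, t = split
    left = [(r, l) for r, l in zip(rows, labels) if r[f] <= t]
    right = [(r, l) for r, l in zip(rows, labels) if r[f] > t]
    return (f, t,
            build_tree([r for r, _ in left], [l for _, l in left]),
            build_tree([r for r, _ in right], [l for _, l in right]))

def classify(tree, row):
    """Walk from the root to a leaf and return the leaf's label."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if row[f] <= t else right
    return tree

# Hypothetical microbenchmark results: (sender_wait, receiver_wait) -> label.
TRAIN = [
    ((0.05, 0.04), "normal"), ((0.10, 0.08), "normal"), ((0.08, 0.12), "normal"),
    ((0.06, 0.70), "late_send"), ((0.09, 0.85), "late_send"), ((0.12, 0.60), "late_send"),
    ((0.75, 0.05), "late_recv"), ((0.90, 0.10), "late_recv"), ((0.65, 0.07), "late_recv"),
]
tree = build_tree([r for r, _ in TRAIN], [l for _, l in TRAIN])

# Hypothetical traced events from an application run.
events = [(0.07, 0.05), (0.07, 0.78), (0.80, 0.06), (0.04, 0.09)]
flagged = [(e, classify(tree, e)) for e in events if classify(tree, e) != "normal"]
```

In this sketch, `flagged` keeps only the events the tree labels as inefficient, which mirrors the abstract's point that classification reduces the volume of trace data a user has to inspect. Retraining on microbenchmarks run on a different machine would yield different thresholds, which is how such a scheme could adapt to the target system.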
- Research Organization: Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
- Sponsoring Organization: USDOE Office of Defense Programs (DP) (US)
- DOE Contract Number: W-7405-Eng-48
- OSTI ID: 793599
- Report Number(s): UCRL-JC-136200; TRN: US200222%%334
- Resource Relation: Conference: International Conference on Measurement and Modeling of Computer Systems, Santa Clara, CA (US), 06/18/2000--06/21/2000; Other Information: PBD: 1 Nov 1999
- Country of Publication: United States
- Language: English