skip to main content

DOE PAGESDOE PAGES

Title: MPI Runtime Error Detection with MUST: Advances in Deadlock Detection

The widely used Message Passing Interface (MPI) is complex and rich. As a result, application developers require automated tools to avoid and to detect MPI programming errors. We present the Marmot Umpire Scalable Tool (MUST) that detects such errors with significantly increased scalability. We present improvements to our graph-based deadlock detection approach for MPI, which cover future MPI extensions. Our enhancements also check complex MPI constructs that no previous graph-based detection approach handled correctly. Finally, we present optimizations for the processing of MPI operations that reduce runtime deadlock detection overheads. Existing approaches often require 𝒪( p ) analysis time per MPI operation, for p processes. We empirically observe that our improvements lead to sub-linear or better analysis time per operation for a wide range of real world applications.
Authors:
 [1] ;  [2] ;  [3] ;  [3] ;  [2]
  1. Technische Universität Dresden, Dresden, Germany
  2. Technische Universität Dresden, Dresden, Germany, RWTH Aachen University, Aachen, Germany, JARA – High Performance Computing, Aachen, Germany
  3. Lawrence Livermore National Laboratory, Livermore, CA, USA
Publication Date:
OSTI Identifier:
1197888
Grant/Contract Number:
AC52-07NA27344; 287703
Type:
Published Article
Journal Name:
Scientific Programming
Additional Journal Information:
Journal Volume: 21; Journal Issue: 3-4; Related Information: CHORUS Timestamp: 2016-08-23 03:55:07; Journal ID: ISSN 1058-9244
Publisher:
Hindawi Publishing Corporation
Sponsoring Org:
USDOE
Country of Publication:
Egypt
Language:
English