Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Runtime optimization of an application executing on a parallel computer

Patent ·
OSTI ID:1164339
Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.
Research Organization:
International Business Machines Corporation, Armonk, NY (United States)
Sponsoring Organization:
USDOE
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Number(s):
8,898,678
Application Number:
13/663,545
OSTI ID:
1164339
Country of Publication:
United States
Language:
English

References (16)

An empirically derived framework for classifying parallel program performance tuning problems conference January 1998
Overview of the Blue Gene/L system architecture journal March 2005
Automated cluster-based web service performance tuning conference January 2004
STAR-MPI: self tuned adaptive routines for MPI collective operations conference January 2006
Optimization of MPI collective communication on BlueGene/L systems conference January 2005
Performance analysis of parallel programs via message-passing graph traversal conference January 2006
Automatic generation and tuning of MPI collective communication routines conference January 2005
The Blue Gene/L Supercomputer: A Hardware and Software Story journal May 2007
The Autopilot performance-directed adaptive control system journal September 2001
Broadcasting on Meshes with Wormhole Routing journal June 1996
A Study of Process Arrival Patterns for MPI Collective Operations journal February 2008
Blue Gene/L torus interconnection network journal March 2005
MPI Collective Communications on The Blue Gene/P Supercomputer: Algorithms and Optimizations
  • Faraj, Ahmad; Kumar, Sameer; Smith, Brian
  • 2009 17th Annual IEEE Symposium on High-Performance Interconnects (HOTI), 2009 17th IEEE Symposium on High Performance Interconnects https://doi.org/10.1109/HOTI.2009.12
conference August 2009
Collective communication on architectures that support simultaneous communication over multiple links
  • Chan, Ernie; van de Geijn, Robert; Gropp, William
  • Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '06 https://doi.org/10.1145/1122971.1122975
conference January 2006
Web Information Systems Engineering – WISE 2005 book January 2005
Visual Programming for Message-Passing Systems journal August 1999