DOE Patents, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Runtime optimization of an application executing on a parallel computer

Abstract

Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.
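The decision flow in the abstract can be sketched in a few lines. This is a minimal illustration, not the patented implementation: the names `CollectiveCall`, `should_tune`, and `execute`, and the call-site strings, are hypothetical. It also assumes the call sites reported by every compute node are already gathered into one list, whereas a real parallel runtime would need its own agreement step across nodes.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class CollectiveCall:
    """One collective operation as seen across all compute nodes (hypothetical model)."""
    name: str                   # illustrative label, e.g. "allreduce" or "bcast"
    root_based: bool            # True if the collective has a designated root
    call_sites: Sequence[str]   # call site identified by each compute node


def should_tune(call: CollectiveCall) -> bool:
    """Return True when the abstract's rules call for a tuning session."""
    if not call.root_based:
        # Non-root-based collectives are always run in a tuning session.
        return True
    # Root-based collectives are tuned only if every node reports the same call site.
    return len(set(call.call_sites)) == 1


def execute(call: CollectiveCall,
            run_tuned: Callable[[CollectiveCall], str],
            run_plain: Callable[[CollectiveCall], str]) -> str:
    """Dispatch to the tuned or untuned execution path (both supplied by the caller)."""
    return run_tuned(call) if should_tune(call) else run_plain(call)


if __name__ == "__main__":
    calls = [
        CollectiveCall("allreduce", False, ["solver.c:10", "io.c:42"]),
        CollectiveCall("bcast", True, ["solver.c:88", "solver.c:88"]),
        CollectiveCall("bcast", True, ["solver.c:88", "solver.c:91"]),
    ]
    for c in calls:
        path = execute(c, lambda _: "tuning session", lambda _: "no tuning session")
        print(f"{c.name:9s} root_based={c.root_based!s:5s} -> {path}")
```

Only the third call skips tuning: it is root-based and its nodes disagree on the call site.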

Issue Date:
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1164339
Patent Number(s):
8898678
Application Number:
13/663,545
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:
B554331
Resource Type:
Patent
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Runtime optimization of an application executing on a parallel computer. United States: N. p., 2014. Web.
Runtime optimization of an application executing on a parallel computer. United States.
"Runtime optimization of an application executing on a parallel computer". United States. https://www.osti.gov/servlets/purl/1164339.
@article{osti_1164339,
title = {Runtime optimization of an application executing on a parallel computer},
author = {},
abstractNote = {Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.},
doi = {},
journal = {},
number = {},
volume = {},
place = {United States},
year = {2014},
month = {11}
}

Works referenced in this record:

Optimization of MPI collective communication on BlueGene/L systems
conference, January 2005


Collective communication on architectures that support simultaneous communication over multiple links
conference, January 2006

  • Chan, Ernie; van de Geijn, Robert; Gropp, William
  • Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '06
  • https://doi.org/10.1145/1122971.1122975

Performance analysis of parallel programs via message-passing graph traversal
conference, January 2006


Visual Programming for Message-Passing Systems
journal, August 1999


STAR-MPI: self tuned adaptive routines for MPI collective operations
conference, January 2006


The Autopilot performance-directed adaptive control system
journal, September 2001


Automated cluster-based web service performance tuning
conference, January 2004


An empirically derived framework for classifying parallel program performance tuning problems
conference, January 1998


Overview of the Blue Gene/L system architecture
journal, March 2005


Blue Gene/L torus interconnection network
journal, March 2005


Broadcasting on Meshes with Wormhole Routing
journal, June 1996


MPI Collective Communications on The Blue Gene/P Supercomputer: Algorithms and Optimizations
conference, August 2009

  • Faraj, Ahmad; Kumar, Sameer; Smith, Brian
  • 2009 17th Annual IEEE Symposium on High-Performance Interconnects (HOTI)
  • https://doi.org/10.1109/HOTI.2009.12

A Study of Process Arrival Patterns for MPI Collective Operations
journal, February 2008


Automatic generation and tuning of MPI collective communication routines
conference, January 2005


The Blue Gene/L Supercomputer: A Hardware and Software Story
journal, May 2007