DOE Patents, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Runtime optimization of an application executing on a parallel computer

Abstract

Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.
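
The decision procedure in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The set of root-based operations, the string form of a call site, and the names `is_root_based`, `should_tune`, and `run_collective` are all assumptions made for illustration. The sketch also models "all compute nodes identified the same call site" as a plain list of per-node call sites, whereas a real parallel runtime would have to reach that agreement by communicating among the nodes.

```python
from typing import Callable, Sequence

# Assumption: collectives treated as root-based for this sketch. In MPI,
# broadcast, reduce, gather, and scatter each take a root rank.
ROOT_BASED_OPS = {"broadcast", "reduce", "gather", "scatter"}


def is_root_based(op_name: str) -> bool:
    """Return True if the named collective has a designated root."""
    return op_name in ROOT_BASED_OPS


def should_tune(op_name: str, call_sites_by_node: Sequence[str]) -> bool:
    """Decide whether to establish a tuning session for one collective.

    call_sites_by_node holds the call site each compute node reported
    (hypothetical "file:line" strings), one entry per node.
    """
    if not is_root_based(op_name):
        # Not root-based: always tune.
        return True
    # Root-based: tune only if every node saw the same call site.
    return len(set(call_sites_by_node)) == 1


def run_collective(
    op_name: str,
    call_sites_by_node: Sequence[str],
    execute: Callable[[bool], str],
) -> str:
    """Execute the collective, inside a tuning session when permitted."""
    return execute(should_tune(op_name, call_sites_by_node))


def demo_execute(in_tuning_session: bool) -> str:
    # Stand-in for the real collective; reports which path was taken.
    return "tuned" if in_tuning_session else "untuned"
```

For example, `run_collective("broadcast", ["solver.c:42", "solver.c:97"], demo_execute)` takes the untuned path because the nodes disagree on the call site, while the same call with `"allreduce"` is tuned regardless of call site.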

Inventors:
Faraj, Daniel A.; Smith, Brian E.
Issue Date:
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1082649
Patent Number(s):
8365186
Application Number:
12/760,111
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
B554331
Resource Type:
Patent
Resource Relation:
Patent File Date: 2010 Apr 14
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Faraj, Daniel A., and Smith, Brian E. Runtime optimization of an application executing on a parallel computer. United States: N. p., 2013. Web.
Faraj, Daniel A., & Smith, Brian E. Runtime optimization of an application executing on a parallel computer. United States.
Faraj, Daniel A., and Smith, Brian E. Tue . "Runtime optimization of an application executing on a parallel computer". United States. https://www.osti.gov/servlets/purl/1082649.
@article{osti_1082649,
title = {Runtime optimization of an application executing on a parallel computer},
author = {Faraj, Daniel A. and Smith, Brian E.},
abstractNote = {Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.},
doi = {},
journal = {},
number = {},
volume = {},
place = {United States},
year = {2013},
month = {1}
}
