DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Runtime optimization of an application executing on a parallel computer

Abstract

Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.

Inventors:
;
Issue Date:
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1082649
Patent Number(s):
8365186
Application Number:
12/760,111
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
B554331
Resource Type:
Patent
Resource Relation:
Patent File Date: 2010 Apr 14
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Faraj, Daniel A., and Smith, Brian E. Runtime optimization of an application executing on a parallel computer. United States: N. p., 2013. Web.
Faraj, Daniel A., & Smith, Brian E. Runtime optimization of an application executing on a parallel computer. United States.
Faraj, Daniel A., and Smith, Brian E. Tue . "Runtime optimization of an application executing on a parallel computer". United States. https://www.osti.gov/servlets/purl/1082649.
@article{osti_1082649,
title = {Runtime optimization of an application executing on a parallel computer},
author = {Faraj, Daniel A. and Smith, Brian E.},
abstractNote = {Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Tue Jan 29 00:00:00 EST 2013},
month = {Tue Jan 29 00:00:00 EST 2013}
}

Works referenced in this record:

Visualization tool for graphically displaying trace data
patent, January 1999


Parallel computer
patent, September 1998


Power consumption-based thread scheduling
patent-application, May 2006


Method for pacing data flow in a packet-based network
patent, September 2000


Collaborative host masquerading system
patent-application, May 2002


MPI Collective Communications on The Blue Gene/P Supercomputer: Algorithms and Optimizations
conference, August 2009

  • Faraj, Ahmad; Kumar, Sameer; Smith, Brian
  • 2009 17th Annual IEEE Symposium on High-Performance Interconnects (HOTI), 2009 17th IEEE Symposium on High Performance Interconnects
  • https://doi.org/10.1109/HOTI.2009.12

Distributed model compilation
patent, March 2009


Method of determining graph isomorphism in polynomial-time
patent-application, August 2007


Data processing system and method for pacing information transfers in a communications network
patent, June 1999


Management system and method for parallel computer system
patent, August 1999


Performance mechanism for presenting integrated information in a graphical user interface
patent-application, June 2004


Method and apparatus for source rate pacing in an ATM network
patent, September 1999


Performance analysis of parallel programs via message-passing graph traversal
conference, January 2006


Communicating data between TDM and packet based networks
patent-application, January 2003


Treatment of Data in Networks
patent-application, October 2008


Profiling wide-area networks using peer cooperation
patent-application, September 2006


Controlling bandwidth allocation using a pace counter
patent, November 1999


Visual Programming for Message-Passing Systems
journal, August 1999


Coherent data sharing
patent, May 2009


Method and apparatus for pacing asynchronous transfer mode (ATM) data cell transmission
patent, February 1996


Determining a communication schedule between processors
patent, June 2001


Routing protocol based redundancy design for shared-access networks
patent, January 2005


The Blue Gene/L Supercomputer: A Hardware and Software Story
journal, May 2007


Method for deadlock-free message passing in MIMD systems using routers and buffers
patent, January 1999


Using global memory information to manage memory in a computer network
patent, December 2000


Lease based safety protocol for distributed system with multiple networks
patent, August 2004


Flow control for high speed networks
patent, November 1991


Automatic generation and tuning of MPI collective communication routines
conference, January 2005


Automatic tuning of communication protocol performance
patent-application, May 2009


Determining a communication schedule between processors
patent, October 2000


A Study of Process Arrival Patterns for MPI Collective Operations
journal, February 2008


Visualization tool for graphically displaying trace data produced by a parallel processing computer
patent, May 2000


    Works referencing / citing this record:

    Initiating a collective operation in a parallel computer
    patent, April 2014


    Developing collective operations for a parallel computer
    patent, November 2016


    Developing collective operations for a parallel computer
    patent, November 2016


    Performing a deterministic reduction operation in a parallel computer
    patent, February 2015


    Performing a global barrier operation in a parallel computer
    patent, December 2014


    Performing an allreduce operation using shared memory
    patent, June 2014


    Performing a deterministic reduction operation in a parallel computer
    patent, February 2015


    Send-side matching of data communications messages
    patent, July 2014


    Collective operation protocol selection in a parallel computer
    patent, November 2014


    Collective operation protocol selection in a parallel computer
    patent, June 2015


    Broadcasting a message in a parallel computer
    patent, November 2014


    Send-side matching of data communications messages
    patent, June 2014


    Improving efficiency of a global barrier operation in a parallel computer
    patent, October 2016