skip to main content
DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Runtime optimization of an application executing on a parallel computer

Abstract

Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.

Inventors:
;
Issue Date:
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1163970
Patent Number(s):
8,893,150
Application Number:
13/767,997
Assignee:
International Business Machines Corporation (Armonk, NY)
DOE Contract Number:  
B554331
Resource Type:
Patent
Resource Relation:
Patent File Date: 2013 Feb 15
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Faraj, Daniel A, and Smith, Brian E. Runtime optimization of an application executing on a parallel computer. United States: N. p., 2014. Web.
Faraj, Daniel A, & Smith, Brian E. Runtime optimization of an application executing on a parallel computer. United States.
Faraj, Daniel A, and Smith, Brian E. Tue . "Runtime optimization of an application executing on a parallel computer". United States. https://www.osti.gov/servlets/purl/1163970.
@article{osti_1163970,
title = {Runtime optimization of an application executing on a parallel computer},
author = {Faraj, Daniel A and Smith, Brian E},
abstractNote = {Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2014},
month = {11}
}

Patent:

Save / Share:

Works referenced in this record:

Profiling wide-area networks using peer cooperation
patent-application, September 2006


Heuristic Status Polling
patent-application, December 2008


STAR-MPI: self tuned adaptive routines for MPI collective operations
conference, January 2006

  • Faraj, Ahmad; Yuan, Xin; Lowenthal, David
  • Proceedings of the 20th annual international conference on Supercomputing - ICS '06
  • DOI: 10.1145/1183401.1183431

Optimization of MPI collective communication on BlueGene/L systems
conference, January 2005

  • Almási, George; Heidelberger, Philip; Archer, Charles J.
  • Proceedings of the 19th annual international conference on Supercomputing - ICS '05
  • DOI: 10.1145/1088149.1088183

Collective communication on architectures that support simultaneous communication over multiple links
conference, January 2006

  • Chan, Ernie; van de Geijn, Robert; Gropp, William
  • Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '06
  • DOI: 10.1145/1122971.1122975

Performance analysis of parallel programs via message-passing graph traversal
conference, January 2006

  • Sottile, M. J.; Chandu, V. P.; Bader, D. A.
  • Proceedings 20th IEEE International Parallel & Distributed Processing Symposium
  • DOI: 10.1109/IPDPS.2006.1639321

Visual Programming for Message-Passing Systems
journal, August 1999

  • Stankovic, Nenad; Zhang, Kang
  • International Journal of Software Engineering and Knowledge Engineering, Vol. 09, Issue 04
  • DOI: 10.1142/S0218194099000231

The Autopilot performance-directed adaptive control system
journal, September 2001

  • Ribler, Randy L.; Simitci, Huseyin; Reed, Daniel A.
  • Future Generation Computer Systems, Vol. 18, Issue 1, p. 175-187
  • DOI: 10.1016/S0167-739X(01)00051-6

Automated cluster-based web service performance tuning
conference, January 2004

  • I-Hsin Chung, ; Hollingsworth, J. K.
  • Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004.
  • DOI: 10.1109/HPDC.2004.1323484

An empirically derived framework for classifying parallel program performance tuning problems
conference, January 1998

  • Hondroudakis, Anna; Procter, Rob
  • Proceedings of the SIGMETRICS symposium on Parallel and distributed tools - SPDT '98
  • DOI: 10.1145/281035.281047

Overview of the Blue Gene/L system architecture
journal, March 2005

  • Gara, A.; Blumrich, M. A.; Chen, D.
  • IBM Journal of Research and Development, Vol. 49, Issue 2.3
  • DOI: 10.1147/rd.492.0195

Blue Gene/L torus interconnection network
journal, March 2005

  • Adiga, N. R.; Blumrich, M. A.; Chen, D.
  • IBM Journal of Research and Development, Vol. 49, Issue 2.3
  • DOI: 10.1147/rd.492.0265

MPI Collective Communications on The Blue Gene/P Supercomputer: Algorithms and Optimizations
conference, August 2009

  • Faraj, Ahmad; Kumar, Sameer; Smith, Brian
  • 2009 17th Annual IEEE Symposium on High-Performance Interconnects (HOTI), 2009 17th IEEE Symposium on High Performance Interconnects
  • DOI: 10.1109/HOTI.2009.12

A Study of Process Arrival Patterns for MPI Collective Operations
journal, February 2008

  • Faraj, Ahmad; Patarasuk, Pitch; Yuan, Xin
  • International Journal of Parallel Programming, Vol. 36, Issue 6
  • DOI: 10.1007/s10766-008-0070-9

Automatic generation and tuning of MPI collective communication routines
conference, January 2005

  • Faraj, Ahmad; Yuan, Xin
  • Proceedings of the 19th annual international conference on Supercomputing - ICS '05
  • DOI: 10.1145/1088149.1088202

The Blue Gene/L Supercomputer: A Hardware and Software Story
journal, May 2007

  • Moreira, José E.; Salapura, Valentina; Almasi, George
  • International Journal of Parallel Programming, Vol. 35, Issue 3, p. 181-206
  • DOI: 10.1007/s10766-007-0037-2