Runtime optimization of an application executing on a parallel computer
Abstract
Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.
- Issue Date:
- Research Org.:
- International Business Machines Corp., Armonk, NY (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1164339
- Patent Number(s):
- 8898678
- Application Number:
- 13/663,545
- Assignee:
- International Business Machines Corporation (Armonk, NY)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- B554331
- Resource Type:
- Patent
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
. Runtime optimization of an application executing on a parallel computer. United States: N. p., 2014.
Web.
. Runtime optimization of an application executing on a parallel computer. United States.
. Tue .
"Runtime optimization of an application executing on a parallel computer". United States. https://www.osti.gov/servlets/purl/1164339.
@article{osti_1164339,
title = {Runtime optimization of an application executing on a parallel computer},
author = {},
abstractNote = {Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2014},
month = {11}
}
Works referenced in this record:
Optimization of MPI collective communication on BlueGene/L systems
conference, January 2005
- Almási, George; Heidelberger, Philip; Archer, Charles J.
- Proceedings of the 19th annual international conference on Supercomputing - ICS '05
Collective communication on architectures that support simultaneous communication over multiple links
conference, January 2006
- Chan, Ernie; van de Geijn, Robert; Gropp, William
- Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '06
Performance analysis of parallel programs via message-passing graph traversal
conference, January 2006
- Sottile, M. J.; Chandu, V. P.; Bader, D. A.
- Proceedings 20th IEEE International Parallel & Distributed Processing Symposium
Visual Programming for Message-Passing Systems
journal, August 1999
- Stankovic, Nenad; Zhang, Kang
- International Journal of Software Engineering and Knowledge Engineering, Vol. 09, Issue 04
STAR-MPI: self tuned adaptive routines for MPI collective operations
conference, January 2006
- Faraj, Ahmad; Yuan, Xin; Lowenthal, David
- Proceedings of the 20th annual international conference on Supercomputing - ICS '06
The Autopilot performance-directed adaptive control system
journal, September 2001
- Ribler, Randy L.; Simitci, Huseyin; Reed, Daniel A.
- Future Generation Computer Systems, Vol. 18, Issue 1, p. 175-187
Automated cluster-based web service performance tuning
conference, January 2004
- I-Hsin Chung, ; Hollingsworth, J. K.
- Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004.
An empirically derived framework for classifying parallel program performance tuning problems
conference, January 1998
- Hondroudakis, Anna; Procter, Rob
- Proceedings of the SIGMETRICS symposium on Parallel and distributed tools - SPDT '98
Overview of the Blue Gene/L system architecture
journal, March 2005
- Gara, A.; Blumrich, M. A.; Chen, D.
- IBM Journal of Research and Development, Vol. 49, Issue 2.3
Blue Gene/L torus interconnection network
journal, March 2005
- Adiga, N. R.; Blumrich, M. A.; Chen, D.
- IBM Journal of Research and Development, Vol. 49, Issue 2.3
Broadcasting on Meshes with Wormhole Routing
journal, June 1996
- Barnett, Mike; Payne, David G.; van de Geijn, Robert A.
- Journal of Parallel and Distributed Computing, Vol. 35, Issue 2
MPI Collective Communications on The Blue Gene/P Supercomputer: Algorithms and Optimizations
conference, August 2009
- Faraj, Ahmad; Kumar, Sameer; Smith, Brian
- 2009 17th Annual IEEE Symposium on High-Performance Interconnects (HOTI), 2009 17th IEEE Symposium on High Performance Interconnects
A Study of Process Arrival Patterns for MPI Collective Operations
journal, February 2008
- Faraj, Ahmad; Patarasuk, Pitch; Yuan, Xin
- International Journal of Parallel Programming, Vol. 36, Issue 6
Automatic generation and tuning of MPI collective communication routines
conference, January 2005
- Faraj, Ahmad; Yuan, Xin
- Proceedings of the 19th annual international conference on Supercomputing - ICS '05
The Blue Gene/L Supercomputer: A Hardware and Software Story
journal, May 2007
- Moreira, José E.; Salapura, Valentina; Almasi, George
- International Journal of Parallel Programming, Vol. 35, Issue 3, p. 181-206