Runtime optimization of an application executing on a parallel computer
Abstract
Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.
- Inventors:
- Issue Date:
- Research Org.:
- International Business Machines Corp., Armonk, NY (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1082649
- Patent Number(s):
- 8365186
- Application Number:
- 12/760,111
- Assignee:
- International Business Machines Corporation (Armonk, NY)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- B554331
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 2010 Apr 14
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Faraj, Daniel A., and Smith, Brian E. Runtime optimization of an application executing on a parallel computer. United States: N. p., 2013.
Web.
Faraj, Daniel A., & Smith, Brian E. Runtime optimization of an application executing on a parallel computer. United States.
Faraj, Daniel A., and Smith, Brian E. Tue .
"Runtime optimization of an application executing on a parallel computer". United States. https://www.osti.gov/servlets/purl/1082649.
@article{osti_1082649,
title = {Runtime optimization of an application executing on a parallel computer},
author = {Faraj, Daniel A. and Smith, Brian E.},
abstractNote = {Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2013},
month = {1}
}
Works referenced in this record:
MPI Collective Communications on The Blue Gene/P Supercomputer: Algorithms and Optimizations
conference, August 2009
- Faraj, Ahmad; Kumar, Sameer; Smith, Brian
- 2009 17th Annual IEEE Symposium on High-Performance Interconnects (HOTI), 2009 17th IEEE Symposium on High Performance Interconnects
Performance analysis of parallel programs via message-passing graph traversal
conference, January 2006
- Sottile, M. J.; Chandu, V. P.; Bader, D. A.
- Proceedings 20th IEEE International Parallel & Distributed Processing Symposium
Visual Programming for Message-Passing Systems
journal, August 1999
- Stankovic, Nenad; Zhang, Kang
- International Journal of Software Engineering and Knowledge Engineering, Vol. 09, Issue 04
The Blue Gene/L Supercomputer: A Hardware and Software Story
journal, May 2007
- Moreira, José E.; Salapura, Valentina; Almasi, George
- International Journal of Parallel Programming, Vol. 35, Issue 3, p. 181-206
Automatic generation and tuning of MPI collective communication routines
conference, January 2005
- Faraj, Ahmad; Yuan, Xin
- Proceedings of the 19th annual international conference on Supercomputing - ICS '05
A Study of Process Arrival Patterns for MPI Collective Operations
journal, February 2008
- Faraj, Ahmad; Patarasuk, Pitch; Yuan, Xin
- International Journal of Parallel Programming, Vol. 36, Issue 6