Runtime optimization of an application executing on a parallel computer
Abstract
Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.
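The decision flow in the abstract can be sketched roughly as follows. This is only an illustration, not the patented implementation: the names `CollectiveCall`, `ROOT_BASED`, `should_tune`, and `run_collective` are hypothetical, the set of root-based operations is an assumption based on MPI-style collectives that take a root argument, and a plain list stands in for the per-node information that a real system would have to exchange between compute nodes.

```python
from dataclasses import dataclass
from typing import Sequence

# Assumption: collectives treated as root-based (MPI-style operations that
# take a root rank). The record itself does not enumerate them.
ROOT_BASED = {"broadcast", "reduce", "gather", "scatter"}


@dataclass(frozen=True)
class CollectiveCall:
    """One compute node's view of a collective operation (hypothetical type)."""
    name: str       # operation name, e.g. "allreduce"
    call_site: str  # call-site identifier, e.g. a return address or file:line


def should_tune(calls_by_node: Sequence[CollectiveCall]) -> bool:
    """Decide whether to establish a tuning session for this collective."""
    if not calls_by_node:
        raise ValueError("at least one compute node is required")
    operation = calls_by_node[0].name
    # Not root-based: always establish a tuning session.
    if operation not in ROOT_BASED:
        return True
    # Root-based: tune only if every node identified the same call site.
    call_sites = {call.call_site for call in calls_by_node}
    return len(call_sites) == 1


def run_collective(calls_by_node: Sequence[CollectiveCall]) -> str:
    """Return a label describing how the collective would be executed."""
    if should_tune(calls_by_node):
        return "executed in tuning session"
    return "executed without tuning session"


if __name__ == "__main__":
    same_site = [CollectiveCall("broadcast", "solver.c:120")] * 4
    mixed_site = [
        CollectiveCall("broadcast", "solver.c:120"),
        CollectiveCall("broadcast", "io.c:77"),
    ]
    print(run_collective(same_site))
    print(run_collective(mixed_site))
    print(run_collective([CollectiveCall("allreduce", "solver.c:200")]))
```

In this sketch a root-based collective reached from different call sites on different nodes is executed untuned, which mirrors the final branch of the abstract.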
- Inventors:
- Faraj, Daniel A.; Smith, Brian E.
- Issue Date:
- 2013 Jan 29
- Research Org.:
- International Business Machines Corp., Armonk, NY (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1082649
- Patent Number(s):
- 8365186
- Application Number:
- 12/760,111
- Assignee:
- International Business Machines Corporation (Armonk, NY)
- Patent Classifications (CPCs):
- G - PHYSICS; G06 - COMPUTING; G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- B554331
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 2010 Apr 14
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Faraj, Daniel A., and Smith, Brian E. Runtime optimization of an application executing on a parallel computer. United States: N. p., 2013. Web.
Faraj, Daniel A., & Smith, Brian E. Runtime optimization of an application executing on a parallel computer. United States.
Faraj, Daniel A., and Smith, Brian E. 2013. "Runtime optimization of an application executing on a parallel computer". United States. https://www.osti.gov/servlets/purl/1082649.
@article{osti_1082649,
title = {Runtime optimization of an application executing on a parallel computer},
author = {Faraj, Daniel A. and Smith, Brian E.},
abstractNote = {Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.},
place = {United States},
year = {2013},
month = {jan}
}