Implementing asyncronous collective operations in a multi-node processing system
Abstract
A method, system, and computer program product are disclosed for implementing an asynchronous collective operation in a multi-node data processing system. In one embodiment, the method comprises sending data to a plurality of nodes in the data processing system, broadcasting a remote get to the plurality of nodes, and using this remote get to implement asynchronous collective operations on the data by the plurality of nodes. In one embodiment, each of the nodes performs only one task in the asynchronous operations, and each node sets up a base address table with an entry for a base address of a memory buffer associated with that node. In another embodiment, each of the nodes performs a plurality of tasks in the collective operations, and each task of each node sets up a base address table with an entry for a base address of a memory buffer associated with the task.
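The abstract names two pieces of machinery: a per-node base address table and a remote get that is broadcast to the participating nodes. The following is a minimal, single-process sketch of how those two pieces could fit together in the one-task-per-node embodiment. It is an illustration under stated assumptions, not the patented implementation: the names `RemoteGet`, `Node`, `register_buffer`, `handle_remote_get`, and `broadcast_remote_get` are hypothetical, a `bytearray` stands in for a memory buffer at a base address, and a plain loop stands in for the network broadcast.

```python
from dataclasses import dataclass, field


@dataclass
class RemoteGet:
    """Hypothetical descriptor telling a receiver what to pull and where to put it."""
    src_node: int    # rank of the node that holds the data
    src_bat_id: int  # base address table slot on the source node
    dst_bat_id: int  # base address table slot on the receiving node
    offset: int      # byte offset applied to both base addresses
    length: int      # number of bytes to copy


@dataclass
class Node:
    rank: int
    # Base address table: slot id -> buffer (assumption: a bytearray models the buffer).
    bat: dict = field(default_factory=dict)

    def register_buffer(self, bat_id: int, size: int) -> bytearray:
        """Allocate a buffer and record it in this node's base address table."""
        buf = bytearray(size)
        self.bat[bat_id] = buf
        return buf

    def handle_remote_get(self, desc: RemoteGet, nodes: list) -> None:
        """Resolve both table entries and copy the requested byte range."""
        src = nodes[desc.src_node].bat[desc.src_bat_id]
        dst = self.bat[desc.dst_bat_id]
        end = desc.offset + desc.length
        dst[desc.offset:end] = src[desc.offset:end]


def broadcast_remote_get(root: int, desc: RemoteGet, nodes: list) -> None:
    """Deliver the same descriptor to every node except the root (loop models the broadcast)."""
    for node in nodes:
        if node.rank != root:
            node.handle_remote_get(desc, nodes)


def demo() -> list:
    nodes = [Node(rank) for rank in range(4)]
    for node in nodes:
        node.register_buffer(bat_id=0, size=8)
    nodes[0].bat[0][:] = b"payload!"  # root fills its buffer with 8 bytes
    desc = RemoteGet(src_node=0, src_bat_id=0, dst_bat_id=0, offset=0, length=8)
    broadcast_remote_get(0, desc, nodes)
    return [bytes(node.bat[0]) for node in nodes]
```

Calling `demo()` returns the contents of every node's buffer after the broadcast. Because the descriptor carries only table slot ids and an offset, each receiver resolves its own buffer locally, which is the role the abstract assigns to the base address table. In the multi-task embodiment, each task would presumably hold its own table rather than sharing one per node.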
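- Inventors:
- Chen, Dong; Eisley, Noel A.; Heidelberger, Philip; Kumar, Sameer; Salapura, Valentina; Steinmacher-Burow, Burkhard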
- Issue Date:
- July 2014
- Research Org.:
- International Business Machines Corporation, Armonk, NY (USA)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1143663
- Patent Number(s):
- 8782164
- Application Number:
- 12/697,043
- Assignee:
- International Business Machines Corporation (Armonk, NY)
- Patent Classifications (CPCs):
- G - PHYSICS; G06 - COMPUTING; G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- B554331
- Resource Type:
- Patent
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Chen, Dong, Eisley, Noel A., Heidelberger, Philip, Kumar, Sameer, Salapura, Valentina, and Steinmacher-Burow, Burkhard. Implementing asyncronous collective operations in a multi-node processing system. United States: N. p., 2014. Web.
Chen, Dong, Eisley, Noel A., Heidelberger, Philip, Kumar, Sameer, Salapura, Valentina, & Steinmacher-Burow, Burkhard. Implementing asyncronous collective operations in a multi-node processing system. United States.
Chen, Dong, Eisley, Noel A., Heidelberger, Philip, Kumar, Sameer, Salapura, Valentina, and Steinmacher-Burow, Burkhard. 2014. "Implementing asyncronous collective operations in a multi-node processing system". United States. https://www.osti.gov/servlets/purl/1143663.
@article{osti_1143663,
title = {Implementing asyncronous collective operations in a multi-node processing system},
author = {Chen, Dong and Eisley, Noel A. and Heidelberger, Philip and Kumar, Sameer and Salapura, Valentina and Steinmacher-Burow, Burkhard},
abstractNote = {A method, system, and computer program product are disclosed for implementing an asynchronous collective operation in a multi-node data processing system. In one embodiment, the method comprises sending data to a plurality of nodes in the data processing system, broadcasting a remote get to the plurality of nodes, and using this remote get to implement asynchronous collective operations on the data by the plurality of nodes. In one embodiment, each of the nodes performs only one task in the asynchronous operations, and each node sets up a base address table with an entry for a base address of a memory buffer associated with that node. In another embodiment, each of the nodes performs a plurality of tasks in the collective operations, and each task of each node sets up a base address table with an entry for a base address of a memory buffer associated with the task.},
place = {United States},
year = {2014},
month = {7}
}
Works referenced in this record:
Cluster-based aggregated switching technique (CAST) for routing data packets and information objects in computer networks
patent-application, September 2002
- Garcia-Luna-Aceves, J. J.; Samanta, Arindam
- US Patent Application 09/945104; 20020129086
Asyncronous Broadcast for Ordered Delivery Between Compute Nodes in a Parallel Computing System Where Packet Header Space is Limited
patent-application, January 2009
- Kumar, Sameer
- US Patent Application 11/768619; 20090003344
Multiple Node Remote Messaging
patent-application, January 2009
- Blumrich, Matthias A.; Chen, Dong; Gara, Alan G.
- US Patent Application 11/768784; 20090006546
Mechanism to Support Generic Collective Communication Across a Variety of Programming Models
patent-application, January 2009
- Almasi, Gheorghe; Dozsa, Gabor; Kumar, Sameer
- US Patent Application 11/768669; 20090006810
Reducing Layering Overhead in Collective Communication Operations
patent-application, January 2009
- Jia, Bin
- US Patent Application 11/771311; 20090007140
Message Passing with a Limited Number of DMA Byte Counters
patent-application, January 2009
- Blocksome, Michael; Chen, Dong; Giampapa, Mark E.
- US Patent Application 11/768813; 20090007141
Chaining Direct Memory Access Data Transfer Operations for Compute Nodes in a Parallel Computer
patent-application, January 2009
- Archer, Charles J.; Blocksome, Michael A.
Facilitating Intra-Node Data Transfer in Collective Communications, and Methods Therefor
patent-application, August 2009
- Blackmore, Robert S.; Jia, Bin; Treumann, Richard R.
- US Patent Application 12/435500; 20090210635
Recording A Communication Pattern and Replaying Messages in a Parallel Computing System
patent-application, January 2011
- Heidelberger, Philip; Kumar, Sameer
- US Patent Application 12/500715; 20110010471