Performing optimized collective operations in a irregular subcommunicator of compute nodes in a parallel computer
Abstract
In a parallel computer, performing optimized collective operations in an irregular subcommunicator of compute nodes may be carried out by: identifying, within the irregular subcommunicator, regular neighborhoods of compute nodes; selecting, for each neighborhood from the compute nodes of the neighborhood, a local root node; assigning each local root node to a node of a neighborhood-wide tree topology; mapping, for each neighborhood, the compute nodes of the neighborhood to a local tree topology having, at its root, the local root node of the neighborhood; and performing a one-way, rooted collective operation within the subcommunicator including: performing, in one phase, the collective operation within each neighborhood; and performing, in another phase, the collective operation amongst the local root nodes.
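The two-phase scheme in the abstract can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the patented implementation: the neighborhoods are taken as already identified, the local root is chosen as the lowest rank, both tree topologies are binary trees in heap order, and the collective is a sum reduction. The names `binary_tree_parents`, `tree_reduce`, and `two_phase_reduce` are hypothetical.

```python
from typing import Dict, List


def binary_tree_parents(nodes: List[int]) -> Dict[int, int]:
    """Map nodes onto a binary tree rooted at nodes[0]; return child -> parent."""
    return {nodes[i]: nodes[(i - 1) // 2] for i in range(1, len(nodes))}


def tree_reduce(nodes: List[int], values: Dict[int, int]) -> int:
    """Sum the values up the tree so that the total arrives at nodes[0]."""
    parents = binary_tree_parents(nodes)
    partial = {n: values[n] for n in nodes}
    # Visit the deepest nodes first so every child forwards a complete partial sum.
    for child in reversed(nodes[1:]):
        partial[parents[child]] += partial[child]
    return partial[nodes[0]]


def two_phase_reduce(neighborhoods: List[List[int]], values: Dict[int, int]) -> int:
    """Rooted sum reduction: first inside each neighborhood, then among local roots."""
    # Assumption: the lowest rank in each neighborhood acts as its local root.
    ordered = [sorted(nb) for nb in neighborhoods]
    local_roots = [nb[0] for nb in ordered]
    # Phase 1: reduce within each neighborhood over its local tree.
    root_values = {nb[0]: tree_reduce(nb, values) for nb in ordered}
    # Phase 2: reduce among the local roots over the tree that connects them.
    return tree_reduce(local_roots, root_values)


# Hypothetical irregular subcommunicator: two regular blocks plus one stray rank.
neighborhoods = [[0, 1, 2, 3], [8, 9, 10, 11], [5]]
values = {rank: rank for nb in neighborhoods for rank in nb}
total = two_phase_reduce(neighborhoods, values)
```

Here the global root is the local root of the first neighborhood. A broadcast would run the same two phases in the opposite order and direction, starting among the local roots and then fanning out within each neighborhood.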
- Inventors:
- Davis, Kristan Suzanne D.; Faraj, Daniel A.
- Issue Date:
- March 2021
- Research Org.:
- International Business Machines Corp., Armonk, NY (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1805594
- Patent Number(s):
- 10938889
- Application Number:
- 16/437,119
- Assignee:
- International Business Machines Corporation (Armonk, NY)
- Patent Classifications (CPCs):
- G - PHYSICS; G06 - COMPUTING; G06F - ELECTRIC DIGITAL DATA PROCESSING
- H - ELECTRICITY; H04 - ELECTRIC COMMUNICATION TECHNIQUE; H04L - TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- DOE Contract Number:
- B554431
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 06/11/2019
- Country of Publication:
- United States
- Language:
- English
Citation Formats
MLA: Davis, Kristan Suzanne D., and Faraj, Daniel A. Performing optimized collective operations in a irregular subcommunicator of compute nodes in a parallel computer. United States: N. p., 2021. Web.
APA: Davis, Kristan Suzanne D., & Faraj, Daniel A. (2021). Performing optimized collective operations in a irregular subcommunicator of compute nodes in a parallel computer. United States.
Chicago: Davis, Kristan Suzanne D., and Faraj, Daniel A. 2021. "Performing optimized collective operations in a irregular subcommunicator of compute nodes in a parallel computer". United States. https://www.osti.gov/servlets/purl/1805594.
BibTeX:
@article{osti_1805594,
title = {Performing optimized collective operations in a irregular subcommunicator of compute nodes in a parallel computer},
author = {Davis, Kristan Suzanne D. and Faraj, Daniel A.},
abstractNote = {In a parallel computer, performing optimized collective operations in an irregular subcommunicator of compute nodes may be carried out by: identifying, within the irregular subcommunicator, regular neighborhoods of compute nodes; selecting, for each neighborhood from the compute nodes of the neighborhood, a local root node; assigning each local root node to a node of a neighborhood-wide tree topology; mapping, for each neighborhood, the compute nodes of the neighborhood to a local tree topology having, at its root, the local root node of the neighborhood; and performing a one way, rooted collective operation within the subcommunicator including: performing, in one phase, the collective operation within each neighborhood; and performing, in another phase, the collective operation amongst the local root nodes.},
doi = {},
journal = {},
number = {},
volume = {},
place = {United States},
year = {2021},
month = {3}
}
Works referenced in this record:
Switches and a Network of Switches
patent-application, May 2012
- Coppola, Antonio-Marcello; Locatelli, Riccardo; Flinch, Cardo Jose
- US Patent Application 13/272342; 20120134364
Performing Optimized Collective Operations in an Irregular Subcommunicator of Compute Nodes in a Parallel Computer
patent-application, April 2015
- Davis, Kristan D.; Faraj, Daniel A.
- US Patent Application 14/055402; 20150106419
Identifying A Largest Logical Plane From A Plurality Of Logical Planes Formed Of Compute Nodes Of A Subcommunicator In A Parallel Computer
patent-application, April 2015
- Davis, Kristan D.; Faraj, Daniel A.
- US Patent Application 14/053171; 20150106482
Method for deadlock-free message passing in MIMD systems using routers and buffers
patent, January 1999
- Levin, Vladimir K.; Karatanov, Vjacheslav V.; Jalin, Valerii V.
- US Patent Document 5,859,981
Method and apparatus for self-organizing node groups on a network
patent, December 2008
- Abdelaziz, Mohamed M.; Traversat, Bernard A.; da Fonseca, Andre Marques
- US Patent Document 7,461,130
Method and apparatus for increasing throughput in a storage server
patent-application, April 2007
- Lango, Jason A.; English, Robert M.; Endo, Yasuhiro
- US Patent Application 11/255859; 20070094529
On-demand instantiation in a high-performance computing (HPC) system
patent-application, June 2006
- Davidson, Shannon V.
- US Patent Application 10/991994; 20060117208
Identifying Logical Planes Formed Of Compute Nodes Of A Subcommunicator In A Parallel Computer
patent-application, September 2014
- Davis, Kristan D.; Faraj, Daniel A.
- US Patent Application 13/795854; 20140281374
Line-Plane Broadcasting in a Data Communications Network of a Parallel Computer
patent-application, February 2009
- Archer, Charles J.; Berg, Jeremy E.; Blocksome, Michael A.
- US Patent Application 11/843090; 20090055474
Virtual channel assignment in large torus systems
patent, August 2000
- Passint, Randal S.; Thorson, Greg; Galles, Michael B.
- US Patent Document 6,101,181
Mapping of nodes in an interconnection fabric
patent, February 2006
- Lee, Whay Sing
- US Patent Document 7,000,033
Storage array interconnection fabric using a torus topology
patent, April 2004
- Lee, Whay Sing; Rettberg, Randall D.; Talagala, Nisha
- US Patent Document 6,718,428
Broadcast Latency Optimization in Multihop Wireless Networks
patent-application, June 2010
- Lee, Seungjoon; Grandhi, Rajiv; Kim, Yoo-Ah
- US Patent Application 12/335727; 20100149983
Constructing A Logical, Regular Axis Topology From An Irregular Topology
patent-application, June 2013
- Faraj, Daniel A.
- US Patent Application 13/309022; 20130145003
High performance storage array interconnection fabric using multiple independent paths
patent, July 2008
- Lee, Whay Sing; Rettberg, Randall D.; Talagala, Nisha
- US Patent Document 7,401,161
Methods and apparatus for maintaining a map of node relationships for a network
patent-application, September 2006
- O'Toole, James; Jannotti, John J.
- US Patent Application 11/431600; 20060218301
Communication system and method providing optimal restoration of failed paths
patent, November 1998
- Allen, John
- US Patent Document 5,835,482
Protocol and structure for self-organizing network
patent-application, January 2004
- Maeda, Masahiro; Bourgeois, Monique; Callaway, Jr., Edgar H.
- US Patent Application 10/125939; 20040003111
Identifying Logical Planes Formed Of Compute Nodes Of A Subcommunicator In A Parallel Computer
patent-application, September 2014
- Davis, Kristan D.; Faraj, Daniel A.
- US Patent Application 13/800226; 20140281377
Executing an Allgather Operation on a Parallel Computer
patent-application, October 2007
- Archer, Charles J.; Moreira, Jose F.; Ratterman, Joseph D.
- US Patent Application 11/279620; 20070245122
Master node selection in clustered node configurations
patent-application, July 2003
- Sampathkumar, Govindaraj
- US Patent Application 10/052551; 20030140108
Network system, spanning tree configuration method and configuration program, and spanning tree configuration node
patent-application, August 2004
- Enomoto, Nobuyuki; Umayabashi, Masaki; Hidaka, Youichi
- US Patent Application 10/642203; 20040160904
Constructing a Logical, Regular Axis Topology from an Irregular Topology
patent-application, June 2013
- Faraj, Daniel A.
- US Patent Application 13/742453; 20130145012
Configuring Compute Nodes of a Parallel Computer in an Operational Group into a Plurality of Independent Non-Overlapping Collective Networks
patent-application, February 2009
- Archer, Charles J.; Inglett, Todd A.; Ratterman, Joseph D.
- US Patent Application 11/837015; 20090043988