DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Identifying messaging completion in a parallel computer by checking for change in message received and transmitted count at each node

Abstract

Methods, parallel computers, and products are provided for identifying messaging completion on a parallel computer. The parallel computer includes a plurality of compute nodes, the compute nodes coupled for data communications by at least two independent data communications networks including a binary tree data communications network optimal for collective operations that organizes the nodes as a tree and a torus data communications network optimal for point to point operations that organizes the nodes as a torus. Embodiments include reading all counters at each node of the torus data communications network; calculating at each node a current node value in dependence upon the values read from the counters at each node; and determining for all nodes whether the current node value for each node is the same as a previously calculated node value for each node. If the current node is the same as the previously calculated node value for all nodes of the torus data communications network, embodiments include determining that messaging is complete and if the current node is not the same as the previously calculated node value for all nodes of the torus data communications network, embodiments include determining that messaging is currently incomplete.

Inventors:
 [1];  [2];  [1];  [3]
  1. Rochester, MN
  2. Fayetteville, NC
  3. Eden Prairie, MN
Issue Date:
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
988401
Patent Number(s):
7552312
Application Number:
11/672,989
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
B519700
Resource Type:
Patent
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Archer, Charles J, Hardwick, Camesha R, McCarthy, Patrick J, and Wallenfelt, Brian P. Identifying messaging completion in a parallel computer by checking for change in message received and transmitted count at each node. United States: N. p., 2009. Web.
Archer, Charles J, Hardwick, Camesha R, McCarthy, Patrick J, & Wallenfelt, Brian P. Identifying messaging completion in a parallel computer by checking for change in message received and transmitted count at each node. United States.
Archer, Charles J, Hardwick, Camesha R, McCarthy, Patrick J, and Wallenfelt, Brian P. Tue . "Identifying messaging completion in a parallel computer by checking for change in message received and transmitted count at each node". United States. https://www.osti.gov/servlets/purl/988401.
@article{osti_988401,
title = {Identifying messaging completion in a parallel computer by checking for change in message received and transmitted count at each node},
author = {Archer, Charles J and Hardwick, Camesha R and McCarthy, Patrick J and Wallenfelt, Brian P},
abstractNote = {Methods, parallel computers, and products are provided for identifying messaging completion on a parallel computer. The parallel computer includes a plurality of compute nodes, the compute nodes coupled for data communications by at least two independent data communications networks including a binary tree data communications network optimal for collective operations that organizes the nodes as a tree and a torus data communications network optimal for point to point operations that organizes the nodes as a torus. Embodiments include reading all counters at each node of the torus data communications network; calculating at each node a current node value in dependence upon the values read from the counters at each node; and determining for all nodes whether the current node value for each node is the same as a previously calculated node value for each node. If the current node is the same as the previously calculated node value for all nodes of the torus data communications network, embodiments include determining that messaging is complete and if the current node is not the same as the previously calculated node value for all nodes of the torus data communications network, embodiments include determining that messaging is currently incomplete.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2009},
month = {6}
}