DOE Patents, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Method and apparatus for obtaining stack traceback data for multiple computing nodes of a massively parallel computer system

Abstract

A data collector for a massively parallel computer system obtains call-return stack traceback data for multiple nodes by retrieving partial call-return stack traceback data from each node, grouping the nodes into subsets according to the partial traceback data, and obtaining further call-return stack traceback data from one or more representative nodes of each subset. Preferably, the partial data is a respective instruction address from each node, with nodes having identical instruction addresses grouped together in the same subset. Preferably, a single node of each subset is chosen and full stack traceback data is retrieved from the call-return stack within the chosen node.
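
The grouping step described in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the patented implementation: the function name `collect_tracebacks`, the two reader callbacks, the simulated stack contents, and the choice of the first node seen as the representative are all hypothetical and do not come from the patent record.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple


def collect_tracebacks(
    node_ids: Iterable[int],
    read_instruction_address: Callable[[int], int],
    read_full_traceback: Callable[[int], List[int]],
) -> Dict[int, Tuple[List[int], List[int]]]:
    """Map each instruction address to (member nodes, representative's traceback)."""
    # Step 1: retrieve the partial traceback data (one instruction address)
    # from every node and group nodes that report the same address.
    groups: Dict[int, List[int]] = defaultdict(list)
    for node in node_ids:
        groups[read_instruction_address(node)].append(node)

    # Step 2: read the full call-return stack from one node per subset only.
    result: Dict[int, Tuple[List[int], List[int]]] = {}
    for address, members in groups.items():
        representative = members[0]  # assumption: first node seen in the subset
        result[address] = (members, read_full_traceback(representative))
    return result


# Hypothetical simulated machine: node id -> call-return stack, innermost first.
SIMULATED_STACKS = {
    0: [0x4010, 0x2000, 0x1000],
    1: [0x4010, 0x2000, 0x1000],
    2: [0x5020, 0x2100, 0x1000],
    3: [0x4010, 0x2000, 0x1000],
}
full_reads: List[int] = []  # records which nodes had their full stack read


def simulated_address(node: int) -> int:
    return SIMULATED_STACKS[node][0]


def simulated_traceback(node: int) -> List[int]:
    full_reads.append(node)
    return list(SIMULATED_STACKS[node])


report = collect_tracebacks(sorted(SIMULATED_STACKS), simulated_address, simulated_traceback)
for address, (members, traceback) in report.items():
    frames = " <- ".join(hex(frame) for frame in traceback)
    print(f"{hex(address)}: nodes {members}, stack {frames}")
```

In this simulation, four nodes yield two subsets, so only two full stack reads are issued instead of four. The saving grows with the number of nodes that are stopped at the same instruction.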

Inventors:
Gooding, Thomas Michael [1]; McCarthy, Patrick Joseph [1]
  1. Rochester, MN
Issue Date:
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1015491
Patent Number(s):
7673182
Application Number:
11/425,778
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
B591700
Resource Type:
Patent
Country of Publication:
United States
Language:
English

Citation Formats

Gooding, Thomas Michael, and McCarthy, Patrick Joseph. Method and apparatus for obtaining stack traceback data for multiple computing nodes of a massively parallel computer system. United States: N. p., 2010. Web.
Gooding, Thomas Michael, & McCarthy, Patrick Joseph. Method and apparatus for obtaining stack traceback data for multiple computing nodes of a massively parallel computer system. United States.
Gooding, Thomas Michael, and McCarthy, Patrick Joseph. 2010. "Method and apparatus for obtaining stack traceback data for multiple computing nodes of a massively parallel computer system". United States. https://www.osti.gov/servlets/purl/1015491.
@article{osti_1015491,
title = {Method and apparatus for obtaining stack traceback data for multiple computing nodes of a massively parallel computer system},
author = {Gooding, Thomas Michael and McCarthy, Patrick Joseph},
abstractNote = {A data collector for a massively parallel computer system obtains call-return stack traceback data for multiple nodes by retrieving partial call-return stack traceback data from each node, grouping the nodes into subsets according to the partial traceback data, and obtaining further call-return stack traceback data from one or more representative nodes of each subset. Preferably, the partial data is a respective instruction address from each node, with nodes having identical instruction addresses grouped together in the same subset. Preferably, a single node of each subset is chosen and full stack traceback data is retrieved from the call-return stack within the chosen node.},
doi = {},
journal = {},
number = {},
volume = {},
place = {United States},
year = {2010},
month = {3}
}
