Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Probabilistic method for grouping data. [PMGPER, PMGCLS, PMGEST, in FORTRAN for IBM 360/91]

Technical Report ·
DOI:https://doi.org/10.2172/5751061· OSTI ID:5751061

A probabilistic method for grouping data was developed to incorporate measurement error into standard cluster analysis procedures. In the analysis, the data are perturbed by Monte Carlo techniques to simulate the experimental error, and the resultant data sets are clustered. By varying the number of clusters, a procedure is given to estimate the unknown number of groups. This technique and other standard procedures for determining the number of groups are described and compared for three different examples. The probabilistic method is shown to have advantages for determining the number of groups and the probabilities for a sample's membership in the hypothesized groups. 33 figures, 8 tables.

Research Organization:
Union Carbide Corp., Oak Ridge, TN (USA). Nuclear Div.
DOE Contract Number:
W-7405-ENG-26
OSTI ID:
5751061
Report Number(s):
ORNL/CSD/TM-65
Country of Publication:
United States
Language:
English