Probabilistic method for grouping data. [PMGPER, PMGCLS, PMGEST, in FORTRAN for IBM 360/91]
A probabilistic method for grouping data was developed to incorporate measurement error into standard cluster analysis procedures. In the analysis, the data are perturbed by Monte Carlo techniques to simulate the experimental error, and the resultant data sets are clustered. By varying the number of clusters, a procedure is given to estimate the unknown number of groups. This technique and other standard procedures for determining the number of groups are described and compared for three different examples. The probabilistic method is shown to have advantages for determining the number of groups and the probabilities for a sample's membership in the hypothesized groups. 33 figures, 8 tables.
- Research Organization:
- Union Carbide Corp., Oak Ridge, TN (USA). Nuclear Div.
- DOE Contract Number:
- W-7405-ENG-26
- OSTI ID:
- 5751061
- Report Number(s):
- ORNL/CSD/TM-65
- Country of Publication:
- United States
- Language:
- English
Similar Records
Probabilistic analysis of nuclear fuel rod behavior using a quasi-Monte Carlo method
Unsupervised Group Discovery and LInk Prediction in Relational Datasets: a nonparametric Bayesian approach