U.S. Department of Energy
Office of Scientific and Technical Information

Model-Based Clustering of Regression Time Series Data via APECM -- An AECM Algorithm Sung to an Even Faster Beat

Journal Article · Statistical Analysis and Data Mining
DOI: https://doi.org/10.1002/sam.10143 · OSTI ID: 1033186

We propose a model-based approach for clustering time series regression data in an unsupervised machine learning framework, identifying groups under the assumption that each mixture component follows a Gaussian autoregressive regression model of order p. Given the number of groups, the parameters can be estimated by the traditional maximum likelihood approach using the expectation-maximization (EM) algorithm, although this is computationally demanding. The somewhat faster tune to the EM folk song provided by the Alternating Expectation Conditional Maximization (AECM) algorithm can alleviate the problem to some extent. In this article, we develop an alternative partial expectation conditional maximization algorithm (APECM) that uses an additional data augmentation storage step to efficiently implement AECM for finite mixture models. Results from our simulation experiments show improved performance, with both fewer iterations and less computation time. The methodology is applied to the problem of clustering mutual funds data on the basis of their average annual percent returns and in the presence of economic indicators.
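
As a rough illustration of the model class described in the abstract, the following Python sketch computes E-step membership probabilities for a finite mixture in which each component is a Gaussian regression with AR(p) errors. It is not the authors' APECM implementation. The function names, the shared design matrix X, the per-component innovation variances, and the use of a likelihood conditioned on the first p residuals are all assumptions made for this sketch.

```python
import numpy as np

def ar_regression_loglik(y, X, beta, phi, sigma2):
    """Conditional Gaussian log-likelihood of one series under
    y_t = x_t' beta + e_t,  e_t = sum_j phi_j * e_{t-j} + u_t,  u_t ~ N(0, sigma2).
    Conditions on the first p residuals (an assumption of this sketch)."""
    e = y - X @ beta                      # regression residuals
    p = len(phi)
    u = e[p:].copy()                      # innovations for t = p, ..., T-1
    for j, phi_j in enumerate(phi, start=1):
        u -= phi_j * e[p - j:len(e) - j]  # subtract phi_j * e_{t-j}
    n = len(u)
    return -0.5 * (n * np.log(2 * np.pi * sigma2) + u @ u / sigma2)

def e_step(Y, X, pis, betas, phis, sigma2s):
    """Posterior membership probabilities: one row per series, one column per component."""
    logw = np.array([
        [np.log(pi) + ar_regression_loglik(y, X, b, ph, s2)
         for pi, b, ph, s2 in zip(pis, betas, phis, sigma2s)]
        for y in Y
    ])
    logw -= logw.max(axis=1, keepdims=True)  # shift for numerical stability
    w = np.exp(logw)
    return w / w.sum(axis=1, keepdims=True)
```

Working on the log scale and subtracting the row maximum before exponentiating keeps the weights finite for long series, where the raw likelihoods would underflow. An EM-type algorithm would alternate this step with (conditional) maximization steps that update the mixing proportions, regression coefficients, autoregressive coefficients, and variances.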

Research Organization:
Oak Ridge National Laboratory (ORNL)
Sponsoring Organization:
SC USDOE - Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1033186
Journal Information:
Statistical Analysis and Data Mining, Vol. 4, Issue 6; ISSN 1932-1864
Country of Publication:
United States
Language:
English
