A Doubly Enhanced EM Algorithm for Model-Based Tensor Clustering
成果类型:
Article
署名作者:
Mai, Qing; Zhang, Xin; Pan, Yuqing; Deng, Kai
署名单位:
State University System of Florida; Florida State University
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1080/01621459.2021.1904959
发表日期:
2022
页码:
2120-2134
关键词:
feature-selection
DISCRIMINANT-ANALYSIS
maximum-likelihood
variable selection
number
regression
decompositions
CLASSIFICATION
CONVERGENCE
mixtures
摘要:
Modern scientific studies often collect datasets in the form of tensors. These datasets call for innovative statistical analysis methods. In particular, there is a pressing need for tensor clustering methods to understand the heterogeneity in the data. We propose a tensor normal mixture model approach to enable probabilistic interpretation and computational tractability. Our statistical model leverages the tensor covariance structure to reduce the number of parameters for parsimonious modeling, and at the same time explicitly exploits the correlations for better variable selection and clustering. We propose a doubly enhanced expectation-maximization (DEEM) algorithm to perform clustering under this model. Both the expectation-step and the maximization-step are carefully tailored for tensor data in order to maximize statistical accuracy and minimize computational costs in high dimensions. Theoretical studies confirm that DEEM achieves consistent clustering even when the dimension of each mode of the tensors grows at an exponential rate of the sample size. Numerical studies demonstrate favorable performance of DEEM in comparison to existing methods.