VARIATIONAL BAYESIAN ANALYSIS OF NONHOMOGENEOUS HIDDEN MARKOV MODELS WITH LONG AND ULTRALONG SEQUENCES
成果类型:
Article
署名作者:
Chen, Xinyuan; Li, Yiwei; Feng, Xiangnan; Chang, Joseph T.
署名单位:
Mississippi State University; Lingnan University; Fudan University; Yale University
刊物名称:
ANNALS OF APPLIED STATISTICS
ISSN/ISSBN:
1932-6157
DOI:
10.1214/22-AOAS1685
发表日期:
2023
页码:
1615-1640
关键词:
inference
approximation
摘要:
Nonhomogeneous hidden Markov models (NHMMs) are useful in mod-eling sequential and autocorrelated data. Bayesian approaches, particularly Markov chain Monte Carlo (MCMC) methods, are principal statistical in-ference tools for NHMMs. However, MCMC sampling is computationally demanding, especially for long observation sequences. We develop a vari-ational Bayes (VB) method for NHMMs, which utilizes a structured varia-tional family of Gaussian distributions with factorized covariance matrices to approximate target posteriors, combining a forward-backward algorithm and stochastic gradient ascent in estimation. To improve efficiency and handle ul-tralong sequences, we further propose a subsequence VB (SVB) method that works on subsamples. The SVB method exploits the memory decay property of NHMMs and uses buffers to control for bias caused by breaking sequen-tial dependence from subsampling. We highlight that the local nonhomogene-ity of NHMMs substantially affects the required buffer lengths and propose the use of local Lyapunov exponents that characterize local memory decay rates of NHMMs and adaptively determine buffer lengths. Our methods are validated in simulation studies and in modeling ultralong sequences of cus-tomers' telecom records to uncover the relationship between their mobile In-ternet usage behaviors and conventional telecommunication behaviors.
来源URL: