PREDICTION OF THE NASH THROUGH PENALIZED MIXTURE OF LOGISTIC REGRESSION MODELS
成果类型:
Article
署名作者:
Morvan, Marie; Devijver, Emilie; Giacofci, Madison; Monbet, Valerie
署名单位:
Universite de Rennes; Centre National de la Recherche Scientifique (CNRS); CNRS - National Institute for Mathematical Sciences (INSMI); Communaute Universite Grenoble Alpes; Universite Grenoble Alpes (UGA); Inria; Centre National de la Recherche Scientifique (CNRS); Institut National Polytechnique de Grenoble
刊物名称:
ANNALS OF APPLIED STATISTICS
ISSN/ISSBN:
1932-6157
DOI:
10.1214/20-AOAS1409
发表日期:
2021
页码:
952-970
关键词:
finite mixture
maximum-likelihood
variable selection
em algorithm
摘要:
In this paper an appropriate and interpretable diagnosis statistical model is proposed to predict Nonalcoholic Steatohepatitis (NASH) from near infrared spectrometry data. In this disease, unknown patients' profiles are expected to lead to a different diagnosis. The model has then to take into account the heterogeneity of the data and the dimension of the spectrometric data. To this end, we propose to fit a mixture model on the joint distribution of the diagnostic binary variable and the covariates selected in the spectra. The penalized maximum likelihood estimator is considered. In practice, a twofold penalty on both regression coefficients and covariance parameters is imposed. Automatic selection criteria, such as the AIC and BIC, are used to select the amount of shrinkage and the number of clusters. The performance of the overall procedure is evaluated by a simulation study, and its application on the NASH data set is analyzed. The model leads to better prediction performance than competitive methods and provides highly interpretable results.
来源URL: