PAN-DISEASE CLUSTERING ANALYSIS OF THE TREND OF PERIOD PREVALENCE
成果类型:
Article
署名作者:
Jadhav, Sneha; Ma, Chenjin; Jiang, Yefei; Shia, Ben-Chang; Ma, Shuangge
署名单位:
Wake Forest University; Beijing University of Technology; Fu Jen Catholic University; Yale University
刊物名称:
ANNALS OF APPLIED STATISTICS
ISSN/ISSBN:
1932-6157
DOI:
10.1214/21-AOAS1470
发表日期:
2021
页码:
1945-1958
关键词:
HEART-DISEASE
cancer
mortality
survival
RISK
摘要:
Prevalence is of essential importance in biomedical and public health research. In the classic paradigm it has been studied for each disease individually. Accumulating evidence has shown that diseases can be correlated. Joint analysis of prevalence can potentially provide important insights beyond individual-disease analysis but has not been well pursued. In this study we take advantage of the unique Taiwan National Health Insurance Research Database (NHIRD) and conduct the first pan-disease analysis of period prevalence trend. The goal is to identify clusters within which diseases have similar period prevalence trends. A novel penalization pursuit approach is applied which has an intuitive formulation and preferable numerical performance. In data analysis the period prevalence values are computed using the records on close to one million subjects and 14 years of observation. With 405 diseases, 35 clusters with sizes larger than one and 27 clusters with sizes one are identified. The clustering results have sound interpretations and differ significantly from those of the alternatives.
来源URL: