INTEGRATIVE SPARSE K-MEANS WITH OVERLAPPING GROUP LASSO IN GENOMIC APPLICATIONS FOR DISEASE SUBTYPE DISCOVERY
成果类型:
Article
署名作者:
Huo, Zhiguang; Tseng, George
署名单位:
Pennsylvania Commonwealth System of Higher Education (PCSHE); University of Pittsburgh; Pennsylvania Commonwealth System of Higher Education (PCSHE); University of Pittsburgh
刊物名称:
ANNALS OF APPLIED STATISTICS
ISSN/ISSBN:
1932-6157
DOI:
10.1214/17-AOAS1033
发表日期:
2017
页码:
1011-1039
关键词:
molecular subtypes
cancer
breast
prediction
selection
MODEL
CLASSIFICATION
metaanalysis
validation
FRAMEWORK
摘要:
Cancer subtypes discovery is the first step to deliver personalized medicine to cancer patients. With the accumulation of massive multi-level omics datasets and established biological knowledge databases, omics data integration with incorporation of rich existing biological knowledge is essential for deciphering a biological mechanism behind the complex diseases. In this manuscript, we propose an integrative sparse K-means (IS-Kmeans) approach to discover disease subtypes with the guidance of prior biological knowledge via sparse overlapping group lasso. An algorithm using an alternating direction method of multiplier (ADMM) will be applied for fast optimization. Simulation and three real applications in breast cancer and leukemia will be used to compare IS-Kmeans with existing methods and demonstrate its superior clustering accuracy, feature selection, functional annotation of detected molecular features and computing efficiency.
来源URL: