Machine learning-based penetrance of genetic variants

成果类型:
Article
署名作者:
Forrest, Iain S.; Vy, Ha My T.; Rocheleau, Ghislain; Jordan, Daniel M.; Petrazzini, Ben O.; Nadkarni, Girish N.; Cho, Judy H.; Ganapathi, Mythily; Huang, Kuan-Lin; Chung, Wendy K.; Do, Ron
署名单位:
Icahn School of Medicine at Mount Sinai; Icahn School of Medicine at Mount Sinai; Icahn School of Medicine at Mount Sinai; Icahn School of Medicine at Mount Sinai; Icahn School of Medicine at Mount Sinai; Columbia University; NewYork-Presbyterian Hospital; Harvard University; Harvard University Medical Affiliates; Boston Children's Hospital; Harvard Medical School
刊物名称:
SCIENCE
ISSN/ISSBN:
0036-8262
DOI:
10.1126/science.adm7066
发表日期:
2025-08-28
关键词:
hypertrophic cardiomyopathy variable expressivity reduced penetrance medical genetics american-college diagnosis genomics identification GUIDELINES RISK
摘要:
Accurate variant penetrance estimation is crucial for precision medicine. We constructed machine learning (ML) models for 10 diseases using 1,347,298 participants with electronic health records, then applied them to an independent cohort with linked exome data. Resulting probabilities were used to evaluate ML penetrance of 1648 rare variants in 31 autosomal dominant disease-predisposition genes. ML penetrance was variable across variant classes, but highest for pathogenic and loss-of-function variants, and was associated with clinical outcomes and functional data. Compared with conventional case-versus-control approaches, ML penetrance provided refined quantitative estimates and aided the interpretation of variants of uncertain significance and loss-of-function variants by delineating clinical trajectories over time. By leveraging ML and deep phenotyping, we present a scalable approach to accurately quantify disease risk of variants.