Toward equitable major histocompatibility complex binding predictions
成果类型:
Article
署名作者:
Glynn, Eric; Ghersi, Dario; Singh, Mona
署名单位:
Princeton University; Princeton University; University of Nebraska System; University of Nebraska Omaha; Princeton University
刊物名称:
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
ISSN/ISSBN:
0027-9919
DOI:
10.1073/pnas.2405106122
发表日期:
2025-02-25
关键词:
摘要:
Deep learning tools that predict peptide binding by major histocompatibility complex (MHC) proteins play an essential role in developing personalized cancer immunotherapies and vaccines. In order to ensure equitable health outcomes from their application, MHC binding prediction methods must work well across the vast landscape of MHC alleles observed across human populations. Here, we show that there are alarming disparities across individuals in different racial and ethnic groups in how much binding data are associated with their MHC alleles. We introduce a machine learning framework to assess the impact of this data imbalance for predicting binding for any given MHC allele, and apply it to develop a state- of- the- art MHC binding prediction model that additionally provides per- allele performance estimates. We demonstrate that our MHC binding model successfully mitigates much of the data disparities observed across racial groups. To address remaining inequities, we devise an algorithmic strategy for targeted data collection. Our work lays the foundation for further development of equitable MHC binding models for use in personalized immunotherapies.