Rapid in silico directed evolution by a protein language model with EVOLVEpro

成果类型:
Article
署名作者:
Jiang, Kaiyi; Yan, Zhaoqing; Di Bernardo, Matteo; Sgrizzi, Samantha R.; Villiger, Lukas; Kayabolen, Alisan; Kim, B. J.; Carscadden, Josephine K.; Hiraizumi, Masahiro; Nishimasu, Hiroshi; Gootenberg, Jonathan S.; Abudayyeh, Omar O.
署名单位:
Harvard University; Harvard University Medical Affiliates; Brigham & Women's Hospital; Harvard Medical School; Harvard University; Harvard Medical School; Harvard University; Harvard Medical School; Harvard University Medical Affiliates; Beth Israel Deaconess Medical Center; Massachusetts Institute of Technology (MIT); Massachusetts Institute of Technology (MIT); Whitehead Institute; Kantonsspital St. Gallen; Massachusetts Institute of Technology (MIT); University of Tokyo; University of Tokyo
刊物名称:
SCIENCE
ISSN/ISSBN:
0036-13394
DOI:
10.1126/science.adr6006
发表日期:
2025-01-24
页码:
377-+
关键词:
prediction algorithm compact DESIGN set
摘要:
Directed protein evolution is central to biomedical applications but faces challenges such as experimental complexity, inefficient multiproperty optimization, and local maxima traps. Although in silico methods that use protein language models (PLMs) can provide modeled fitness landscape guidance, they struggle to generalize across diverse protein families and map to protein activity. We present EVOLVEpro, a few-shot active learning framework that combines PLMs and regression models to rapidly improve protein activity. EVOLVEpro surpasses current methods, yielding up to 100-fold improvements in desired properties. We demonstrate its effectiveness across six proteins in RNA production, genome editing, and antibody binding applications. These results highlight the advantages of few-shot active learning with minimal experimental data over zero-shot predictions. EVOLVEpro opens new possibilities for artificial intelligence-guided protein engineering in biology and medicine.