Ab initio characterization of protein molecular dynamics with AI2BMD

成果类型:
Article
署名作者:
Wang, Tong; He, Xinheng; Li, Mingyu; Li, Yatao; Bi, Ran; Wang, Yusong; Cheng, Chaoran; Shen, Xiangzhen; Meng, Jiawei; Zhang, He; Liu, Haiguang; Wang, Zun; Li, Shaoning; Shao, Bin; Liu, Tie-Yan
署名单位:
Microsoft; Microsoft China
刊物名称:
Nature
ISSN/ISSBN:
0028-6424
DOI:
10.1038/s41586-024-08127-z
发表日期:
2024-11-28
关键词:
摘要:
Biomolecular dynamics simulation is a fundamental technology for life sciences research, and its usefulness depends on its accuracy and efficiency1-3. Classical molecular dynamics simulation is fast but lacks chemical accuracy4,5. Quantum chemistry methods such as density functional theory can reach chemical accuracy but cannot scale to support large biomolecules6. Here we introduce an artificial intelligence-based ab initio biomolecular dynamics system (AI2BMD) that can efficiently simulate full-atom large biomolecules with ab initio accuracy. AI2BMD uses a protein fragmentation scheme and a machine learning force field7 to achieve generalizable ab initio accuracy for energy and force calculations for various proteins comprising more than 10,000 atoms. Compared to density functional theory, it reduces the computational time by several orders of magnitude. With several hundred nanoseconds of dynamics simulations, AI2BMD demonstrated its ability to efficiently explore the conformational space of peptides and proteins, deriving accurate 3J couplings that match nuclear magnetic resonance experiments, and showing protein folding and unfolding processes. Furthermore, AI2BMD enables precise free-energy calculations for protein folding, and the estimated thermodynamic properties are well aligned with experiments. AI2BMD could potentially complement wet-lab experiments, detect the dynamic processes of bioactivities and enable biomedical research that is impossible to conduct at present. A study introduces AI2BMD, an artificial intelligence-based dynamics simulation program that uses protein fragmentation with a machine learning force field to accurately and efficiently model the folding and unfolding of large proteins.