Large-scale chemoproteomics expedites ligand discovery and predicts ligand behavior in cells

成果类型:
Article
署名作者:
Offensperger, Fabian; Tin, Gary; Duran-Frigola, Miquel; Hahn, Elisa; Dobner, Sarah; am Ende, Christopher W.; Strohbach, Joseph W.; Rukavina, Andrea; Brennsteiner, Vincenth; Ogilvie, Kevin; Marella, Nara; Kladnik, Katharina; Ciuffa, Rodolfo; Majmudar, Jaimeen D.; Field, S. Denise; Bensimon, Ariel; Ferrari, Luca; Ferrada, Evandro; Ng, Amanda; Zhang, Zhechun; Degliesposti, Gianluca; Boeszoermenyi, Andras; Martens, Sascha; Stanton, Robert; Mueller, Andre C.; Hannich, J. Thomas; Hepworth, David; Superti-Furga, Giulio; Kubicek, Stefan; Schenone, Monica; Winter, Georg E.
署名单位:
Austrian Academy of Sciences; CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences; Pfizer; Pfizer USA; Pfizer; Pfizer USA; Medical University of Vienna; University of Vienna; Vienna Biocenter (VBC); Max F. Perutz Laboratories (MFPL); Medical University of Vienna; University of Vienna; Vienna Biocenter (VBC); Max F. Perutz Laboratories (MFPL); Pfizer; Pfizer USA; Medical University of Vienna
刊物名称:
SCIENCE
ISSN/ISSBN:
0036-12238
DOI:
10.1126/science.adk5864
发表日期:
2024-04-26
关键词:
targeted protein-degradation therapeutics map
摘要:
INTRODUCTION Chemical modulation of protein function is an important experimental approach to illuminate biological mechanisms and represents the most frequently used strategy to treat human disease. Nevertheless, around 80% of the human proteome lacks annotated small-molecule ligands, thus leaving many proteins, including validated disease targets, outside the reach of mechanistic elucidation and therapeutic innovation. RATIONALE To close this gap, unbiased approaches to advance ligand discovery are urgently needed. We set out to determine the proteome-wide binding preferences of more than 400 small-molecule fragments through a chemoproteomics strategy that is based on treatment of intact cells. With these data at hand, we aimed to (i) identify hundreds of fragment-protein interactions and advance selected fragments toward cell-active ligands, (ii) leverage machine learning (ML) binary classifiers to develop models to predict small-molecule behavior in native biological systems, and (iii) build an interactive open-source interface to empower the broad exploration of the data and of all predictive models. RESULTS Through this quantitative chemoproteomics strategy, we experimentally determined the interactome of 407 small-molecule fragments. This led to the identification of 47,658 discrete fragment-protein interactions involving more than 2600 proteins, of which 86% previously lacked any annotated ligand. To provide evidence for the translational potential of these starting points, we advanced various hits toward elaborated fragments. With focused synthetic efforts, we developed ligands that (i) engage the E3 ligase adaptor protein DDB1, (ii) functionally block the human equilibrative nucleoside transporter SLC29A1 (hENT1), or (iii) selectively inhibit a subset of cyclin dependent kinases (CDKs), including the orphan CDK16. In addition to advancing individual fragment-protein hits, we leveraged the depth of the global dataset to develop an ML framework to build models that can predict how fragments interact with native proteins on a proteome-wide scale. This framework included inference of quantitative fragment interactomes, which enabled us to predict to how many proteins a given fragment will bind and whether the bound proteins themselves are chemically broadly accessible or otherwise typically refractory to small-molecule ligands. Moreover, ML models allowed us to capture and predict qualitative interactome signatures. This made it possible for us to investigate and predict whether fragments tend to interact with subsets of proteins of coherent function, such as transporters or RNA-binding proteins. Likewise, ML models allowed us to analyze and predict whether fragments tend to interact with groups of proteins that reside in defined subcellular localizations or compartments, such as lysosomes or mitochondria, which can be indicative of intracellular fragment partitioning and accumulation. Last, we have also provided a platform to develop bespoke ML models that are based on a user-defined input of target proteins, and hence enable the prediction of fragment binding to a custom set of proteins. CONCLUSION Our large-scale chemical proteomics survey led to the identification of hundreds of fragment-protein interactions that are poised for future exploration and chemical optimization. Moreover, we found that the generated data is amenable to ML-based models that enable us to predict how chemical matter interacts with native proteomes in intact cells by using their chemical structure as input. To maximize the practical use for the scientific community, all interactomes, enrichment tools, and ML models have been made publicly available for exploration through a web-based application (https://ligand-discovery.ai). Collectively, these data and tools should form a resource to interpret fragment-binding data and expedite ligand discovery efforts. Schematic representation of the ligand discovery approach. Chemoproteomics was used to assess 407 small-molecule fragments. Hundreds of fragment-protein interactions were identified as starting points for probe development. System-level analyses coupled to machine learning enabled prediction of fragment binding and behavior in living cells. An interactive web resource has been provided for data exploration, which also allows the generation and application of bespoke predictive models.