Accelerating protein engineering with fitness landscape modelling and reinforcement learning

Haoran Sun; Liang He; Pan Deng; Guoqing Liu; Zhiyu Zhao; Yuliang Jiang; Chuan Cao; Fusong Ju; Lijun Wu; Haiguang Liu; Tao Qin; Tie-Yan Liu

doi:10.1038/s42256-025-01103-w

化学、生物与自动化实验室颠覆级有讲解视频

策展与解读：DAST AI · 收录方法与内容透明度

发表时间: 2025-09-08
DOI: 10.1038/s42256-025-01103-w

收录解读

蛋白工程的核心瓶颈一直是实验搜索空间过大：单点突变还能穷举，多位点组合一旦展开，湿实验成本和迭代速度都会迅速失控。这篇论文正面解决的是“如何把实验蛋白工程变成可持续、可加速的主动探索过程”，而不是只再做一个更准的序列打分器。

论文把 fitness landscape modelling 和 reinforcement learning 结合起来，形成面向实验迭代的闭环：模型不只预测当前序列的优劣，还主动决定下一轮该探索哪里，从而在有限实验预算下更快找到高性能变体。这个 framing 的关键在于，它把 protein engineering 从静态筛选问题改写成 sequential decision-making 问题。

这篇论文值得高于普通 breakthrough，因为它给出了蛋白工程中很强的可复用模式：代理模型、实验回传、策略更新和多轮探索统一在一个可迭代回路里。对 protein engineering、enzyme optimization、directed evolution 甚至更广的实验设计型 AI for science 都有明显外溢。

它还没有升到 paradigm，原因是适用性和通用性仍需在更多蛋白家族、更多性质目标和更复杂实验条件下进一步证明。但作为把 RL 明确推进到实验蛋白工程闭环里的代表工作，它已经足够 disruptive。

原始摘要与中文对照

中文对照翻译

蛋白质工程在设计具有定制功能的蛋白质方面具有巨大潜力，然而，潜在突变的广阔图景与有限的实验室能力限制了最佳序列的发现。为此，我们提出了μProtein框架，该框架通过结合μFormer（一个用于准确突变效应预测的深度学习模型）与μSearch（一个旨在利用μFormer作为预言机高效探索蛋白质适应度景观的强化学习算法）来加速蛋白质工程。μProtein利用单点突变数据，通过其对上位性相互作用的建模和多步搜索策略，预测具有复杂多氨基酸突变的最佳序列。除了在基准数据集上表现出色外，μProtein还为酶β-内酰胺酶识别出高功能增益的多点突变体，这些突变体在湿实验室中，仅通过单点突变数据训练，就超越了已知最高活性水平之一。这些结果证明了μProtein在广阔的蛋白质序列空间中发现有影响力的突变的能力，为蛋白质优化提供了一种稳健高效的方法。

原始摘要

Protein engineering holds substantial promise for designing proteins with customized functions, yet the vast landscape of potential mutations versus limited laboratory capacity constrains the discovery of optimal sequences. Here, to address this, we present the μProtein framework, which accelerates protein engineering by combining μFormer, a deep learning model for accurate mutational effect prediction, with μSearch, a reinforcement learning algorithm designed to efficiently navigate the protein fitness landscape using μFormer as an oracle. μProtein leverages single-mutation data to predict optimal sequences with complex, multi-amino-acid mutations through its modelling of epistatic interactions and a multi-step search strategy. In addition to strong performance on benchmark datasets, μProtein identified high-gain-of-function multi-point mutants for the enzyme β-lactamase, surpassing one of the highest-known activity levels, in wet laboratory, trained solely on single-mutation data. These results demonstrate μProtein’s capability to discover impactful mutations across the vast protein sequence space, offering a robust, efficient approach for protein optimization.

解读视频

视频观看页 B 站 YouTube

链接

论文链接

收录解读

原始摘要与中文对照

中文对照翻译

原始摘要

解读视频

相关论文

链接