SMDD-Bench: Can LLMs Solve Real-World Small Molecule Drug Design Tasks?

科学发现旗舰工作突破级暂无讲解视频

发表时间: 2026-05-20
arXiv: 2605.21740

收录解读

这篇论文提供了面向小分子药物设计 agent 的高难度标准化评测。

它把药物设计从单轮性质预测提升为多轮、长程、工具约束的真实设计任务，覆盖 scaffold hopping、lead optimization 等关键环节。

它值得收录，因为它为 autonomous computational drug design 提供了可复用 benchmark 和 leaderboard。

局限在于当前证据主要来自预印本实验与作者自建评测，后续需要独立复现和更大范围部署验证。

原始摘要与中文对照

中文对照翻译

SMDD-Bench：LLM 能解决真实世界的小分子药物设计任务吗？LLM 智能体在科学发现应用方面具有巨大的潜力。然而，LLM 智能体在真实世界、跨多样化学和靶点的小分子药物设计 (SMDD) 任务上的表现尚不明确。当前的评估方法要么是临时的、对真实世界发现来说过于简单、规模有限，要么仅限于单轮问答。为了标准化 LLM 智能体在小分子设计方面的评估，我们引入了 SMDDBench，这是一个具有挑战性的、多轮、长周期智能体基准，由

原始摘要

LLM agents have incredible potential for scientific discovery applications. However, the performance of LLM agents on real-world, small molecule drug design (SMDD) tasks across diverse chemistries and targets is unclear. Current evaluation methods are either ad hoc, too simple for real-world discovery, limited in scale, or restricted to single-turn question answering. In effort to standardize the evaluation of LLM agents on small molecule design, we introduce SMDDBench, a challenging, multi-turn, long-horizon agentic benchmark consisting of

链接

论文链接论文链接

收录解读

原始摘要与中文对照

中文对照翻译

原始摘要

相关论文

链接