HarnessBridge: Learnable Bidirectional Controller for LLM Agent Harness

智能体与自主科学突破级暂无讲解视频

发表时间: 2026-06-11
arXiv: 2606.12882

收录解读

这篇论文把 agent harness 从人工工程层推进到可学习控制器。

它用 observation projection 和 action projection 双向建模 agent-environment interface，压缩轨迹并约束动作执行。

它值得收录，因为 harness 正成为 agent 能力边界的重要系统层。

局限在于当前证据主要来自预印本实验与作者自建评测，后续需要独立复现和更大范围部署验证。

原始摘要与中文对照

中文对照翻译

HarnessBridge 探讨了智能体线束能否由一个可学习的即插即用模块生成。它将智能体-环境接口参数化为双向投影：观测投影将原始轨迹提炼成决策相关状态，而动作投影则将提议的动作转换为可执行的转换或基于轨迹的拒绝。它通过统一指令微调进行训练，并在 Terminal-Bench 2.0 和 SWE-bench Verified 上进行评估，匹配或超越了专用线束，同时减少了 token 使用量和轨迹长度。

原始摘要

HarnessBridge asks whether the agent harness can be generated by a learnable plug-in module. It parameterizes the agent-environment interface as bidirectional projections: observation projection distills raw trajectories into decision-relevant states, while action projection converts proposed actions into executable transitions or trajectory-grounded rejections. It is trained with unified instruction tuning and evaluated on Terminal-Bench 2.0 and SWE-bench Verified, matching or surpassing specialized harnesses while reducing token usage and trajectory length.

链接

论文链接论文链接代码

收录解读

原始摘要与中文对照

中文对照翻译

原始摘要

相关论文

链接