Self-Refine: Iterative Refinement with Self-Feedback

智能体与自主科学突破级暂无讲解视频

发表时间: 2023-03-30
arXiv: 2303.17651

收录解读

Self-Refine studies a broad pattern that later became ubiquitous in LLM systems: generate an answer, critique it in natural language, and then rewrite it using that critique. The paper is not limited to one task or one agent benchmark; it instead tests the generality of iterative self-feedback as a reusable inference-time improvement mechanism.

Its main technical contribution is to show that a single LLM can alternate between generator, critic, and refiner roles without parameter updates or external training signals. That makes refinement a modular control loop rather than a task-specific recipe. The idea is simple, but it generalized unusually well and became part of the default toolkit for many later agent and reasoning systems.

For this repository, Self-Refine matters because it is one of the cleanest reusable formulations of language-space iterative improvement. It sits directly on the path from static prompting to self-correcting agents, and it remains a useful reference when evaluating newer systems that claim self-improvement via reflection, critique, or revision.

It is not ranked higher because the paper is intentionally broad and lightweight rather than a field-redefining systems result. It does not solve memory, tool use, or long-horizon adaptation by itself. But as a durable primitive for self-improving inference, it clearly deserves formal collection.

链接

论文链接