ReflexGrad: Within-Episode Failure Recovery in LLM Agents via Progress-Gated Dual-Process Routing

arXiv:2511.14584v3 Announce Type: replace Abstract: We present ReflexGrad, a dual-process architecture for within-episode failure recovery in LLM agents without demonstrations. When agents commit to a wrong approach early and exhaust the step budget, the post-failure trajectory contains the information to escape -- but no published architecture acts on it within a single episode. ReflexGrad routes between a fast process (TextGrad-style continuous refinement every $k{=}3$ steps) and a slow process (Reflexion-style causal diagnosis when $m{=}5$ consecutive low-progress scores fire a routing gate
This research addresses a critical limitation of current LLM agents regarding their ability to recover from failures within a single operational episode, leveraging post-failure trajectory data. The continuous improvement in LLM agent architectures reflects a drive towards more robust and autonomous systems.
This development significantly enhances the reliability and efficiency of LLM agents, enabling them to self-correct and complete complex tasks without human intervention or restarting. It pushes the frontier of agentic AI towards greater autonomy and practical applicability.
LLM agents can now perform more reliably in dynamic and unpredictable environments by autonomously identifying and rectifying errors within a task execution, reducing the need for costly restarts or human oversight.
- · AI agent developers
- · Businesses adopting AI agents
- · Deep learning researchers
- · Companies relying on static, non-adaptive automation
- · Competitors with less robust agent architectures
Increased successful task completion rates for LLM agents across various applications.
Accelerated adoption of autonomous AI agents in areas requiring high reliability and self-correction, such as advanced customer service or complex process automation.
Reduced operational costs and increased efficiency across sectors due to highly autonomous and resilient AI systems, potentially leading to new business models built directly on agentic capabilities.
This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.
Read at arXiv cs.LG