Improving Long-Horizon LLM Planning via Symbolic Feedback

The Challenge of Long-Horizon Planning

Large Language Models often struggle with long-horizon decision-making, frequently generating infeasible or logically incorrect plans. These failures stem from the model's inability to maintain strict adherence to task constraints and semantics over extended sequences. The proposed framework addresses this by moving away from pure natural language generation toward a structured, feedback-driven refinement loop.

The core of the framework is a three-part mechanism designed to enforce reliability:

Symbolic Mapping: The system maps logical symbols into natural language descriptions. This bridges the gap between formal constraints and the model's linguistic capabilities, ensuring the LLM understands the underlying task requirements.
Symbolic Verifier: Instead of relying on the LLM to self-correct based on vague intuition, a dedicated verifier checks the generated plan against formal constraints. When errors are detected, the verifier translates these failures into specific, actionable corrective instructions that the LLM can interpret and execute.
Plan Recognizer: To prevent the model from drifting, a plan recognizer evaluates goal reachability. This provides a signal that guides the refinement process toward valid, achievable end-states.

By combining these components, the framework enables an iterative self-refinement loop where the LLM continuously updates its plan based on concrete, symbolic feedback rather than probabilistic guessing. This approach consistently improves both the feasibility and correctness of plans in complex, multi-step tasks.

Edge

Improving Long-Horizon LLM Planning via Symbolic Feedback

The Challenge of Long-Horizon Planning

Symbolic Feedback and Iterative Refinement

The Challenge of Long-Horizon Planning

Symbolic Feedback and Iterative Refinement

More from Agents & Orchestration

Agent-Native Immune System (ANIS): Architecture for Runtime Defense

Personality Prompting in Multi-Agent Teams: Task-Dependent Impact

Internalizing Future-Aware Planning in LLM Agents

Claude Tag: Moving AI from Chat to Team-Based Delegation