Bridging Values and Behavior: A Hierarchical Framework for Proactive Embodied Agents
arXiv cs.AI / 5/1/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that many embodied agents rely on passive instruction-following or reactive behaviors, which prevents stable, long-term, value-guided self-direction and proper resolution of motivational conflicts.
- It introduces ValuePlanner, a hierarchical architecture that separates high-level value scheduling from low-level action execution, using an LLM to reason over abstract value trade-offs and a classical PDDL planner to turn subgoals into executable plans.
- The system is improved with a closed-loop feedback mechanism to refine planning and execution over time.
- To evaluate autonomy beyond simple task success, the authors propose a value-centric evaluation suite that measures cumulative value gain, preference alignment, and behavioral diversity.
- Experiments in the TongSim household environment show that ValuePlanner can arbitrate competing values and produce coherent, long-horizon, self-directed behavior compared with instruction-following and needs-driven baselines.
Related Articles

Why Autonomous Coding Agents Keep Failing — And What Actually Works
Dev.to

Why Enterprise AI Pilots Fail
Dev.to

The PDF Feature Nobody Asked For (That I Use Every Day)
Dev.to

How to Fix OpenClaw Tool Calling Issues
Dev.to

Mistral's new flagship Medium 3.5 folds chat, reasoning, and code into one model
THE DECODER