The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning
arXiv cs.RO / 3/23/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces CRISP (Critique-and-Replan for Interactive Social Presence), an autonomous framework that enables a robot to critique and replan its own actions by using a Vision-Language Model as a social critic.
- CRISP workflow includes extracting movable joints and constraints from the robot's description file (e.g., MJCF), generating step-by-step behavior plans from context, producing low-level joint control code via visual information, a VLM-based evaluation of social appropriateness, and iterative refinement through reward-based search.
- The approach is not tied to a specific robot API and can generate subtly different, human-like motions across platforms using only the robot's structure file.
- In a user study spanning five robot types and 20 scenarios, CRISP achieved significantly higher preference and situational appropriateness ratings than previous methods.
- Detailed result videos and supplementary information are available at the project page linked in the paper.
Related Articles

Interactive Web Visualization of GPT-2
Reddit r/artificial
Stop Treating AI Interview Fraud Like a Proctoring Problem
Dev.to
[R] Causal self-attention as a probabilistic model over embeddings
Reddit r/MachineLearning
The 5 software development trends that actually matter in 2026 (and what they mean for your startup)
Dev.to
InVideo AI Review: Fast Finished
Dev.to