WM-DAgger: Enabling Efficient Data Aggregation for Imitation Learning with World Models
arXiv cs.RO / 4/14/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses imitation learning’s compounding-error problem, where small inaccuracies can push robots into out-of-distribution states that cause escalating failures.
- It introduces WM-DAgger, a data aggregation framework that uses world models to synthesize OOD recovery data without continuous human labeling, improving scalability beyond standard DAgger.
- To reduce the risk of “misleading supervision” from world-model hallucinations, it adds a Corrective Action Synthesis Module for task-oriented recovery actions.
- It further adds a Consistency-Guided Filtering Module that rejects physically implausible trajectories by anchoring synthesized terminal frames to real expert-demonstration frames.
- Experiments on multiple real-world manipulation tasks show substantial gains, including 93.3% success in soft bag pushing using only five demonstrations, and the authors provide public code.
💡 Insights using this article
This article is featured in our daily AI news digest — key takeaways and action items at a glance.
Related Articles

Black Hat Asia
AI Business

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Don't forget, there is more than forgetting: new metrics for Continual Learning
Dev.to

Microsoft MAI-Image-2-Efficient Review 2026: The AI Image Model Built for Production Scale
Dev.to
Bit of a strange question?
Reddit r/artificial