CubeDAgger: Interactive Imitation Learning for Dynamic Systems with Efficient yet Low-risk Interaction
arXiv cs.RO / 4/23/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses a limitation of interactive imitation learning in dynamic systems, where mismatched supervision timing can cause abrupt actions that destabilize a robot.
- It introduces CubeDAgger, built on a baseline called EnsembleDAgger, to improve robustness while reducing dynamic stability violations in dynamic tasks.
- CubeDAgger incorporates three key enhancements: supervision-timing threshold regularization, an optimal consensus mechanism over multiple expert/action candidates, and autoregressive colored-noise injection for temporally consistent exploration.
- Simulation results indicate the learned policies are robust and maintain dynamic stability during interaction.
- Real-robot scooping experiments show the method can learn from scratch using only about 30 minutes of interaction with a human expert.
Related Articles

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans
Dev.to

10 AI Tools Every Developer Should Try in 2026
Dev.to

Why use an AI gateway at all?
Dev.to

OpenAI Just Named It Workspace Agents. We Open-Sourced Our Lark Version Six Months Ago
Dev.to