HiPolicy: Hierarchical Multi-Frequency Action Chunking for Policy Learning

arXiv cs.RO / 4/8/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces HiPolicy, a hierarchical multi-frequency action chunking framework for robotic imitation learning that aims to balance long-horizon dependency modeling with fine-grained closed-loop control.
  • HiPolicy predicts action sequences at multiple frequencies by extracting and fusing history-conditioned features aligned to each temporal scale, enabling both coarse high-level plans and precise reactive motions.
  • It adds an entropy-guided execution mechanism that adaptively trades off planning horizon versus control precision based on action uncertainty.
  • Experiments across simulated benchmarks and real-world manipulation tasks indicate that HiPolicy can be integrated into existing 2D and 3D generative policies, improving performance while boosting execution efficiency.

Abstract

Robotic imitation learning faces a fundamental trade-off between modeling long-horizon dependencies and enabling fine-grained closed-loop control. Existing fixed-frequency action chunking approaches struggle to achieve both. Building on this insight, we propose HiPolicy, a hierarchical multi-frequency action chunking framework that jointly predicts action sequences at different frequencies to capture both coarse high-level plans and precise reactive motions. We extract and fuse hierarchical features from history observations aligned to each frequency for multi-frequency chunk generation, and introduce an entropy-guided execution mechanism that adaptively balances long-horizon planning with fine-grained control based on action uncertainty. Experiments on diverse simulated benchmarks and real-world manipulation tasks show that HiPolicy can be seamlessly integrated into existing 2D and 3D generative policies, delivering consistent improvements in performance while significantly enhancing execution efficiency.