HEX: Humanoid-Aligned Experts for Cross-Embodiment Whole-Body Manipulation
arXiv cs.RO / 4/10/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- HEX introduces a state-centric framework for stable whole-body manipulation on full-sized bipedal humanoid robots, addressing the instability that arises when VLA models treat body parts independently.
- The approach uses a humanoid-aligned universal state representation to enable scalable learning across heterogeneous robot embodiments and incorporates a Mixture-of-Experts proprioceptive predictor for coordinated motion modeling.
- HEX leverages lightweight history tokens to retain temporal visual context efficiently, reducing the need to repeatedly encode past images during inference.
- A residual-gated fusion mechanism combined with a flow-matching action head integrates visual-language cues with proprioceptive dynamics to generate actions.
- Real-world humanoid manipulation experiments report state-of-the-art task success and improved generalization, especially for fast-reaction and long-horizon tasks.
Related Articles

Black Hat Asia
AI Business
v0.20.5
Ollama Releases

Inside Anthropic's Project Glasswing: The AI Model That Found Zero-Days in Every Major OS
Dev.to
Gemma 4 26B fabricated an entire code audit. I have the forensic evidence from the database.
Reddit r/LocalLLaMA
SoloEngine: Low-Code Agentic AI Development Platform with Native Support for Multi-Agent Collaboration, MCP, and Skill System
Dev.to