AI Navigate

Towards Interactive Intelligence for Digital Humans

arXiv cs.CL / 3/16/2026

💬 OpinionModels & Research

Key Points

  • The paper proposes Interactive Intelligence as a new paradigm for digital humans that enables personality-aligned expression, adaptive interaction, and self-evolution.
  • It introduces Mio, an end-to-end five-module framework (Thinker, Talker, Face Animator, Body Animator, Renderer) that unifies cognitive reasoning with real-time multimodal embodiment for fluid interaction.
  • A new benchmark is established to rigorously evaluate interactive intelligence, enabling standardized comparisons across methods.
  • Experiments show Mio achieving superior performance versus state-of-the-art methods across evaluated dimensions, moving digital humans beyond superficial imitation toward intelligent interaction.

Abstract

We introduce Interactive Intelligence, a novel paradigm of digital human that is capable of personality-aligned expression, adaptive interaction, and self-evolution. To realize this, we present Mio (Multimodal Interactive Omni-Avatar), an end-to-end framework composed of five specialized modules: Thinker, Talker, Face Animator, Body Animator, and Renderer. This unified architecture integrates cognitive reasoning with real-time multimodal embodiment to enable fluid, consistent interaction. Furthermore, we establish a new benchmark to rigorously evaluate the capabilities of interactive intelligence. Extensive experiments demonstrate that our framework achieves superior performance compared to state-of-the-art methods across all evaluated dimensions. Together, these contributions move digital humans beyond superficial imitation toward intelligent interaction.