AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References

arXiv cs.CV / 3/27/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • AnyID は、従来の「単一のアイデンティティ参照」に最適化された動画生成手法の限界を解消し、顔・肖像・動画など多様な参照から一貫したアイデンティティを保った映像生成を目指すフレームワークです。
  • 異種入力を統一表現にまとめる「omni-referenced architecture」と、1つの参照をアンカーにして属性レベルで制御できる「primary-referenced generation paradigm(差分プロンプト)」の2つの中核提案が示されています。
  • 大規模で厳密にキュレーションされたデータで学習した後、最終的に強化学習による微調整を行い、人間評価に基づく嗜好データ(対比較)で「アイデンティティ忠実度」と「プロンプト制御性」を同時に高めます。
  • 評価では、複数タスク設定において超高いアイデンティティ維持と、従来より優れた属性レベルの制御性を達成したと報告されています。

Abstract

Identity-preserving video generation offers powerful tools for creative expression, allowing users to customize videos featuring their beloved characters. However, prevailing methods are typically designed and optimized for a single identity reference. This underlying assumption restricts creative flexibility by inadequately accommodating diverse real-world input formats. Relying on a single source also constitutes an ill-posed scenario, causing an inherently ambiguous setting that makes it difficult for the model to faithfully reproduce an identity across novel contexts. To address these issues, we present AnyID, an ultra-fidelity identity-preservation video generation framework that features two core contributions. First, we introduce a scalable omni-referenced architecture that effectively unifies heterogeneous identity inputs (e.g., faces, portraits, and videos) into a cohesive representation. Second, we propose a primary-referenced generation paradigm, which designates one reference as a canonical anchor and uses a novel differential prompt to enable precise, attribute-level controllability. We conduct training on a large-scale, meticulously curated dataset to ensure robustness and high fidelity, and then perform a final fine-tuning stage using reinforcement learning. This process leverages a preference dataset constructed from human evaluations, where annotators performed pairwise comparisons of videos based on two key criteria: identity fidelity and prompt controllability. Extensive evaluations validate that AnyID achieves ultra-high identity fidelity as well as superior attribute-level controllability across different task settings.