ORACLE: Orchestrate NPC Daily Activities using Contrastive Learning with Transformer-CVAE

arXiv cs.CL / 3/26/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • ORACLE is presented as a new generative model to synthesize realistic indoor daily activity plans for NPCs, aiming to improve immersion beyond monotonous, repetitive routines from prior approaches.
  • The method learns from the CASAS smart home dataset (24-hour activity sequences) while addressing dataset issues including imbalanced sequential data and limited training samples, and the lack of pre-trained human daily activity representations.
  • ORACLE combines three key components: Transformer-based sequential modeling, controllable generation via Conditional Variational Autoencoders (CVAE), and discriminative refinement through contrastive learning.
  • Experiments reportedly show ORACLE outperforms existing methods in both the quality of generated NPC activity plans and the effectiveness of its overall design choices.

Abstract

The integration of Non-player characters (NPCs) within digital environments has been increasingly recognized for its potential to augment user immersion and cognitive engagement. The sophisticated orchestration of their daily activities, reflecting the nuances of human daily routines, contributes significantly to the realism of digital environments. Nevertheless, conventional approaches often produce monotonous repetition, falling short of capturing the intricacies of real human activity plans. In response to this, we introduce ORACLE, a novel generative model for the synthesis of realistic indoor daily activity plans, ensuring NPCs' authentic presence in digital habitats. Exploiting the CASAS smart home dataset's 24-hour indoor activity sequences, ORACLE addresses challenges in the dataset, including its imbalanced sequential data, the scarcity of training samples, and the absence of pre-trained models encapsulating human daily activity patterns. ORACLE's training leverages the sequential data processing prowess of Transformers, the generative controllability of Conditional Variational Autoencoders (CVAE), and the discriminative refinement of contrastive learning. Our experimental results validate the superiority of generating NPC activity plans and the efficacy of our design strategies over existing methods.