Flying by Inference: Active Inference World Models for Adaptive UAV Swarms

arXiv cs.RO / 5/1/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper proposes an expert-guided, active-inference-inspired probabilistic framework for adaptive trajectory planning in UAV swarms.
  • It reframes multi-UAV trajectory design as hierarchical probabilistic inference, using GA--RF to generate expert demonstrations offline and then learning symbolic world-model dictionaries (Mission/Route/Motion).
  • During flight, the swarm evaluates candidate actions by inferring posterior beliefs over symbolic states and minimizing KL-divergence-based abnormality indicators relative to expert reference distributions.
  • It integrates Bayesian state estimators (EKF and PF) at the motion level to improve trajectory correction under uncertainty, enabling mission allocation, route insertion, motion adaptation, and collision-aware replanning without rerunning the offline optimizer.
  • Simulation and real-flight trajectory validations show smoother, more stable behavior than a modified Q-learning approach, and improved robustness to noisy, non-smooth observations for symbolic prediction correction.

Abstract

This paper presents an expert-guided active-inference-inspired framework for adaptive UAV swarm trajectory planning. The proposed method converts multi-UAV trajectory design from a repeated combinatorial optimization problem into a hierarchical probabilistic inference problem. In the offline phase, a genetic-algorithm planner with repulsive-force collision avoidance (GA--RF) generates expert demonstrations, which are abstracted into Mission, Route, and Motion dictionaries. These dictionaries are used to learn a probabilistic world model that captures how expert mission allocations induce route orders and how route orders induce motion-level behaviors. During online operation, the UAV swarm evaluates candidate actions by forming posterior beliefs over symbolic states and minimizing KL-divergence-based abnormality indicators with respect to expert-derived reference distributions. This enables mission allocation, route insertion, motion adaptation, and collision-aware replanning without rerunning the offline optimizer. Bayesian state estimators, including EKF and PF modules, are integrated at the motion level to improve trajectory correction under uncertainty. Simulation results show that the proposed framework preserves expert-like planning structure while producing smoother and more stable behavior than modified Q-learning. Additional validation using real-flight UAV trajectory data demonstrates that the learned world model can correct symbolic predictions under noisy and non-smooth observations, supporting its applicability to adaptive UAV swarm autonomy.