NaviSlim: Adaptive Context-Aware Navigation and Sensing via Dynamic Slimmable Networks

arXiv cs.RO / 4/10/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • 提案論文は、マイクロドローン等の小型自律飛行体向けに、環境の難易度・現在の軌道・ナビ目的に応じて計算量とセンサ消費を動的に調整できるニューラルナビゲーションモデル「NaviSlim」を提示している。
  • NaviSlimはゲーティング付きのslimmableネットワークで、スリミング係数を状況に基づいて自律選択することで、実行時間とエネルギー消費を最適化する設計になっている。
  • センサフュージョンに関しても、オンボードセンサのパワーレベルを状況に応じて動的選択し、センサ取得の電力と時間を削減しつつ、ネットワーク切り替えなしで動作できるとしている。
  • Microsoft AirSimの頑健なシミュレーションでの広範な学習・評価により、動的に必要なモデル複雑度を下げた場合、静的モデルに比べ平均でモデル複雑度を57〜92%、センサ利用を61〜80%削減できたと報告している。

Abstract

Small-scale autonomous airborne vehicles, such as micro-drones, are expected to be a central component of a broad spectrum of applications ranging from exploration to surveillance and delivery. This class of vehicles is characterized by severe constraints in computing power and energy reservoir, which impairs their ability to support the complex state-of-the-art neural models needed for autonomous operations. The main contribution of this paper is a new class of neural navigation models -- NaviSlim -- capable of adapting the amount of resources spent on computing and sensing in response to the current context (i.e., difficulty of the environment, current trajectory, and navigation goals). Specifically, NaviSlim is designed as a gated slimmable neural network architecture that, different from existing slimmable networks, can dynamically select a slimming factor to autonomously scale model complexity, which consequently optimizes execution time and energy consumption. Moreover, different from existing sensor fusion approaches, NaviSlim can dynamically select power levels of onboard sensors to autonomously reduce power and time spent during sensor acquisition, without the need to switch between different neural networks. By means of extensive training and testing on the robust simulation environment Microsoft AirSim, we evaluate our NaviSlim models on scenarios with varying difficulty and a test set that showed a dynamic reduced model complexity on average between 57-92%, and between 61-80% sensor utilization, as compared to static neural networks designed to match computing and sensing of that required by the most difficult scenario.