Abstract
Human activity recognition (HAR) in Internet of Things (IoT) environments must cope with heterogeneous sensor settings that vary across datasets, devices, body locations, sensing modalities, and channel compositions. This heterogeneity makes conventional channel-fixed models difficult to reuse across sensing environments because their input representations are tightly coupled to predefined channel structures. To address this problem, we investigate strict channel-free HAR, in which a single shared model performs inference without assuming a fixed number, order, or semantic arrangement of input channels, and without relying on sensor-specific input layers or dataset-specific channel templates. We argue that fusion design is the central issue in this setting. Accordingly, we propose a channel-free HAR framework that combines channel-wise encoding with a shared encoder, metadata-conditioned late fusion via conditional batch normalization, and joint optimization of channel-level and fused predictions through a combination loss. The proposed model processes each channel independently to handle varying channel configurations, while sensor metadata such as body location, modality, and axis helps recover structural information that channel-independent processing alone cannot retain. In addition, the joint loss encourages both the discriminability of individual channels and the consistency of the final fused prediction. Experiments on PAMAP2, together with robustness analysis on six HAR datasets, ablation studies, sensitivity analysis, efficiency evaluation, and cross-dataset transfer learning, demonstrate three main findings...
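The pipeline summarized above can be sketched in a few lines of numpy. This is a minimal illustration, not the paper's implementation: all dimensions, weight initializations, and the loss weighting `alpha` are assumptions, the mean-pooling fusion stands in for the full late-fusion module, and a simple FiLM-style scale-and-shift stands in for conditional batch normalization. What it shows is the structural point of the abstract: because each channel is encoded independently by a shared encoder and fused by pooling, the same model accepts inputs with any number of channels.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions (not from the paper): D-dim per-channel windows,
# H-dim embeddings, M-dim metadata one-hots, K activity classes.
D, H, M, K = 32, 16, 6, 4

# Shared per-channel encoder: one linear map applied to every channel
# independently, so no fixed channel count or ordering is assumed.
W_enc = rng.normal(0, 0.1, (D, H))

# Metadata conditioning (FiLM-style stand-in for conditional batch norm):
# metadata (body location / modality / axis) predicts per-feature
# scale and shift, restoring structure that channel-independent
# processing alone would lose.
W_gamma = rng.normal(0, 0.1, (M, H))
W_beta = rng.normal(0, 0.1, (M, H))

# Shared classifier head, reused for channel-level and fused predictions.
W_cls = rng.normal(0, 0.1, (H, K))

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def forward(channels, metadata):
    """channels: (C, D) raw windows; metadata: (C, M) one-hot descriptors."""
    h = channels @ W_enc                                   # (C, H) shared encoding
    h = h * (1 + metadata @ W_gamma) + metadata @ W_beta   # metadata conditioning
    chan_logits = h @ W_cls                                # (C, K) per-channel predictions
    fused_logits = h.mean(axis=0) @ W_cls                  # (K,) late fusion by pooling
    return chan_logits, fused_logits

def combination_loss(chan_logits, fused_logits, label, alpha=0.5):
    """Joint cross-entropy over channel-level and fused predictions."""
    ce = lambda logits: -np.log(softmax(logits)[..., label] + 1e-9)
    return alpha * ce(chan_logits).mean() + (1 - alpha) * ce(fused_logits)

# The same weights handle a 3-channel and a 5-channel input without change.
for C in (3, 5):
    x = rng.normal(size=(C, D))
    meta = np.eye(M)[rng.integers(0, M, size=C)]
    chan_logits, fused_logits = forward(x, meta)
    loss = combination_loss(chan_logits, fused_logits, label=2)
```

The loop at the end is the strict channel-free property in miniature: varying `C` changes only the leading axis, never the parameters, which is what lets one shared model serve heterogeneous sensor configurations.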