PanLUNA: An Efficient and Robust Query-Unified Multimodal Model for Edge Biosignal Intelligence

arXiv cs.AI / 4/7/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • PanLUNAは、EEG・ECG・PPGを単一の共有エンコーダで同時処理する「5.4Mパラメータ」のコンパクトな汎モーダル生体信号財団モデルとして提案されています。
  • センサ種別埋め込みを備えたユニファイド・クエリ集合により、クロスモーダルの早期融合を効率的に行いつつ、推論時のモダリティ欠損にも頑健である設計です。
  • TUABの異常EEG検出(balanced accuracy 81.21%)やHMCのマルチモーダル睡眠ステージング(balanced accuracy 0.7416)で、大型モデルに匹敵または上回る性能を示しています。
  • Quantization-aware trainingによりINT8量子化でもフル精度の≥96%性能を回復でき、GAP9の超低電力RISC-Vマイコン上でウェアラブル実装に近い低レイテンシ・低消費電力性能(例:12誘導ECGで325.6ms/18.8mJ)を報告しています。

Abstract

Physiological foundation models (FMs) have shown promise for biosignal representation learning, yet most remain confined to a single modality such as EEG, ECG, or PPG, largely because paired multimodal datasets are scarce. In this paper, we present PanLUNA, a compact 5.4M-parameter pan-modal FM that jointly processes EEG, ECG, and PPG within a single shared encoder. Extending LUNA's channel-unification module, PanLUNA treats multimodal channels as entries in a unified query set augmented with sensor-type embeddings, enabling efficient cross-modal early fusion while remaining inherently robust to missing modalities at inference time. Despite its small footprint, PanLUNA matches or exceeds models up to 57\times larger: 81.21% balanced accuracy on TUAB abnormal EEG detection and state-of-the-art 0.7416 balanced accuracy on HMC multimodal sleep staging. Quantization-aware training with INT8 weights recovers \geq96% of full-precision performance, and deployment on the GAP9 ultra-low-power RISC-V microcontroller for wearables achieves 325.6 ms latency and 18.8 mJ per 10-second, 12-lead ECG inference, and 1.206 s latency at 68.65 mJ for multimodal 5-channel sleep staging over 30-second epochs.

PanLUNA: An Efficient and Robust Query-Unified Multimodal Model for Edge Biosignal Intelligence | AI Navigate