mmAnomaly: Leveraging Visual Context for Robust Anomaly Detection in the Non-Visual World with mmWave Radar

arXiv cs.CV / 4/2/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • mmAnomalyは、カメラがプライバシーや遮蔽物で使いづらい「非視覚」環境向けに、mmWaveレーダとRGBDを組み合わせて異常検知を行うマルチモーダル手法です。
  • mmWave信号の反射が素材・クラッタ・多重反射などで非ガウス的に歪むため、既存手法が文脈の欠如から誤検知しやすい課題に対処します。
  • RGBD側でResNetベースの軽量分類器を用いてシーン形状や素材特性といったセマンティック手がかりを抽出し、その文脈に条件付潜在拡散モデルで「期待されるmmWaveスペクトル」を生成します。
  • 実測スペクトルと生成スペクトルの空間的差分を二入力比較モジュールで評価し、異常の局在化まで可能にしたことが、隠し武器・侵入者・転倒の3用途評価で示されています。
  • 複数データセットに対して最大94% F1スコア、サブメートルの局在誤差を達成し、衣類・遮蔽・クラッタ環境での汎化性能と解釈性をアピールしています。

Abstract

mmWave radar enables human sensing in non-visual scenarios-e.g., through clothing or certain types of walls-where traditional cameras fail due to occlusion or privacy limitations. However, robust anomaly detection with mmWave remains challenging, as signal reflections are influenced by material properties, clutter, and multipath interference, producing complex, non-Gaussian distortions. Existing methods lack contextual awareness and misclassify benign signal variations as anomalies. We present mmAnomaly, a multi-modal anomaly detection framework that combines mmWave radar with RGBD input to incorporate visual context. Our system extracts semantic cues-such as scene geometry and material properties-using a fast ResNet-based classifier, and uses a conditional latent diffusion model to synthesize the expected mmWave spectrum for the given visual context. A dual-input comparison module then identifies spatial deviations between real and generated spectra to localize anomalies. We evaluate mmAnomaly on two multi-modal datasets across three applications: concealed weapon localization, through-wall intruder localization, and through-wall fall localization. The system achieves up to 94% F1 score and sub-meter localization error, demonstrating robust generalization across clothing, occlusions, and cluttered environments. These results establish mmAnomaly as an accurate and interpretable framework for context-aware anomaly detection in mmWave sensing.