Edge-Efficient Two-Stream Multimodal Architecture for Non-Intrusive Bathroom Fall Detection
arXiv cs.CV / 3/19/2026
📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research
Key Points
- The paper proposes a two-stream architecture that separately encodes radar motion with a Motion--Mamba branch and floor vibration with an Impact--Griffin branch, using cross-conditioned fusion and a Switch--MoE head to align tokens and suppress confounders.
- It demonstrates real-time edge inference on a Raspberry Pi 4B gateway with low latency (15.8 ms) and reduced energy per 2.56 s window (10,750 mJ) compared with a baseline.
- A bathroom fall detection benchmark was built with over 3 hours of synchronized mmWave radar and triaxial vibration data across eight scenarios, with subject-independent train/validation/test splits achieving 96.1% accuracy, 94.8% precision, 88.0% recall, 91.1% macro F1, and AUC 0.968.
- Compared to the strongest baseline, it improves accuracy by 2.0 percentage points and fall recall by 1.3 percentage points while reducing latency and energy costs for privacy-preserving, non-intrusive safety monitoring in wet bathroom environments.
Related Articles
Speaking of VoxtralResearchVoxtral TTS: A frontier, open-weights text-to-speech model that’s fast, instantly adaptable, and produces lifelike speech for voice agents.
Mistral AI Blog
Why I Switched from Cloud AI to a Dedicated AI Box (And Why You Should Too)
Dev.to
How to Use MiMo V2 API for Free in 2026: Complete Guide
Dev.to
The Agent Memory Problem Nobody Solves: A Practical Architecture for Persistent Context
Dev.to
Why We Ditched 6 APIs and Built One MCP Server for Our Entire Ecommerce Stack
Dev.to