Improving Channel Estimation via Multimodal Diffusion Models with Flow Matching
arXiv cs.LG / 3/17/2026
Key Points
- The paper proposes MultiCE-Flow, a multimodal channel estimation framework that combines flow matching with a diffusion transformer (DiT) to exploit environmental information in sensing-aided networks.
- It introduces a multimodal perception module that fuses LiDAR, camera, and location data as a semantic condition, while using sparse pilots as a structural condition to guide the DiT backbone.
- The approach uses flow matching to learn a linear trajectory from noise to data, which enables efficient one-step sampling, in contrast to standard diffusion models that typically rely on many iterative denoising steps.
- Experiments show that MultiCE-Flow outperforms traditional baselines and existing generative models and remains robust in out-of-distribution scenarios and across varying pilot densities, making it a candidate for environment-aware communication systems.
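
The linear noise-to-data trajectory and one-step sampling mentioned above can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the array shape, the function names, and the `oracle_velocity` stand-in (which is handed the true channel in place of a trained, conditioned DiT) are all assumptions made purely for illustration.

```python
import numpy as np

def interpolate(x0, x1, t):
    """Point on the straight line from noise x0 (t=0) to data x1 (t=1)."""
    return (1.0 - t) * x0 + t * x1

def target_velocity(x0, x1):
    """Regression target for the velocity field; constant along a linear path."""
    return x1 - x0

def flow_matching_loss(v_pred, x0, x1):
    """Mean squared error between predicted and target velocity."""
    return float(np.mean(np.abs(v_pred - target_velocity(x0, x1)) ** 2))

def one_step_sample(velocity_fn, x0, cond):
    """Single Euler step over t in [0, 1]: x1_hat = x0 + v(x0, t=0, cond)."""
    return x0 + velocity_fn(x0, 0.0, cond)

rng = np.random.default_rng(0)
shape = (4, 8)  # hypothetical (antennas, subcarriers) grid
h_true = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
noise = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

def oracle_velocity(x, t, cond):
    """Hypothetical stand-in for a trained network.

    It cheats by receiving the true channel as `cond`; a real model would
    only see pilots and sensing features. For a linear path the exact
    velocity is (x1 - x_t) / (1 - t).
    """
    return (cond - x) / (1.0 - t)

# With an exact velocity field, one Euler step recovers the data sample.
h_hat = one_step_sample(oracle_velocity, noise, cond=h_true)
```

Because the target velocity is constant along a straight path, a single Euler step from t=0 to t=1 is exact whenever the learned field matches the target; in practice the quality of one-step sampling depends on how well the trained network approximates it.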




