CognitionCapturerPro: Towards High-Fidelity Visual Decoding from EEG/MEG via Multi-modal Information and Asymmetric Alignment
arXiv cs.AI / 3/16/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- CognitionCapturerPro proposes an enhanced framework that fuses EEG with multi-modal priors (images, text, depth, and edges) via collaborative training.
- It introduces an uncertainty-weighted similarity scoring mechanism to quantify modality-specific fidelity and a fusion encoder for integrating shared representations.
- The approach employs a simplified alignment module and a pre-trained diffusion model to boost visual reconstruction from EEG.
- On the THINGS-EEG dataset, it outperforms the original CognitionCapturer, with Top-1 and Top-5 retrieval gains of 25.9% and 10.6%, respectively.
- The authors provide code at the linked GitHub repository for reproducibility.
