RayFormer: Modeling Inter- and Intra-Ray Similarity for NeRF-Based Video Snapshot Compressive Imaging
arXiv cs.CV / 5/1/2026
📰 NewsDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper targets video snapshot compressive imaging (SCI), aiming to reconstruct dynamic scenes from a single measurement, and notes that NeRF-based approaches often use random ray sampling that overlooks structural similarities.
- It introduces a patch-level ray sampling strategy to better model scene content structure before applying any learning-based reconstruction.
- The authors propose RayFormer, an Inter- and Intra-Ray Transformer that captures both inter-ray similarities among neighboring spatial points at the same depth and intra-ray correlations along each viewing ray.
- By combining patch-level sampling with a total variation prior in the optimization objective, the method improves spatial smoothness and reduces artifacts.
- Experiments on simulated and real-world scenes show the approach achieves state-of-the-the-art (SOTA) reconstruction quality.
Related Articles

Why Autonomous Coding Agents Keep Failing — And What Actually Works
Dev.to

Text-to-image is easy. Chaining LLMs to generate, critique, and iterate on images autonomously is a routing nightmare. AgentSwarms now supports Image generation playground and creative media workflows!
Reddit r/artificial

Automating FDA Compliance: AI for Specialty Food Producers
Dev.to

Mistral's new flagship Medium 3.5 folds chat, reasoning, and code into one model
THE DECODER
I hate this group but not literally
Reddit r/LocalLLaMA