Geometry Reinforced Efficient Attention Tuning Equipped with Normals for Robust Stereo Matching
arXiv cs.CV / 4/13/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The arXiv paper introduces GREATEN, a stereo-matching framework aimed at improving synthetic-to-real (Syn-to-Real) generalization by using surface normals as domain-invariant geometric cues.
- It proposes a Gated Contextual-Geometric Fusion (GCGF) module to suppress unreliable texture/context features and fuse them with normal-driven geometry for more discriminative representations.
- To handle non-Lambertian regions (e.g., specular/transparent surfaces), it adds a Specular-Transparent Augmentation (STA) strategy that makes the fusion more robust to misleading visual cues.
- The method uses sparse attention variants (SSA, SDMA, SVA) to preserve fine-grained global feature extraction for occlusions while reducing computational cost, improving inference speed and enabling high-resolution (3K) disparity estimation.
- Experiments show substantial error reductions when trained only on synthetic data, including 30% fewer errors on ETH3D and faster runtime (19.2% faster than its baseline variant), with support for disparity ranges up to 768 on Middlebury.
Related Articles

Why Fashion Trend Prediction Isn’t Enough Without Generative AI
Dev.to

Chatbot vs Voicebot: The Real Business Decision Nobody Talks About
Dev.to

วิธีใช้ AI ทำ SEO ให้เว็บติดอันดับ Google (2026)
Dev.to

Free AI Tools With No Message Limits — The Definitive List (2026)
Dev.to

Why Domain Knowledge Is Critical in Healthcare Machine Learning
Dev.to