MLG-Stereo: ViT Based Stereo Matching with Multi-Stage Local-Global Enhancement
arXiv cs.CV / 4/23/2026
📰 NewsModels & Research
Key Points
- The paper introduces MLG-Stereo, a ViT-based stereo matching system designed to improve detail prediction and robustness to arbitrary image resolutions versus existing ViT approaches.
- It proposes a Multi-Granularity Feature Network to better balance global context with local geometric information and to reduce the mismatch between training and inference scales.
- The method builds a Local-Global Cost Volume that jointly captures locally correlated cues and global-aware matching signals.
- It adds a Local-Global Guided Recurrent Unit to iteratively refine disparity estimates using guidance from global information.
- Experiments on multiple benchmarks show competitive results on Middlebury and KITTI-2015 and particularly strong performance on KITTI-2012.
Related Articles

Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans
Dev.to

OpenAI Just Named It Workspace Agents. We Open-Sourced Our Lark Version Six Months Ago
Dev.to

GPT Image 2 Subject-Lock Editing: A Practical Guide to input_fidelity
Dev.to

GPT Image 2 vs DALL-E 3: What Actually Changed in OpenAI's New Image Model
Dev.to

AI Tutor for Science Students — Physics Chemistry Biology Solved by AI
Dev.to