GGPT: Geometry Grounded Point Transformer
arXiv cs.CV / 3/13/2026
📰 NewsModels & Research
Key Points
- GGPT combines an improved Structure-from-Motion pipeline with a geometry-guided 3D point transformer to produce reconstructions that are both geometrically consistent and spatially complete.
- It uses dense feature matching and lightweight geometric optimization to estimate accurate camera poses and partial 3D point clouds from sparse input views.
- Trained solely on ScanNet++ with VGGT predictions, GGPT generalises across architectures and datasets and substantially outperforms state-of-the-art feed-forward reconstruction models in both in-domain and out-of-domain settings.
- The framework employs an optimized guidance encoding to inject explicit partial-geometry supervision, enabling effective integration of geometric priors with dense predictions.
Related Articles
Jeff Bezos reportedly wants $100 billion to buy and transform old manufacturing firms with AI
TechCrunch
[R] Weekly digest: arXiv AI security papers translated for practitioners -- Cascade (cross-stack CVE+Rowhammer attacks on compound AI), LAMLAD (dual-LLM adversarial ML, 97% evasion), OpenClaw (4 vuln classes in agent frameworks)
Reddit r/MachineLearning
My Experience with Qwen 3.5 35B
Reddit r/LocalLLaMA

Cursor’s new coding model Composer 2 is here: It beats Claude Opus 4.6 but still trails GPT-5.4
VentureBeat
Qwen 3.5 122B completely falls apart at ~ 100K context
Reddit r/LocalLLaMA