DenTab: A Dataset for Table Recognition and Visual QA on Real-World Dental Estimates
arXiv cs.CV / 4/20/2026
📰 NewsSignals & Early TrendsTools & Practical UsageModels & Research
Key Points
- The paper introduces DenTab, a new dataset of 2,000 real-world dental estimate table image crops with high-quality HTML annotations, aiming to better reflect noisy administrative capture conditions.
- DenTab supports both table recognition (TR) and table visual question answering (TableVQA) on the same inputs, totaling 2,208 questions across 11 categories including retrieval, aggregation, and logic/consistency checks.
- The authors benchmark 16 systems (14 VLMs plus two OCR baselines) and find that strong structure recovery often fails to deliver accurate results on multi-step arithmetic and consistency questions, even with ground-truth HTML tables.
- To improve arithmetic reliability without training, they propose the Table Router Pipeline, which routes arithmetic questions to deterministic execution using a VLM-generated structured representation and a rule-based exact computation executor.
- The dataset and code are planned for public release on GitHub to enable more realistic evaluation and research on reasoning over tables.



