Thought Graph Traversal for Test-time Scaling in Chest X-ray VLLMs
arXiv cs.CV / 5/4/2026
💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- The paper proposes a test-time scaling method for vision-language large models to improve chest X-ray report generation without any additional training.
- It introduces a lightweight Thought Graph Traversal (TGT) framework that steers reasoning through organ-specific findings in a medically coherent sequence using structured medical priors embedded in prompts.
- The method further improves reasoning depth via a “reasoning budget forcing” strategy that dynamically extends the generation process at inference time.
- Experiments show the approach outperforms baseline prompting on standard benchmarks while enabling analysis of dataset biases through traceable reasoning paths, and the authors open-source the code and prompts for reproducibility.



