KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph
arXiv cs.AI / 2026/3/24
💬 オピニオンSignals & Early TrendsIdeas & Deep AnalysisModels & Research
要点
- KLDrive is proposed as a knowledge-graph-augmented LLM reasoning framework for fine-grained question answering in autonomous driving that aims to reduce hallucinations and make reasoning more reliable.
- It introduces an energy-based scene fact construction module that consolidates multi-source evidence into a structured scene knowledge graph to improve the factual grounding of downstream reasoning.
- An LLM agent then performs fact-grounded reasoning over a constrained action space using explicit structural constraints, improving transparency and controllability of the reasoning process.
- The approach uses structured prompting and few-shot in-context exemplars to adapt across diverse QA tasks without heavy task-specific fine-tuning.
- Experiments on two autonomous-driving QA benchmarks report improved results, including 65.04% accuracy on NuScenes-QA, a best SPICE score of 42.45 on GVQA, and a 46.01-point improvement on the most challenging counting task, indicating stronger factual reasoning performance.
