Beyond the Embedding Bottleneck: Adaptive Retrieval-Augmented 3D CT Report Generation
arXiv cs.CV / 3/18/2026
📰 NewsModels & Research
Key Points
- The study reveals a bottleneck in 3D CT embeddings: highly discriminative pathology signals exist but are limited to a very small effective dimensionality (as few as 2 of 512), constraining both generation and retrieval.
- Scaling the language model does not improve performance, suggesting the bottleneck lies in the visual representation rather than in the text generator.
- The authors propose AdaRAG-CT, an adaptive augmentation framework that injects supplementary textual information through controlled retrieval and selectively fuses it during report generation to mitigate the bottleneck.
- On the CT-RATE benchmark, AdaRAG-CT delivers state-of-the-art clinical efficacy, raising Clinical F1 from 0.420 to 0.480, with ablations showing both retrieval and generation components contribute, and the authors provide code at the given GitHub URL; naive static retrieval can degrade performance.
Related Articles

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.
Reddit r/LocalLLaMA
QwenDean-4B | fine-tuned SLM for UIGen; our first attempt, looking for feedback!
Reddit r/LocalLLaMA
acestep.cpp: portable C++17 implementation of ACE-Step 1.5 music generation using GGML. Runs on CPU, CUDA, ROCm, Metal, Vulkan
Reddit r/LocalLLaMA

**Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding**
Hugging Face Blog

Newest GPU server in the lab! 72gb ampere vram!
Reddit r/LocalLLaMA