Adaptive ToR: Complexity-Aware Tree-Based Retrieval for Pareto-Optimal Multi-Intent NLU
arXiv cs.AI / 4/28/2026
📰 NewsDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper introduces Adaptive Tree-of-Retrieval (Adaptive ToR), a complexity-aware retrieval architecture that dynamically chooses between single-step and adaptive-depth hierarchical retrieval based on query difficulty.
- It uses a Query Tree Classifier to compute a Query Complexity Index from weighted linguistic signals, routing each query to the most efficient retrieval topology.
- For complex queries, it recursively decomposes the input into sub-queries and applies adaptive pruning with two-stage filtering (quantitative similarity gating plus semantic relevance evaluation) to prevent exponential growth in nodes.
- A reranking layer improves production efficiency via a deduplicator-first pipeline and global LLM rescoring; results on the NLU++ benchmark show a 9.7% relative improvement over fixed-depth baselines alongside substantial latency and cost reductions.
- The depth analysis indicates many queries can be resolved quickly via single-step routing, while token consumption grows with depth, supporting the claim of a Pareto-optimal tradeoff among accuracy, latency, and computation.
Related Articles
How to Build Traceable and Evaluated LLM Workflows Using Promptflow, Prompty, and OpenAI
MarkTechPost

An improvement of the convergence proof of the ADAM-Optimizer
Dev.to
Claude Code 会话历史在哪里?如何找回你的 AI 编程对话记录
Dev.to
We built an AI that runs an entire business autonomously. Not a demo. Not a prototype. Actually running. YC-backed, here's what we learned.
Reddit r/artificial
langchain-tests==1.1.7
LangChain Releases