Self-Describing Structured Data with Dual-Layer Guidance: A Lightweight Alternative to RAG for Precision Retrieval in Large-Scale LLM Knowledge Navigation
arXiv cs.CL / 4/23/2026
📰 NewsIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- The paper addresses the “Lost-in-the-Middle” effect in LLMs, which reduces attention to information located in the middle of long context windows and can limit knowledge-retrieval over large structured libraries.
- It proposes Self-Describing Structured Retrieval (SDSR), a lightweight approach where structured data files include human-authored navigation metadata placed at the primacy position to leverage LLM attention biases.
- SDSR uses a Dual-Layer Guidance strategy that combines in-file metadata with explicit routing rules in the system prompt to improve precision retrieval.
- In a four-round benchmark over an expanded 190-skill library (36→119 categories) with adversarial distractor injection, the combined approach (in-file + prompt guidance) achieves 100% primary routing accuracy versus 65% without guidance.
- The work also extends SDSR to semi-structured corpora, suggesting cross-reference encoding can support retrieval without vector databases when document structure is recoverable.
Related Articles

Black Hat USA
AI Business

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans
Dev.to

10 AI Tools Every Developer Should Try in 2026
Dev.to

Why use an AI gateway at all?
Dev.to