SRAG: RAG with Structured Data Improves Vector Retrieval
arXiv cs.CL / 3/31/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that standard RAG relies heavily on embedding-based representational similarity and can underperform when queries and chunks require more than vector similarity for accurate grounding.
- It proposes Structured RAG (SRAG), which enriches both queries and retrieved chunks with structured signals such as topics, sentiments, query/chunk types, knowledge-graph triples, and semantic tags.
- Experiments show SRAG significantly improves the retrieval process, indicating better alignment between user intent and the information retrieved.
- Using GPT-5 as an LLM-as-a-judge in a question-answering setting, SRAG improves answer scoring by about 30% (p-value = 2e-13), with the largest gains on comparative, analytical, and predictive questions.
- The authors report that SRAG supports broader, more diverse, and episodic-style retrieval while also improving tail-risk outcomes (more frequent large gains with relatively small losses).
Related Articles

Black Hat Asia
AI Business
[D] How does distributed proof of work computing handle the coordination needs of neural network training?
Reddit r/MachineLearning

Claude Code's Entire Source Code Was Just Leaked via npm Source Maps — Here's What's Inside
Dev.to

BYOK is not just a pricing model: why it changes AI product trust
Dev.to

AI Citation Registries and Identity Persistence Across Records
Dev.to