Toward Agentic RAG for Ukrainian

arXiv cs.AI / 4/17/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper investigates agentic Retrieval-Augmented Generation (RAG) for Ukrainian as part of the UNLP 2026 Shared Task on Multi-Domain Document Understanding.
  • It proposes a two-stage retrieval pipeline using BGE-M3 with BGE reranking, paired with a lightweight agentic layer that performs query rephrasing and answer-retry loops over Qwen2.5-3B-Instruct.
  • The results indicate that retrieval quality is the main bottleneck: agentic retries can improve answer accuracy, but end performance is still limited by document and page identification.
  • The authors analyze practical limits of offline agentic pipelines and suggest future work combining stronger retrieval with more advanced agentic reasoning for Ukrainian.

Abstract

We present an initial investigation into Agentic Retrieval-Augmented Generation (RAG) for Ukrainian, conducted within the UNLP 2026 Shared Task on Multi-Domain Document Understanding. Our system combines two-stage retrieval (BGE-M3 with BGE reranking) with a lightweight agentic layer performing query rephrasing and answer-retry loops on top of Qwen2.5-3B-Instruct. Our analysis reveals that retrieval quality is the primary bottleneck: agentic retry mechanisms improve answer accuracy but the overall score remains constrained by document and page identification. We discuss practical limitations of offline agentic pipelines and outline directions for combining stronger retrieval with more advanced agentic reasoning for Ukrainian.

Toward Agentic RAG for Ukrainian | AI Navigate