Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej
arXiv cs.CL / 3/26/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces VidhikDastaavej, an anonymized large-scale dataset of private Indian legal documents spanning 133 categories, intended to fill a gap in public resources for long-form legal drafting research.
- It proposes a Model-Agnostic Wrapper (MAW) for structured legal document generation that separates section planning from per-section generation using retrieval-based prompts.
- The approach is designed to be independent of any specific LLM, enabling use across both open- and closed-source models.
- Evaluation across lexical, semantic, LLM-based, and expert/annotator-driven metrics—including inter-annotator agreement—finds MAW improves factual accuracy, coherence, and completeness over fine-tuned baselines.
- The work delivers both a new benchmark dataset and a generalizable framework, aiming to accelerate Legal AI and structured legal text generation research in the Indian context.
Related Articles
Regulating Prompt Markets: Securities Law, Intellectual Property, and the Trading of Prompt Assets
Dev.to
Mercor competitor Deccan AI raises $25M, sources experts from India
Dev.to
How We Got Local MCP Servers Working in Claude Cowork (The Missing Guide)
Dev.to
How Should Students Document AI Usage in Academic Work?
Dev.to

I asked my AI agent to design a product launch image. Here's what came back.
Dev.to