Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej

arXiv cs.CL / 3/26/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces VidhikDastaavej, an anonymized large-scale dataset of private Indian legal documents spanning 133 categories, intended to fill a gap in public resources for long-form legal drafting research.
  • It proposes a Model-Agnostic Wrapper (MAW) for structured legal document generation that separates section planning from per-section generation using retrieval-based prompts.
  • The approach is designed to be independent of any specific LLM, enabling use across both open- and closed-source models.
  • Evaluation across lexical, semantic, LLM-based, and expert/annotator-driven metrics—including inter-annotator agreement—finds MAW improves factual accuracy, coherence, and completeness over fine-tuned baselines.
  • The work delivers both a new benchmark dataset and a generalizable framework, aiming to accelerate Legal AI and structured legal text generation research in the Indian context.

Abstract

Automating legal document drafting can improve efficiency and reduce the burden of manual legal work. Yet, the structured generation of private legal documents remains underexplored, particularly in the Indian context, due to the scarcity of public datasets and the complexity of adapting models for long-form legal drafting. To address this gap, we introduce VidhikDastaavej, a large-scale, anonymized dataset of private legal documents curated in collaboration with an Indian law firm. Covering 133 diverse categories, this dataset is the first resource of its kind and provides a foundation for research in structured legal text generation and Legal AI more broadly. We further propose a Model-Agnostic Wrapper (MAW), a two-stage generation framework that first plans the section structure of a legal draft and then generates each section with retrieval-based prompts. MAW is independent of any specific LLM, making it adaptable across both open- and closed-source models. Comprehensive evaluation, including lexical, semantic, LLM-based, and expert-driven assessments with inter-annotator agreement, shows that the wrapper substantially improves factual accuracy, coherence, and completeness compared to fine-tuned baselines. This work establishes both a new benchmark dataset and a generalizable generation framework, paving the way for future research in AI-assisted legal drafting.