LegalDrill: Diagnosis-Driven Synthesis for Legal Reasoning in Small Language Models

arXiv cs.CL / 4/28/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes LegalDrill, a diagnosis-driven synthesis framework to improve small language models’ performance on high-stakes legal reasoning involving statute interpretation and logical consistency.
  • Instead of collecting expensive expert reasoning traces, LegalDrill generates and iteratively refines reasoning trajectories from a strong teacher using fine-grained prompting and then applies self-reflective verification to select the most effective training data.
  • The selected data are used to train students via supervised fine-tuning and direct preference optimization, targeting both reasoning coherence and deduction consistency.
  • Experiments across multiple legal benchmarks show significant gains for representative small models while avoiding the need for scarce expert annotations, suggesting a scalable route to practical legal reasoning systems.
  • The approach also addresses a key limitation of standard data curation (e.g., rejection sampling), which often lacks the granularity needed beyond final verdict labels.

Abstract

Small language models (SLMs) are promising for real-world deployment due to their efficiency and low operational cost. However, their limited capacity struggles with high-stakes legal reasoning tasks that require coherent statute interpretation and logically consistent deduction. Furthermore, training SLMs for such tasks demands high-quality, concise reasoning trajectories, which are prohibitively expensive to manually collect and difficult to curate via standard rejection sampling, lacking granularity beyond final verdicts. To address these challenges, we propose {LegalDrill}, a diagnosis-driven synthesis framework that extracts and iteratively refines reasoning trajectories from a capable teacher via fine-grained prompting, then a self-reflective verification is employed to adaptively select the most effective data for the SLM student. The resulting data empower SLM training through supervised fine-tuning and direct preference optimization. Extensive experiments on several legal benchmarks demonstrate that {LegalDrill} significantly bolsters the legal reasoning capabilities of representative SLMs while bypassing the need for scarce expert annotations, paving a scalable path toward practical legal reasoning systems.