LegalDrill: Diagnosis-Driven Synthesis for Legal Reasoning in Small Language Models

arXiv cs.CL / 4/28/2026

📰 NewsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper proposes LegalDrill, a diagnosis-driven synthesis framework to improve small language models’ performance on high-stakes legal reasoning involving statute interpretation and logical consistency.
Instead of collecting expensive expert reasoning traces, LegalDrill generates and iteratively refines reasoning trajectories from a strong teacher using fine-grained prompting and then applies self-reflective verification to select the most effective training data.
The selected data are used to train students via supervised fine-tuning and direct preference optimization, targeting both reasoning coherence and deduction consistency.
Experiments across multiple legal benchmarks show significant gains for representative small models while avoiding the need for scarce expert annotations, suggesting a scalable route to practical legal reasoning systems.
The approach also addresses a key limitation of standard data curation (e.g., rejection sampling), which often lacks the granularity needed beyond final verdict labels.

Abstract

Small language models (SLMs) are promising for real-world deployment due to their efficiency and low operational cost. However, their limited capacity struggles with high-stakes legal reasoning tasks that require coherent statute interpretation and logically consistent deduction. Furthermore, training SLMs for such tasks demands high-quality, concise reasoning trajectories, which are prohibitively expensive to manually collect and difficult to curate via standard rejection sampling, lacking granularity beyond final verdicts. To address these challenges, we propose {LegalDrill}, a diagnosis-driven synthesis framework that extracts and iteratively refines reasoning trajectories from a capable teacher via fine-grained prompting, then a self-reflective verification is employed to adaptively select the most effective data for the SLM student. The resulting data empower SLM training through supervised fine-tuning and direct preference optimization. Extensive experiments on several legal benchmarks demonstrate that {LegalDrill} significantly bolsters the legal reasoning capabilities of representative SLMs while bypassing the need for scarce expert annotations, paving a scalable path toward practical legal reasoning systems.

LLMs will be a commodity

Reddit r/artificial

Indian Developers: How to Build AI Side Income with $0 Capital in 2026

Dev.to

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally

Reddit r/LocalLLaMA

Dex lands $5.3M to grow its AI-driven talent matching platform

Tech.eu

AI Citation Registry: Why Daily Updates Leave No Time for Data Structuring

Dev.to

LegalDrill: Diagnosis-Driven Synthesis for Legal Reasoning in Small Language Models

Key Points

Abstract

Related Articles

LLMs will be a commodity

Indian Developers: How to Build AI Side Income with $0 Capital in 2026

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally

Dex lands $5.3M to grow its AI-driven talent matching platform

AI Citation Registry: Why Daily Updates Leave No Time for Data Structuring

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer