Adaptive Cost-Efficient Evaluation for Reliable Patent Claim Validation

arXiv cs.CL / 4/7/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces ACE (Adaptive Cost-efficient Evaluation) to validate patent claims with zero-defect tolerance despite the tradeoff between lightweight encoders’ limited legal reasoning and LLMs’ high evaluation cost.
  • ACE uses predictive entropy to selectively route only high-uncertainty claims to an expert LLM for a Chain of Patent Thought (CoPT) procedure aligned with 35 U.S.C. statutory standards.
  • The approach is designed to better capture long-range legal dependencies while maintaining operational efficiency.
  • Experiments report an F1 score of 94.95% and a 78% reduction in costs versus using LLMs for all claims.
  • The authors release ACE-40k, a 40,000-claim benchmark with MPEP-grounded error annotations to support subsequent research.

Abstract

Automated validation of patent claims demands zero-defect tolerance, as even a single structural flaw can render a claim legally defective. Existing evaluation paradigms suffer from a rigidity-resource dilemma: lightweight encoders struggle with nuanced legal dependencies, while exhaustive verification via Large Language Models (LLMs) is prohibitively costly. To bridge this gap, we propose ACE (Adaptive Cost-efficient Evaluation), a hybrid framework that uses predictive entropy to route only high-uncertainty claims to an expert LLM. The expert then executes a Chain of Patent Thought (CoPT) protocol grounded in 35 U.S.C. statutory standards. This design enables ACE to handle long-range legal dependencies more effectively while preserving efficiency. ACE achieves the best F1 among the evaluated methods at 94.95\%, while reducing operational costs by 78\% compared to standalone LLM deployments. We also construct ACE-40k, a 40,000-claim benchmark with MPEP-grounded error annotations, to facilitate further research.