ExVerus: Verus Proof Repair via Counterexample Reasoning

arXiv cs.LG / 3/30/2026

📰 NewsIdeas & Deep AnalysisModels & Research

共有:

Key Points

EXVERUSは、失敗したVerusの証明に対して反例（counterexample）を自動生成・検証し、その行動的フィードバックをLLMの推論に取り込む反例ガイド型フレームワークを提案している。
従来のようにソースコード全体から静的に推論するのではなく、反例を一般化して帰納的不変条件（inductive invariants）を作り、同種の失敗を再発させないことを狙っている。
評価では、既存のプロンプトベースのVerus証明生成手法と比べて、証明の正確性・頑健性が向上し、さらにトークン効率も改善したと報告している。
この研究は、形式検証におけるLLM利用において、検証器のフィードバックを単なるエラー情報としてでなく“具体的なプログラム挙動（反例）”として活用する方向性を示している。

Abstract

Large Language Models (LLMs) have shown promising results in automating formal verification. However, existing approaches treat proof generation as a static, end-to-end prediction over source code, relying on limited verifier feedback and lacking access to concrete program behaviors. We present EXVERUS, a counterexample-guided framework that enables LLMs to reason about proofs using behavioral feedback via counterexamples. When a proof fails, EXVERUS automatically generates and validates counterexamples, and then guides the LLM to generalize them into inductive invariants to block these failures. Our evaluation shows that EXVERUS significantly improves proof accuracy, robustness, and token efficiency over the state-of-the-art prompting-based Verus proof generator.

💡 Insights using this article

This article is featured in our daily AI news digest — key takeaways and action items at a glance.

📅 3/30DailyView insight →

Freedom and Constraints of Autonomous Agents — Self-Modification, Trust Boundaries, and Emergent Gameplay

Dev.to

Von Hammerstein’s Ghost: What a Prussian General’s Officer Typology Can Teach Us About AI Misalignment

Reddit r/artificial

Stop Tweaking Prompts: Build a Feedback Loop Instead

Dev.to

Privacy-Preserving Active Learning for autonomous urban air mobility routing under real-time policy constraints

Dev.to

The Prompt Tax: Why Every AI Feature Costs More Than You Think

Dev.to

ExVerus: Verus Proof Repair via Counterexample Reasoning

Key Points

Abstract

💡 Insights using this article

Related Articles

Freedom and Constraints of Autonomous Agents — Self-Modification, Trust Boundaries, and Emergent Gameplay

Von Hammerstein’s Ghost: What a Prussian General’s Officer Typology Can Teach Us About AI Misalignment

Stop Tweaking Prompts: Build a Feedback Loop Instead

Privacy-Preserving Active Learning for autonomous urban air mobility routing under real-time policy constraints

The Prompt Tax: Why Every AI Feature Costs More Than You Think

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer