LogicDiff: Logic-Guided Denoising Improves Reasoning in Masked Diffusion Language Models

Key Points

Masked diffusion language models (MDLMs) typically unmask tokens in order of model confidence, which tends to defer high-entropy logical connectives, the critical branching points in a reasoning chain.

Abstract

Masked diffusion language models (MDLMs) generate text by iteratively unmasking tokens from a fully masked sequence, offering parallel generation and bidirectional context. However, their standard confidence-based unmasking strategy systematically defers high-entropy logical connective tokens, the critical branching points in reasoning chains, leading to severely degraded reasoning performance. We introduce LogicDiff, an inference-time method that replaces confidence-based unmasking with logic-role-guided unmasking. A lightweight classification head (4.2M parameters, 0.05% of the base model) predicts the logical role of each masked position (premise, connective, derived step, conclusion, or filler) from the base model's hidden states with 98.4% accuracy. A dependency-ordered scheduler then unmasks tokens in logical dependency order: premises first, then connectives, then derived steps, then conclusions. Without modifying a single parameter of the base model and without any reinforcement learning or task-specific training, LogicDiff improves LLaDA-8B-Instruct accuracy from 22.0% to 60.7% on GSM8K (+38.7 percentage points) and from 23.6% to 29.2% on MATH-500 (+5.6 pp), with less than 6% speed overhead. Our results demonstrate that a substantial portion of the reasoning deficit in MDLMs is attributable to suboptimal token unmasking order, not to limitations of the model's learned representations.
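
The contrast between the two unmasking strategies can be illustrated with a small sketch. This is not the authors' implementation: the names `MaskedPosition`, `select_logic_guided`, and `select_by_confidence` are hypothetical, the role labels and confidences are made-up toy values rather than model outputs, and two details are assumptions not stated in the abstract, namely that filler tokens are unmasked last and that ties within a role are broken by confidence.

```python
from dataclasses import dataclass

# Dependency order from the abstract: premises, connectives, derived steps,
# conclusions. Placing "filler" last is an assumption made for this sketch.
ROLE_PRIORITY = {
    "premise": 0,
    "connective": 1,
    "derived": 2,
    "conclusion": 3,
    "filler": 4,
}


@dataclass(frozen=True)
class MaskedPosition:
    """A still-masked position with a predicted role (hypothetical container)."""
    index: int         # position in the sequence
    role: str          # role predicted by the classification head
    confidence: float  # base model's confidence in its top token


def select_by_confidence(masked, k):
    """Baseline: unmask the k positions with the highest confidence."""
    ranked = sorted(masked, key=lambda p: (-p.confidence, p.index))
    return [p.index for p in ranked[:k]]


def select_logic_guided(masked, k):
    """Unmask k positions in role order; confidence tie-break is an assumption."""
    ranked = sorted(
        masked,
        key=lambda p: (ROLE_PRIORITY[p.role], -p.confidence, p.index),
    )
    return [p.index for p in ranked[:k]]


# Toy step: the connective has low confidence (high entropy), so the
# confidence baseline postpones it while the role-ordered rule does not.
masked = [
    MaskedPosition(0, "premise", 0.91),
    MaskedPosition(1, "connective", 0.32),
    MaskedPosition(2, "derived", 0.85),
    MaskedPosition(3, "conclusion", 0.95),
    MaskedPosition(4, "filler", 0.99),
]

baseline_pick = select_by_confidence(masked, 2)
guided_pick = select_logic_guided(masked, 2)
print("confidence-based:", baseline_pick)
print("logic-guided:    ", guided_pick)
```

In this toy step the confidence baseline picks the filler and conclusion positions first and leaves the connective for later, whereas the role-ordered rule commits the premise and the connective before anything that depends on them. In the method described above, the roles would come from the classification head applied to the base model's hidden states at each denoising step.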
