MedForge: Interpretable Medical Deepfake Detection via Forgery-aware Reasoning

arXiv cs.AI / 3/20/2026

📰 NewsIdeas & Deep AnalysisModels & Research

共有:

Key Points

MedForge introduces a pre-hoc, evidence-grounded framework for detecting forgery in medical images to protect clinical trust and safety.
It debuts MedForge-90K, a large-scale dataset of realistic lesion edits across 19 pathologies with expert-guided reasoning supervision and gold edit locations.
MedForge-Reasoner adopts a localize-then-analyze approach that identifies suspicious regions before delivering a verdict, improving interpretability.
The approach uses Forgery-aware GSPO to strengthen grounding of explanations and reduce hallucinations in medical forgery detection.
Experiments report state-of-the-art detection accuracy and trustworthy, expert-aligned explanations.

Abstract

Text-guided image editors can now manipulate authentic medical scans with high fidelity, enabling lesion implantation/removal that threatens clinical trust and safety. Existing defenses are inadequate for healthcare. Medical detectors are largely black-box, while MLLM-based explainers are typically post-hoc, lack medical expertise, and may hallucinate evidence on ambiguous cases. We present MedForge, a data-and-method solution for pre-hoc, evidence-grounded medical forgery detection. We introduce MedForge-90K, a large-scale benchmark of realistic lesion edits across 19 pathologies with expert-guided reasoning supervision via doctor inspection guidelines and gold edit locations. Building on it, MedForge-Reasoner performs localize-then-analyze reasoning, predicting suspicious regions before producing a verdict, and is further aligned with Forgery-aware GSPO to strengthen grounding and reduce hallucinations. Experiments demonstrate state-of-the-art detection accuracy and trustworthy, expert-aligned explanations.

The massive shift toward edge computing and local processing

Dev.to

Self-Refining Agents in Spec-Driven Development

Dev.to

Week 3: Why I'm Learning 'Boring' ML Before Building with LLMs

Dev.to

The Three-Agent Protocol Is Transferable. The Discipline Isn't.

Dev.to

has anyone tried this? Flash-MoE: Running a 397B Parameter Model on a Laptop

Reddit r/LocalLLaMA

MedForge: Interpretable Medical Deepfake Detection via Forgery-aware Reasoning

Key Points

Abstract

Related Articles

The massive shift toward edge computing and local processing

Self-Refining Agents in Spec-Driven Development

Week 3: Why I'm Learning 'Boring' ML Before Building with LLMs

The Three-Agent Protocol Is Transferable. The Discipline Isn't.

has anyone tried this? Flash-MoE: Running a 397B Parameter Model on a Laptop

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer