Vision-Language Based Expert Reporting for Painting Authentication and Defect Detection
arXiv cs.CV / 3/17/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- The paper presents a fully automated vision-language model (VLM) that operates without human intervention during inference, combining multi-modal pulsed active infrared thermography (AIRT) analysis with structured natural-language reporting for painting authentication and defect detection.
- It processes thermal sequences using Principal Component Thermography (PCT), Thermographic Signal Reconstruction (TSR), and Pulsed Phase Thermography (PPT), fusing anomaly masks into a consensus segmentation that guides the VLM’s reporting.
- The VLM generates reports describing anomaly location, thermal behavior, plausible physical interpretations, and explicitly notes uncertainty to enable explainable conservation decisions.
- Evaluations on two marquetries show consistent anomaly detection and stable, generalizable interpretations, indicating reproducibility across samples and potential for standardized documentation in cultural heritage contexts.
Related Articles

Interactive Web Visualization of GPT-2
Reddit r/artificial
[R] Causal self-attention as a probabilistic model over embeddings
Reddit r/MachineLearning
The 5 software development trends that actually matter in 2026 (and what they mean for your startup)
Dev.to
InVideo AI Review: Fast Finished
Dev.to

33 LangChain Alternatives That Won't Leak Your Data (2026 Guide)
Dev.to