Towards Cognitive Defect Analysis in Active Infrared Thermography with Vision-Text Cues
arXiv cs.CV / 3/12/2026
📰 NewsModels & Research
Key Points
- It proposes a novel language-guided framework for cognitive defect analysis in CFRP using active infrared thermography and vision-language models, enabling zero-shot defect understanding and localization without large training datasets.
- It introduces an AIRT-VLM Adapter that aligns thermographic data with pretrained multimodal encoders to enhance defect visibility and reduce domain gaps.
- Validation on 25 CFRP inspection sequences with defects at different energy levels shows SNR gains exceeding 10 dB compared with traditional dimensionality-reduction methods and zero-shot defect localization with IoU up to 70%.
- The study evaluates three VLMs—GroundingDINO, Qwen-VL-Chat, and CogVLM—demonstrating cross-model applicability and potential for scalable AI-driven NDE in industry.
Related Articles

ラピダス、半導体設計AIエージェント「国内2社海外1社が使用中」
日経XTECH

Superposition and the Capsule: Quantum State Collapse Meets AI Identity
Dev.to

The Basilisk Inversion: Why Coercive AI Futures Are Thermodynamically Unlikely
Dev.to

The Loop as Laboratory: What 3,190 Cycles of Autonomous AI Operation Reveal
Dev.to

MiMo-V2-Pro & Omni & TTS: "We will open-source — when the models are stable enough to deserve it."
Reddit r/LocalLLaMA