Towards Cognitive Defect Analysis in Active Infrared Thermography with Vision-Text Cues
arXiv cs.CV / 3/12/2026
📰 NewsModels & Research
Key Points
- It proposes a novel language-guided framework for cognitive defect analysis in CFRP using active infrared thermography and vision-language models, enabling zero-shot defect understanding and localization without large training datasets.
- It introduces an AIRT-VLM Adapter that aligns thermographic data with pretrained multimodal encoders to enhance defect visibility and reduce domain gaps.
- Validation on 25 CFRP inspection sequences with defects at different energy levels shows SNR gains exceeding 10 dB compared with traditional dimensionality-reduction methods and zero-shot defect localization with IoU up to 70%.
- The study evaluates three VLMs—GroundingDINO, Qwen-VL-Chat, and CogVLM—demonstrating cross-model applicability and potential for scalable AI-driven NDE in industry.
Related Articles
[R] Combining Identity Anchors + Permission Hierarchies achieves 100% refusal in abliterated LLMs — system prompt only, no fine-tuning
Reddit r/MachineLearning
[P] Vibecoded on a home PC: building a ~2700 Elo browser-playable neural chess engine with a Karpathy-inspired AI-assisted research loop
Reddit r/MachineLearning
Meet DuckLLM 1.0 My First Model!
Reddit r/LocalLLaMA
Since FastFlowLM added support for Linux, I decided to benchmark all the models they support, here are some results
Reddit r/LocalLLaMA
What measure do I use to compare nested models and non nested models in high dimensional survival analysis [D]
Reddit r/MachineLearning