AI Navigate

One-to-More: High-Fidelity Training-Free Anomaly Generation with Attention Control

arXiv cs.CV / 3/20/2026

📰 NewsModels & Research

Key Points

  • O2MAG is introduced as a training-free few-shot anomaly generation method that uses self-attention from a reference anomalous image to synthesize more realistic anomalies for industrial anomaly detection.
  • The method leverages three parallel diffusion processes with self-attention grafting and incorporates an anomaly mask to reduce foreground-background query confusion while enabling text-guided anomaly synthesis.
  • Anomaly-Guided Optimization is proposed to better align generated anomalies with the target anomalous distribution, enhancing realism and text consistency.
  • Dual-Attention Enhancement reinforces both self- and cross-attention on masked regions to mitigate faint anomaly synthesis inside anomaly masks, and extensive experiments show it outperforms prior state-of-the-art methods on downstream AD tasks.

Abstract

Industrial anomaly detection (AD) is characterized by an abundance of normal images but a scarcity of anomalous ones. Although numerous few-shot anomaly synthesis methods have been proposed to augment anomalous data for downstream AD tasks, most existing approaches require time-consuming training and struggle to learn distributions that are faithful to real anomalies, thereby restricting the efficacy of AD models trained on such data. To address these limitations, we propose a training-free few-shot anomaly generation method, namely O2MAG, which leverages the self-attention in One reference anomalous image to synthesize More realistic anomalies, supporting effective downstream anomaly detection. Specifically, O2MAG manipulates three parallel diffusion processes via self-attention grafting and incorporates the anomaly mask to mitigate foreground-background query confusion, synthesizing text-guided anomalies that closely adhere to real anomalous distributions. To bridge the semantic gap between the encoded anomaly text prompts and the true anomaly semantics, Anomaly-Guided Optimization is further introduced to align the synthesis process with the target anomalous distribution, steering the generation toward realistic and text-consistent anomalies. Moreover, to mitigate faint anomaly synthesis inside anomaly masks, Dual-Attention Enhancement is adopted during generation to reinforce both self- and cross-attention on masked regions. Extensive experiments validate the effectiveness of O2MAG, demonstrating its superior performance over prior state-of-the-art methods on downstream AD tasks.