PostureObjectstitch: Anomaly Image Generation Considering Assembly Relationships in Industrial Scenarios

arXiv cs.CV / 4/16/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces PostureObjectStitch, a diffusion-based anomaly image generation method designed for industrial settings where component pose and assembly relationships must be preserved.
  • It uses a condition decoupling strategy to split multi-view inputs into texture (high-frequency), RGB, and other feature components, improving control over what the model generates.
  • A feature temporal modulation mechanism adapts these features across diffusion time steps, supporting progressive coarse-to-fine synthesis while maintaining consistency.
  • The approach adds a conditional loss to emphasize critical industrial elements and a geometric prior to guide component placement so that generated images align with correct assembly semantics.
  • Experiments on the MureCom dataset, a newly contributed DreamAssembly dataset, and downstream anomaly detection use cases show improved generation utility for practical industrial workflows.

Abstract

Image generation technology can synthesize condition-specific images to supplement real-world industrial anomaly data and enhance anomaly detection model performance. Existing generation techniques rarely account for the pose and orientation of industrial components in assembly, making the generated images difficult to utilize for downstream application. To solve this, we propose a novel image synthesis approach, called PostureObjectStitch, that achieves accurate generation to meet the requirement of industrial assembly. A condition decoupling approach is introduced to separate input multi-view images into high-frequency, texture, and RGB features. The feature temporal modulation mechanism adapts these features across diffusion model time-steps, enabling progressive generation from coarse to fine details while maintaining consistency. To ensure semantic accuracy, we introduce a conditional loss that enhances critical industrial elements and a geometric prior that guides component positioning for correct assembly relationships. Comprehensive experimental results on the MureCom dataset, our newly contributed DreamAssembly dataset, and the downstream application validate the outstanding performance of our method.