Efficient Visual Anomaly Detection at the Edge: Enabling Real-Time Industrial Inspection on Resource-Constrained Devices

arXiv cs.CV / 3/24/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses the need for real-time and privacy-preserving visual anomaly detection in industrial quality control by targeting deployment on resource-constrained edge devices rather than cloud systems.
  • It introduces two lightweight edge-oriented variants, PatchCore-Lite and PaDiM-Lite, designed to reduce memory and computation while preserving anomaly detection performance.
  • PatchCore-Lite accelerates inference via a two-stage search that uses a product-quantized memory bank for coarse matching followed by exact search on only a decoded subset.
  • PaDiM-Lite speeds up anomaly scoring by using diagonal covariance to make Mahalanobis distance computable through efficient element-wise operations.
  • Experiments on MVTec AD and VisA benchmarks report major efficiency improvements, including a 79% memory reduction for PatchCore-Lite and a 77% memory reduction plus a 31% inference-time decrease for PaDiM-Lite, indicating suitability for edge industrial inspection.

Abstract

Visual Anomaly Detection (VAD) is essential for industrial quality control, enabling automatic defect detection in manufacturing. In real production lines, VAD systems must satisfy strict real-time and privacy requirements, necessitating a shift from cloud-based processing to local edge deployment. However, processing data locally on edge devices introduces new challenges because edge hardware has limited memory and computational resources. To overcome these limitations, we propose two efficient VAD methods designed for edge deployment: PatchCore-Lite and Padim-Lite, based on the popular PatchCore and PaDiM models. PatchCore-Lite runs first a coarse search on a product-quantized memory bank, then an exact search on a decoded subset. Padim-Lite is sped up using diagonal covariance, turning Mahalanobis distance into efficient element-wise computation. We evaluate our methods on the MVTec AD and VisA benchmarks and show their suitability for edge environments. PatchCore-Lite achieves a remarkable 79% reduction in total memory footprint, while PaDiM-Lite achieves substantial efficiency gains with a 77% reduction in total memory and a 31% decrease in inference time. These results show that VAD can be effectively deployed on edge devices, enabling real-time, private, and cost-efficient industrial inspection.