Breaking the Rigid Prior: Towards Articulated 3D Anomaly Detection

arXiv cs.CV / 4/30/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • Existing 3D anomaly detection methods often assume a rigid geometric prior (pose-invariant normal shapes), which breaks down for articulated objects whose joint-driven motions create structured geometry changes.
  • The paper introduces ArtiAD, a new large-scale benchmark for articulated 3D anomaly detection, including 15,229 point clouds across 39 object categories with dense joint-angle variation and multiple structural anomaly types.
  • ArtiAD annotations include joint configurations and part-level motion labels, allowing researchers to disentangle pose-induced deformations from true structural defects, and it includes a seen/unseen articulation split for interpolation and extrapolation.
  • The proposed SPA-SDF baseline replaces the rigid prior with a pose-conditioned implicit representation (an articulation-independent structural prior plus a Fourier-encoded joint embedding) and recovers articulation by minimizing reconstruction energy.
  • SPA-SDF achieves AUROC of 0.884 (seen) and 0.874 (unseen), outperforming rigid-prior baselines, and the authors plan to publicly release the code and benchmark to support future work.

Abstract

Existing 3D anomaly detection methods are built on a rigid prior: normal geometry is pose-invariant and can be canonicalized through registration or alignment. This prior does not hold for articulated objects with hinge or sliding joints, where valid pose changes induce structured geometric variations that cannot be collapsed to a single canonical template, causing pose-induced deformations to be misidentified as anomalies while true structural defects are obscured. No existing benchmark addresses this challenge. We introduce ArtiAD, the first large-scale benchmark for articulated 3D anomaly detection, comprising 15,229 point clouds across 39 object categories with dense joint-angle variations and six structural anomaly types. Each sample is annotated with its joint configuration and part-level motion labels, enabling explicit disentanglement of pose-induced geometry from structural defects. ArtiAD also provides a seen/unseen articulation split to evaluate both interpolation and extrapolation to novel joint configurations. We propose Shape-Pose-Aware Signed Distance Field (SPA-SDF), a baseline that replaces the rigid prior with a continuous pose-conditioned implicit field, factorized into an articulation-independent structural prior and a Fourier-encoded joint embedding. At inference, the articulation state is recovered by minimizing reconstruction energy, and anomalies are identified as point-wise deviations from the learned manifold. SPA-SDF achieves 0.884 object-level AUROC on seen configurations and 0.874 on unseen configurations, substantially outperforming all rigid-based baselines. Our code and benchmark will be publicly released to facilitate future research.