Neuro-Symbolic Process Anomaly Detection

arXiv cs.AI / 3/30/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses process anomaly detection in process mining, noting that neural methods trained only from event logs can mislabel rare but conformant behavior as anomalies due to frequency-based statistical limitations.
  • It proposes a neuro-symbolic framework that embeds human domain knowledge into neural anomaly detection by combining Logic Tensor Networks (LTN) with Declare constraints.
  • Using autoencoders as the base model, Declare constraints are incorporated as soft logical “guiderails” during learning to better separate truly anomalous traces from rare yet valid ones.
  • Experiments on both synthetic and real-world datasets show improved F1 scores, including scenarios with as few as 10 conformant traces available.
  • The results indicate that selecting the right Declare constraints—and thus the quality of the provided domain knowledge—substantially affects the performance gains.

Abstract

Process anomaly detection is an important application of process mining for identifying deviations from the normal behavior of a process. Neural network-based methods have recently been applied to this task, learning directly from event logs without requiring a predefined process model. However, since anomaly detection is a purely statistical task, these models fail to incorporate human domain knowledge. As a result, rare but conformant traces are often misclassified as anomalies due to their low frequency, which limits the effectiveness of the detection process. Recent developments in the field of neuro-symbolic AI have introduced Logic Tensor Networks (LTN) as a means to integrate symbolic knowledge into neural networks using real-valued logic. In this work, we propose a neuro-symbolic approach that integrates domain knowledge into neural anomaly detection using LTN and Declare constraints. Using autoencoder models as a foundation, we encode Declare constraints as soft logical guiderails within the learning process to distinguish between anomalous and rare but conformant behavior. Evaluations on synthetic and real-world datasets demonstrate that our approach improves F1 scores even when as few as 10 conformant traces exist, and that the choice of Declare constraint and by extension human domain knowledge significantly influences performance gains.