Constraint-Based Analysis of Reasoning Shortcuts in Neurosymbolic Learning

arXiv cs.AI / 4/28/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies “reasoning shortcuts” in neurosymbolic learning, where models satisfy logical constraints during training without learning the correct concept-to-label mapping.
  • It formalizes shortcut-freeness as a constraint satisfaction problem and analyzes when the intended concept mapping becomes uniquely determined by the constraints.
  • The authors prove a necessary discrimination property for shortcut-freeness under bijective mappings, but show it is not sufficient even with a connected constraint graph.
  • They introduce an ASP-based method with proven soundness and completeness to verify whether a constraint set uniquely determines the mapping, and propose a greedy repair procedure to eliminate detected shortcuts.
  • The work also provides complexity results (e.g., coNP-complete decision, #P-complete counting, NP-hard minimal repair) and sample complexity bounds, with experiments confirming performance across eight benchmark domains.

Abstract

Neurosymbolic systems can satisfy logical constraints during learning without achieving the intended concept-label correspondence; this is a problem known as reasoning shortcuts. We formalize reasoning shortcuts as a constraint satisfaction problem and investigate under which conditions concept mappings are uniquely determined by the constraints. We prove that a discrimination property (requiring that no valid concept mapping can be transformed into another valid mapping by swapping two concept values) is necessary for shortcut-freeness under bijective mappings, but demonstrate via a counterexample that it is insufficient even when the constraint graph is connected. We develop an ASP-based algorithm that verifies whether a given constraint set uniquely determines the intended concept mapping, with proven soundness and completeness. When shortcuts are detected, a greedy repair algorithm eliminates them by augmenting the constraint set, converging in at most k iterations, where k is the number of alternative valid mappings. We further provide a complexity classification: deciding shortcut-freeness is coNP-complete, counting shortcuts is #P-complete, and finding minimal repairs is NP-hard. We also establish sample complexity bounds showing that logarithmically many label queries suffice for disambiguation in favorable cases, while querying all ambiguous positions suffices in the worst case. Experiments across eight benchmark domains validate our approach.