Structured Abductive-Deductive-Inductive Reasoning for LLMs via Algebraic Invariants

arXiv cs.AI / 4/20/2026

📰 NewsModels & Research

Key Points

  • The paper argues that current LLMs struggle with structured logical reasoning by mixing hypothesis generation and verification and by failing to clearly separate conjecture from validated knowledge.
  • It proposes an LLM-assisted abductive–deductive–inductive reasoning protocol (Peirce’s tripartite inference) that explicitly organizes reasoning steps rather than letting them blur together.
  • The framework enforces five algebraic invariants (“Gamma Quintet”), especially the “Weakest Link bound,” which limits any conclusion’s reliability to that of the least-supported premise in the inference chain.
  • The authors validate the “Weakest Link” idea by relating it to weakest-link resolution in possibilistic logic and by empirically testing it on chain-of-thought reasoning.
  • They provide a verified reference implementation, checking all invariants with a property-based test suite (100 properties) plus fuzz testing (16 tests) over 10^5+ generated cases.

Abstract

Large language models exhibit systematic limitations in structured logical reasoning: they conflate hypothesis generation with verification, cannot distinguish conjecture from validated knowledge, and allow weak reasoning steps to propagate unchecked through inference chains. We present a symbolic reasoning scaffold that operationalizes Peirce's tripartite inference -- abduction, deduction, and induction -- as an explicit protocol for LLM-assisted reasoning. The framework enforces logical consistency through five algebraic invariants (the Gamma Quintet), the strongest of which -- the Weakest Link bound -- ensures that no conclusion in a reasoning chain can exceed the reliability of its least-supported premise. This principle, independently grounded as weakest link resolution in possibilistic logic and empirically validated for chain-of-thought reasoning, prevents logical inconsistencies from accumulating across multi-step inference. We verify all invariants through a property-based testing suite of 100 properties and 16 fuzz tests over 10^5+ generated cases, providing a verified reference implementation of the invariants suitable as a foundation for future reasoning benchmarks.