AI Navigate

Conflict-Free Policy Languages for Probabilistic ML Predicates: A Framework and Case Study with the Semantic Router DSL

arXiv cs.LG / 3/20/2026

📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • It introduces a three-level decidability framework for conflicts in policy languages when signals are probabilistic ML predicates, showing crisp conflicts are SAT-decidable, embedding conflicts relate to spherical geometry, and classifier conflicts are undecidable without distributional knowledge.
  • It identifies embedding conflicts as the dominant practical issue and proposes using a temperature-scaled softmax to partition embedding space into Voronoi regions to prevent co-firing without retraining.
  • The mechanisms for conflict detection and prevention have been implemented in the Semantic Router DSL, a production routing language for LLM inference, illustrating practical applicability.
  • The work discusses broader applicability to semantic RBAC and API gateway policy beyond routing, indicating potential cross-domain impact.
  • The paper argues that existing policy languages assume crisp predicates, and this framework enables conflict-free operation in probabilistic ML-driven routing.

Abstract

Conflict detection in policy languages is a solved problem -- as long as every rule condition is a crisp Boolean predicate. BDDs, SMT solvers, and NetKAT all exploit that assumption. But a growing class of routing and access-control systems base their decisions on probabilistic ML signals: embedding similarities, domain classifiers, complexity estimators. Two such signals, declared over categories the author intended to be disjoint, can both clear their thresholds on the same query and silently route it to the wrong model. Nothing in the compiler warns about this. We characterize the problem as a three-level decidability hierarchy -- crisp conflicts are decidable via SAT, embedding conflicts reduce to spherical cap intersection, and classifier conflicts are undecidable without distributional knowledge -- and show that for the embedding case, which dominates in practice, replacing independent thresholding with a temperature-scaled softmax partitions the embedding space into Voronoi regions where co-firing is impossible. No model retraining is needed. We implement the detection and prevention mechanisms in the Semantic Router DSL, a production routing language for LLM inference, and discuss how the same ideas apply to semantic RBAC and API gateway policy.