SHAP Is Not Production-Ready — And We Need to Stop Pretending It Is

Dev.to / 4/15/2026


Key Points

  • The article argues that many production explainable-AI (XAI) setups are fundamentally unreliable because methods like SHAP can be too slow and yield inconsistent explanations.
  • In a real-time fraud system test, the author reports KernelExplainer SHAP adds ~30 ms per prediction, produces different explanations across runs, and depends on background data and sampling.
  • The author claims the core architectural mistake is separating the model’s prediction from a separate, stochastic explainer rather than tying explanations directly to the deterministic forward computation.
  • As an alternative, the author reports removing SHAP and generating explanations inside the forward pass using a neuro-symbolic model with symbolic rules, achieving ~0.9 ms per prediction and deterministic outputs.
  • The piece concludes that the real challenge is architecture for deployable explainability—supporting real-time latency, auditability, and consistency—rather than trying to “fix” SHAP alone.

This might be unpopular, but it needs to be said:

Most explainable AI setups are fundamentally broken in production.

Not because they’re inaccurate.

Because they’re too slow, inconsistent, and disconnected from the model itself.

The uncomfortable reality

In a real-time fraud system, I tested SHAP (KernelExplainer):

  • ~30 ms per prediction
  • Run it twice → different explanations
  • Requires background data + sampling
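To see where the run-to-run drift comes from, here is a minimal, dependency-free sketch of the kind of Monte-Carlo attribution KernelExplainer performs (the real API is `shap.KernelExplainer(model.predict, background)`); the toy model, features, and sample budget below are invented for illustration:

```python
import random

# Toy deterministic "fraud model" over three features (illustrative only).
def model(x):
    return 0.5 * x[0] + 0.3 * x[1] + 0.2 * x[2]

def sampled_attribution(x, background, n_samples=200, seed=None):
    """Monte-Carlo attribution in the spirit of KernelSHAP: estimate each
    feature's contribution by swapping it in over randomly drawn background
    rows. Left unseeded, every call draws different samples."""
    rng = random.Random(seed)
    contribs = [0.0] * len(x)
    for _ in range(n_samples):
        ref = rng.choice(background)      # random background row
        for i in range(len(x)):
            masked = list(ref)
            masked[i] = x[i]              # turn feature i "on"
            contribs[i] += (model(masked) - model(ref)) / n_samples
    return contribs

background = [[0, 0, 0], [1, 2, 3], [4, 1, 0]]
x = [10.0, 5.0, 1.0]

run1 = sampled_attribution(x, background)  # no seed: a fresh random draw
run2 = sampled_attribution(x, background)
print(run1)
print(run2)  # same input, (almost always) a different explanation
```

Seeding fixes reproducibility within one process, but any change to the background set or sample budget still shifts the attributions — which is exactly the audit-log problem.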

Now ask yourself:

Would you ship a system where:

  • explanations change every time
  • latency is unpredictable
  • and audit logs aren’t deterministic?

Because that’s exactly what we’re doing.

The core mistake

We’ve accepted this architecture:

Model → Prediction → Separate Explainer → Explanation

That separation is the problem.

You’re trying to explain a deterministic system with a stochastic process… and calling it reliable.

I tried something different

Instead of improving SHAP…

I removed it.

Built a model where:

  • symbolic rules run alongside the neural network
  • explanations are generated inside the forward pass

No post-processing. No sampling.
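The full implementation is in the write-up linked below; as a rough sketch of the shape of the idea (the rule names, thresholds, weights, and neural stub here are all hypothetical, not the author's actual rule set):

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Rule:
    name: str        # human-readable reason code for the audit log
    feature: str
    threshold: float
    weight: float

# Hypothetical rules -- the real rule set lives in the linked experiment.
RULES = [
    Rule("amount_above_5k",   "amount",       5000.0, 0.40),
    Rule("many_txns_last_hr", "txn_count_1h",   10.0, 0.25),
    Rule("far_from_home",     "geo_dist_km",   500.0, 0.20),
]

def predict_with_explanation(features, neural_score):
    """One forward pass: the rules that fire contribute to the score AND
    are the explanation. No background data, no sampling -- identical
    inputs always produce identical explanations."""
    fired = [r for r in RULES if features[r.feature] > r.threshold]
    score = neural_score + sum(r.weight for r in fired)
    return score, [r.name for r in fired]

# neural_score would come from the network's own forward pass; stubbed here.
txn = {"amount": 7200.0, "txn_count_1h": 3, "geo_dist_km": 812.0}
score, why = predict_with_explanation(txn, neural_score=0.31)
print(why)  # fires 'amount_above_5k' and 'far_from_home'
```

Because the explanation is a by-product of the same deterministic pass, logging it costs a list comprehension, not a second model evaluation.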

What happened

  • 0.9 ms per prediction + explanation
  • 33× faster than SHAP
  • Deterministic outputs
  • Same fraud recall as the baseline
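Those two properties — determinism and bounded latency — are checkable in CI before anything ships. A small generic harness might look like this (the stub predictor and the 1 ms budget are stand-ins, not the author's benchmark code):

```python
import time

def check_deterministic(predict, inputs, runs=3):
    """Re-run every prediction and confirm the outputs (including the
    explanation) never change between runs."""
    baseline = [predict(x) for x in inputs]
    return all([predict(x) for x in inputs] == baseline for _ in range(runs - 1))

def mean_latency_ms(predict, inputs):
    """Average wall-clock cost per prediction, in milliseconds."""
    start = time.perf_counter()
    for x in inputs:
        predict(x)
    return (time.perf_counter() - start) * 1000.0 / len(inputs)

# Stub standing in for a model whose forward pass returns (score, explanation).
def predict(x):
    return (sum(x), ["high_sum"] if sum(x) > 10 else [])

inputs = [[1, 2], [8, 7], [0, 0]] * 100
assert check_deterministic(predict, inputs)
assert mean_latency_ms(predict, inputs) < 1.0  # sub-millisecond budget
```

A sampling-based explainer would fail both gates at once: the explanation drifts between runs, and the per-prediction cost blows the latency budget.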

The explanation is no longer something you compute later.

It’s something the model already knows.

The bigger point

We don’t have an explainability problem.

We have an architecture problem.

As long as explanations are:

  • bolted on
  • slow
  • and probabilistic

they will never work in systems that need:

  • real-time decisions
  • auditability
  • and consistency

Full experiment (code + benchmark)

I documented everything here:

👉 Explainable AI in Production: A Neuro-Symbolic Model for Real-Time Fraud Detection

If you're building real systems, this is the question:

Do you want explanations that look good…

or explanations you can actually deploy?
