WASD: Locating Critical Neurons as Sufficient Conditions for Explaining and Controlling LLM Behavior
arXiv cs.CL / 3/20/2026
📰 NewsModels & Research
Key Points
- WASD (unWeaving Actionable Sufficient Directives) is a new framework that explains LLM behavior by identifying sufficient neuron-activation predicates, enabling more natural language controllability over outputs.
- It represents candidate conditions as neuron-activation predicates and iteratively searches for a minimal subset that guarantees the current output under input perturbations.
- The approach outperforms conventional attribution graphs in stability, accuracy, and conciseness in SST-2 and CounterFact experiments using the Gemma-2-2B model.
- A case study on cross-lingual output generation demonstrates WASD's practical effectiveness in controlling model behavior for multilingual tasks.
Related Articles

Interesting loop
Reddit r/LocalLLaMA
Qwen3.5-122B-A10B Uncensored (Aggressive) — GGUF Release + new K_P Quants
Reddit r/LocalLLaMA
FeatherOps: Fast fp8 matmul on RDNA3 without native fp8
Reddit r/LocalLLaMA

VerityFlow-AI: Engineering a Multi-Agent Swarm for Real-Time Truth-Validation and Deep-Context Media Synthesis
Dev.to
: [R] Sinc Reconstruction for LLM Prompts: Applying Nyquist-Shannon to the Specification Axis (275 obs, 97% cost reduction, open source)
Reddit r/MachineLearning