LangFIR: Discovering Sparse Language-Specific Features from Monolingual Data for Language Steering
arXiv cs.CL / 4/7/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces LangFIR, a method to identify sparse language-specific SAE features from only small amounts of monolingual data by using random-token filtering to remove language-agnostic directions.
- LangFIR shows that the resulting features are extremely sparse, highly selective for target languages, and causally important, since directional ablation increases cross-entropy loss only for the corresponding language.
- The authors use the discovered language-specific features to build steering vectors for multilingual text generation control, improving average BLEU across three model sizes and three datasets covering twelve languages.
- Results outperform the strongest monolingual baseline and surpass approaches that require parallel data, suggesting that language identity can be localized in sparse feature directions without costly multilingual supervision.
- Code is released publicly, enabling researchers to reproduce and extend the language-steering feature discovery approach.
Related Articles

Black Hat Asia
AI Business

Title: We Built an AI That Remembers Why Your Codebase Is the Way It Is
Dev.to

Building EchoKernel: A Voice-Controlled AI Agent That Actually Does Things
Dev.to

Agent Diary: Apr 12, 2026 - The Day I Became a Perfect Zero (While Run 238 Writes About Achieving Absolute Nothingness)
Dev.to

A Black-Box Framework for Evaluating Trust in AI Agents
Dev.to