LangFIR: Discovering Sparse Language-Specific Features from Monolingual Data for Language Steering

📰 ArXiv cs.AI

LangFIR discovers sparse language-specific features from monolingual data for language steering in large language models

advanced Published 7 Apr 2026

Action Steps

Utilize sparse autoencoders (SAEs) to decompose residual streams in large language models
Identify language-specific directions in the residual stream using monolingual data
Add language-specific vectors to model activations at inference time for language steering
Evaluate the effectiveness of LangFIR in controlling the language of model outputs

Who Needs to Know This

NLP engineers and researchers on a team can benefit from LangFIR as it enables more reliable language control in multilingual models, while data scientists and AI engineers can apply the technique to improve model performance

Key Insight

💡 LangFIR enables reliable language control in multilingual models using monolingual data

Key Takeaways

LangFIR discovers sparse language-specific features from monolingual data for language steering in large language models

Full Article

Title: LangFIR: Discovering Sparse Language-Specific Features from Monolingual Data for Language Steering

Abstract:
arXiv:2604.03532v1 Announce Type: cross Abstract: Large language models (LLMs) show strong multilingual capabilities, yet reliably controlling the language of their outputs remains difficult. Representation-level steering addresses this by adding language-specific vectors to model activations at inference time, but identifying language-specific directions in the residual stream often relies on multilingual or parallel data that can be expensive to obtain. Sparse autoencoders (SAEs) decompose res

Read full paper → ← Back to Reads