When transformers learn "impossible" languages, what do they learn?

📰 ArXiv cs.AI

You'll learn how transformer language models learn 'impossible' languages and what this reveals about their linguistic capacities, which matters for understanding AI language acquisition limits

advanced Published 1 Jul 2026

Action Steps

Evaluate transformer language models on 'impossible' languages using sample efficiency metrics
Assess test-set perplexity to compare model performance on human and unnatural languages
Analyze linguistic capacities of transformer models to explain non-attestation in human languages
Compare results with theoretical motivations for language learnability
Apply findings to improve language model architecture and training data

Who Needs to Know This

AI engineers and researchers on a team benefit from this knowledge to improve language model performance and understand its limitations, while data scientists can apply these insights to develop more effective NLP models

Key Insight

💡 Transformer language models can acquire unnatural languages, but their performance is limited by sample efficiency and test-set perplexity, revealing insights into their linguistic capacities