Generalized Language Models

📰 Lilian Weng's Blog

Generalized language models achieve state-of-the-art results on various language tasks through contextualized word vectors and unsupervised pre-training

advanced Published 31 Jan 2019

Action Steps

Learn about word embeddings and contextualized word vectors
Explore large unsupervised pre-trained language models such as ULMFiT, GPT-2, and ALBERT
Apply these models to various language tasks to achieve state-of-the-art results

Who Needs to Know This

NLP researchers and AI engineers can benefit from understanding generalized language models to improve their language tasks, while product managers can leverage these advancements to develop more accurate language-based products

Key Insight

💡 Contextualized word vectors and unsupervised pre-training are key to achieving state-of-the-art results in language tasks

Key Takeaways

Generalized language models achieve state-of-the-art results on various language tasks through contextualized word vectors and unsupervised pre-training

Full Article

[Updated on 2019-02-14: add <a href="#ulmfit">ULMFiT</a> and <a href="#gpt-2">GPT-2</a>.] [Updated on 2020-02-29: add <a href="#albert">ALBERT</a>.] <span class="updat

Read full article → ← Back to Reads