📰 Lilian Weng's Blog
Articles from Lilian Weng's Blog · 49 articles · Updated every 3 hours · View all news
All
⚡ AI Lessons (4905)
ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIWeaviate Blog
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
11mo ago
Why We Think
Special thanks to John Schulman for a lot of super valuable feedback and direct edits on this post. Test time compute ( Graves et al. 2016 , Ling, et al. 2017 ,
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
1y ago
Reward Hacking in Reinforcement Learning
Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely l
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
1y ago
Extrinsic Hallucinations in LLMs
Hallucination in large language models usually refers to the model generating unfaithful, fabricated, inconsistent, or nonsensical content. As a term, hallucina
Lilian Weng's Blog
⚡ AI Lesson
1y ago
Diffusion Models for Video Generation
Diffusion models have demonstrated strong results on image synthesis in past years. Now the research community has started working on a harder task—using
Lilian Weng's Blog
⚡ AI Lesson
2y ago
Thinking about High-Quality Human Data
[Special thank you to Ian Kivlichan for many useful pointers (E.g. the 100+ year old Nature paper “Vox populi”) and nice feedback. 🙏 ] High-quality
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
2y ago
Adversarial Attacks on LLMs
The use of large language models in the real world has strongly accelerated by the launch of ChatGPT. We (including my team at OpenAI, shoutout to them) have in
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
2y ago
LLM Powered Autonomous Agents
Building agents with LLM (large language model) as its core controller is a cool concept. Several proof-of-concepts demos, such as AutoGPT , GPT-Engineer and Ba
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Prompt Engineering
Prompt Engineering , also known as In-Context Prompting , refers to methods for how to communicate with LLM to steer its behavior for desired outcomes without u
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
The Transformer Family Version 2.0
Many new Transformer architecture improvements have been proposed since my last post on “The Transformer Family” about three years ago. Here I did a
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Large Transformer Model Inference Optimization
[Updated on 2023-01-24: add a small section on Distillation .] Large transformer models are mainstream nowadays, creating SoTA results for a variety of tasks. T
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
3y ago
Some Math behind Neural Tangent Kernel
Neural networks are well known to be over-parameterized and can often easily fit data with near-zero training loss with decent generalization performance on tes
Lilian Weng's Blog
⚡ AI Lesson
3y ago
Generalized Visual Language Models
Processing images to generate text, such as image captioning and visual question-answering, has been studied for years. Traditionally such systems rely on an ob
Lilian Weng's Blog
⚡ AI Lesson
3y ago
Learning with not Enough Data Part 3: Data Generation
Here comes the Part 3 on learning with not enough data (Previous: Part 1 and Part 2 ). Let’s consider two approaches for generating synthetic data for training.
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
4y ago
Learning with not Enough Data Part 2: Active Learning
This is part 2 of what to do when facing a limited amount of labeled data for supervised learning tasks. This time we will get some amount of human labeling wor
Lilian Weng's Blog
⚡ AI Lesson
4y ago
Learning with not Enough Data Part 1: Semi-Supervised Learning
When facing a limited amount of labeled data for supervised learning tasks, four appr
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
4y ago
How to Train Really Large Models on Many GPUs?
[Updated on 2022-03-13: add expert choice routing .] [U
Lilian Weng's Blog
⚡ AI Lesson
4y ago
What are Diffusion Models?
[Updated on 2021-09-19: Highly recommend this blog post on score-based generative modeling
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
4y ago
Contrastive Representation Learning
The goal of contrastive representation learning is to learn such an embedding space in which similar sample pairs stay close to each other while dissimilar ones
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
5y ago
Reducing Toxicity in Language Models
Large pretrained language models are trained over a sizable collection of online data. They unavoidably acquire certain toxic behavior
Lilian Weng's Blog
⚡ AI Lesson
5y ago
Controllable Neural Text Generation
[Updated on 2021-02-01: Updated to version 2.0 with several wor
Lilian Weng's Blog
⚡ AI Lesson
5y ago
How to Build an Open-Domain Question Answering System?
[Updated on 2020-11-12: add an example on closed-book factual QA using OpenAI API (beta).
Lilian Weng's Blog
⚡ AI Lesson
5y ago
Neural Architecture Search
Although most popular and successful model architectures are designed by hum
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
5y ago
Exploration Strategies in Deep Reinforcement Learning
[Updated on 2020-06-17: Add “exploration via disagreement” in the “Forward Dynamics” section . Exploitation versus ex
Lilian Weng's Blog
⚡ AI Lesson
5y ago
The Transformer Family
[Updated on 2023-01-27 : After almost three years, I did a big refactoring update of this post to incorporate a bunch of new Transformer models since 2020. The
DeepCamp AI