📰 Lilian Weng's Blog
Articles from Lilian Weng's Blog · 51 articles · Updated every 3 hours
Lilian Weng's Blog
⚡ AI Lesson
6y ago
Curriculum for Reinforcement Learning
[Updated on 2020-02-03: mentioning PCG in the “Task-Specific Curriculum” section.
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
6y ago
Self-Supervised Representation Learning
[Updated on 2020-01-09: add a new section on Contrastive Predictive Coding]. [Updated on 2020-04-13: add a “Momentum Contra
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
6y ago
Evolution Strategies
Stochastic gradient descent is a universal choice for optimizing deep learning models. However, it is not the only option. With bl
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
6y ago
Meta Reinforcement Learning
In my earlier post on meta-learning, the problem is mainly defined in the context of few-shot classificati
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
6y ago
Domain Randomization for Sim2Real Transfer
In Robotics, one of the hardest problems is how to make your model transfer to the real world. Due to the sample inefficiency of deep RL algorithms and the cost
Lilian Weng's Blog
⚡ AI Lesson
7y ago
Are Deep Neural Networks Dramatically Overfitted?
[Updated on 2019-05-27: add the section on Lottery Ticket Hypothesis.] If you are like me, entering into the field of deep learning with experience in tradition
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
7y ago
Generalized Language Models
[Updated on 2019-02-14: add ULMFiT and GPT-2.] [Updated on 2020-02-29: add ALBERT.]
Lilian Weng's Blog
👁️ Computer Vision
⚡ AI Lesson
7y ago
Object Detection Part 4: Fast Detection Models
In Part 3, we have reviewed models in the R-CNN family. All of them are region-based object detection algorithms. They can achieve hig
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
7y ago
Meta-Learning: Learning to Learn Fast
[Updated on 2019-10-01: thanks to Tianhao, we have
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
7y ago
Flow-based Deep Generative Models
So far, I’ve written about two types of generative models, GAN and VAE. Neither of them explicitly learns the probability density function of real da
Lilian Weng's Blog
7y ago
From Autoencoder to Beta-VAE
[Updated on 2019-07-18: add a section on VQ-VAE & VQ-VAE-2.] [Updat
Lilian Weng's Blog
⚡ AI Lesson
7y ago
Attention? Attention!
[Updated on 2018-10-28: Add Pointer Network and the link to my implementation of Transformer.]
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
7y ago
Implementing Deep Reinforcement Learning Models with Tensorflow + OpenAI Gym
The full implementation is available in lilianweng/deep-reinforcement-learning-gym. In the previous two posts, I have introduced the algorithms of many deep rein
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
7y ago
Policy Gradient Algorithms
[Updated on 2018-06-30: add two new policy gradient methods, SAC and D4PG.] [Updated on 2018-09-30: ad
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
8y ago
A (Long) Peek into Reinforcement Learning
[Updated on 2020-09-03: Updated the algorithm of SARSA and Q-learning so that the diffe
Lilian Weng's Blog
⚡ AI Lesson
8y ago
The Multi-Armed Bandit Problem and Its Solutions
The algorithms are implemented for Bernoulli bandit in lilianweng/multi-armed-bandit. Exploitation vs Exploration: The exploration vs exploitation dilemma exist
Lilian Weng's Blog
👁️ Computer Vision
⚡ AI Lesson
8y ago
Object Detection for Dummies Part 3: R-CNN Family
[Updated on 2018-12-20: Remove YOLO here. Part 4 will cover multiple fast object detection algorithms, including YOLO.] [Updated on 2018-12-27: Add bbox regress
Lilian Weng's Blog
👁️ Computer Vision
⚡ AI Lesson
8y ago
Object Detection for Dummies Part 2: CNN, DPM and Overfeat
Part 1 of the “Object Detection for Dummies” series introduced: (1) the concept of image gradient vector and how HOG algorithm summarizes the inform
Lilian Weng's Blog
👁️ Computer Vision
⚡ AI Lesson
8y ago
Object Detection for Dummies Part 1: Gradient Vector, HOG, and SS
I’ve never worked in the field of
Lilian Weng's Blog
⚡ AI Lesson
8y ago
Learning Word Embedding
Human vocabulary comes in free text. In ord
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
8y ago
Anatomize Deep Learning with Information Theory
Professor Naftali Tishby passed away in 2021. Hope the post can introduce his cool idea of information bottleneck to more people. Recently I watched the talk
Lilian Weng's Blog
🧠 Large Language Models
⚡ AI Lesson
8y ago
From GAN to WGAN
[Updated on 2018-09-30: thanks to Yoonju, we have this post translated in Korean!] [
Lilian Weng's Blog
⚡ AI Lesson
8y ago
How to Explain the Prediction of a Machine Learning Model?
Machine learning models have started penetrating into critical areas like health care, justice systems, and the financial industry. Thus to figure out how the m
Lilian Weng's Blog
⚡ AI Lesson
8y ago
Predict Stock Prices Using RNN: Part 2
In the Part 2 tutorial, I would like to continue the topic on stock price prediction and to endow the recurrent neural network that I have built in
DeepCamp AI