AI News — Latest Developments & Breakthroughs

Lilian Weng's Blog ⚡ AI Lesson 6y ago

Curriculum for Reinforcement Learning

[Updated on 2020-02-03: mentioning PCG in the “Task-Specific Curriculum” section. <spa

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 6y ago

Self-Supervised Representation Learning

[Updated on 2020-01-09: add a new section on Contrastive Predictive Coding ]. [Updated on 2020-04-13: add a “Momentum Contra

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 6y ago

Evolution Strategies

Stochastic gradient descent is a universal choice for optimizing deep learning models. However, it is not the only option. With bl

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 6y ago

Meta Reinforcement Learning

In my earlier post on meta-learning , the problem is mainly defined in the context of few-shot classificati

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 6y ago

Domain Randomization for Sim2Real Transfer

In Robotics, one of the hardest problems is how to make your model transfer to the real world. Due to the sample inefficiency of deep RL algorithms and the cost

Lilian Weng's Blog ⚡ AI Lesson 7y ago

Are Deep Neural Networks Dramatically Overfitted?

[Updated on 2019-05-27: add the section on Lottery Ticket Hypothesis.] If you are like me, entering into the field of deep learning with experience in tradition

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 7y ago

Generalized Language Models

[Updated on 2019-02-14: add ULMFiT and GPT-2 .] [Updated on 2020-02-29: add ALBERT .] <span class="updat

Lilian Weng's Blog 👁️ Computer Vision ⚡ AI Lesson 7y ago

Object Detection Part 4: Fast Detection Models

In Part 3 , we have reviewed models in the R-CNN family. All of them are region-based object detection algorithms. They can achieve hig

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 7y ago

Meta-Learning: Learning to Learn Fast

[Updated on 2019-10-01: thanks to Tianhao, we have

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 7y ago

Flow-based Deep Generative Models

So far, I’ve written about two types of generative models, GAN and VAE . Neither of them explicitly learns the probability density function of real da

Lilian Weng's Blog 7y ago

From Autoencoder to Beta-VAE

[Updated on 2019-07-18: add a section on VQ-VAE & VQ-VAE-2 .] [Updat

Lilian Weng's Blog ⚡ AI Lesson 7y ago

Attention? Attention!

[Updated on 2018-10-28: Add Pointer Network and the link to my implementation of Transformer.] <span class

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 7y ago

Implementing Deep Reinforcement Learning Models with Tensorflow + OpenAI Gym

The full implementation is available in lilianweng/deep-reinforcement-learning-gym In the previous two posts, I have introduced the algorithms of many deep rein

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 7y ago

Policy Gradient Algorithms

[Updated on 2018-06-30: add two new policy gradient methods, SAC and D4PG .] [Updated on 2018-09-30: ad

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 8y ago

A (Long) Peek into Reinforcement Learning

[Updated on 2020-09-03: Updated the algorithm of SARSA and Q-learning so that the diffe

Lilian Weng's Blog ⚡ AI Lesson 8y ago

The Multi-Armed Bandit Problem and Its Solutions

The algorithms are implemented for Bernoulli bandit in lilianweng/multi-armed-bandit . Exploitation vs Exploration The exploration vs exploitation dilemma exist

Lilian Weng's Blog 👁️ Computer Vision ⚡ AI Lesson 8y ago

Object Detection for Dummies Part 3: R-CNN Family

[Updated on 2018-12-20: Remove YOLO here. Part 4 will cover multiple fast object detection algorithms, including YOLO.] [Updated on 2018-12-27: Add bbox regress

Lilian Weng's Blog 👁️ Computer Vision ⚡ AI Lesson 8y ago

Object Detection for Dummies Part 2: CNN, DPM and Overfeat

Part 1 of the “Object Detection for Dummies” series introduced: (1) the concept of image gradient vector and how HOG algorithm summarizes the inform

Lilian Weng's Blog 👁️ Computer Vision ⚡ AI Lesson 8y ago

Object Detection for Dummies Part 1: Gradient Vector, HOG, and SS

I’ve never worked in the field of

Lilian Weng's Blog ⚡ AI Lesson 8y ago

Learning Word Embedding

Human vocabulary comes in free text. In ord

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 8y ago

Anatomize Deep Learning with Information Theory

Professor Naftali Tishby passed away in 2021. Hope the post can introduce his cool idea of information bottleneck to more people. Recently I watched the talk &l

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 8y ago

From GAN to WGAN

[Updated on 2018-09-30: thanks to Yoonju, we have this post translated in Korean !] [

Lilian Weng's Blog ⚡ AI Lesson 8y ago

How to Explain the Prediction of a Machine Learning Model?

The machine learning models have started penetrating into critical areas like health care, justice systems, and financial industry. Thus to figure out how the m

Lilian Weng's Blog ⚡ AI Lesson 8y ago

Predict Stock Prices Using RNN: Part 2

In the Part 2 tutorial, I would like to continue the topic on stock price prediction and to endow the recurrent neural network that I have built in <a href="htt

📰 Lilian Weng's Blog