Chat GPT Rewards Model Explained!
How does Reinforcement learning come into play with ChatGPT?
ABOUT ME
โญ Subscribe: https://www.youtube.com/c/CodeEmporium?sub_confirmation=1
๐ Medium Blog: https://medium.com/@dataemporium
๐ป Github: https://github.com/ajhalthor
๐ LinkedIn: https://www.linkedin.com/in/ajay-halthor-477974bb/
Transformer Neural Networks: https://www.youtube.com/watch?v=TQQlZhbC5ps
RESOURCES
[1] ChatGPT blog: https://openai.com/blog/chatgpt/
[2] Instruct GPT which is the model ChatGPT was modeled after: https://arxiv.org/pdf/2203.02155.pdf
[3] Likert Scale: https://www.youtube.com/watch?v=Tf_71r1Ve5w
[4] Main paper behind nucleus sampling: https://arxiv.org/pdf/1904.09751.pdf
[5] PPO algorithm: https://openai.com/blog/openai-baselines-ppo/
[6] PPO algorithms (main paper): https://arxiv.org/pdf/1707.06347.pdf
[7] GPT original paper: https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf
[8] GPT-3 paper: https://arxiv.org/pdf/2005.14165.pdf
[9] Nice article by hugging face: https://huggingface.co/blog/rlhf
MATH COURSES (7 day free trial)
๐ Mathematics for Machine Learning: https://imp.i384100.net/MathML
๐ Calculus: https://imp.i384100.net/Calculus
๐ Statistics for Data Science: https://imp.i384100.net/AdvancedStatistics
๐ Bayesian Statistics: https://imp.i384100.net/BayesianStatistics
๐ Linear Algebra: https://imp.i384100.net/LinearAlgebra
๐ Probability: https://imp.i384100.net/Probability
OTHER RELATED COURSES (7 day free trial)
๐ โญ Deep Learning Specialization: https://imp.i384100.net/Deep-Learning
๐ Python for Everybody: https://imp.i384100.net/python
๐ MLOps Course: https://imp.i384100.net/MLOps
๐ Natural Language Processing (NLP): https://imp.i384100.net/NLP
๐ Machine Learning in Production: https://imp.i384100.net/MLProduction
๐ Data Science Specialization: https://imp.i384100.net/DataScience
๐ Tensorflow: https://imp.i384100.net/Tensorflow
Watch on YouTube โ
(saves to browser)
Sign in to unlock AI tutor explanation ยท โก30
Playlist
Uploads from CodeEmporium ยท CodeEmporium ยท 0 of 60
โ Previous
Next โ
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Linear Regression and Multiple Regression
CodeEmporium
Logistic Regression - THE MATH YOU SHOULD KNOW!
CodeEmporium
Generative Adversarial Networks - FUTURISTIC & FUN AI !
CodeEmporium
Deep Learning on the Cloud - GPU TO LEARN FASTER
CodeEmporium
Deep Mind's AlphaGo Zero - EXPLAINED
CodeEmporium
Mask Region based Convolution Neural Networks - EXPLAINED!
CodeEmporium
Attention in Neural Networks
CodeEmporium
Depthwise Separable Convolution - A FASTER CONVOLUTION!
CodeEmporium
One Neural network learns EVERYTHING ?!
CodeEmporium
Neural Voice Cloning
CodeEmporium
AI creates Image Classifiersโฆby DRAWING?
CodeEmporium
Unpaired Image-Image Translation using CycleGANs
CodeEmporium
K-Means Clustering - EXPLAINED!
CodeEmporium
Random Forest Classification
CodeEmporium
Data Science in Finance
CodeEmporium
Hypothesis testing with Applications in Data Science
CodeEmporium
A/B Testing - Simply Explained
CodeEmporium
The Kernel Trick - THE MATH YOU SHOULD KNOW!
CodeEmporium
Support Vector Machines - THE MATH YOU SHOULD KNOW
CodeEmporium
Principal Component Analysis (PCA) - THE MATH YOU SHOULD KNOW!
CodeEmporium
History of Calculus - Animated
CodeEmporium
Curiosity in AI
CodeEmporium
DropBlock - A BETTER DROPOUT for Neural Networks
CodeEmporium
Autoencoders - EXPLAINED
CodeEmporium
Recurrent Neural Networks - EXPLAINED!
CodeEmporium
LSTM Networks - EXPLAINED!
CodeEmporium
Building an Image Captioner with Neural Networks
CodeEmporium
10 Machine Learning Questions - ANSWERED!
CodeEmporium
How do neural networks work?
CodeEmporium
Evolution of Face Generation | Evolution of GANs
CodeEmporium
How does Google Translate's AI work?
CodeEmporium
How to keep up with AI research?
CodeEmporium
How does YouTube recommend videos? - AI EXPLAINED!
CodeEmporium
Variational Autoencoders - EXPLAINED!
CodeEmporium
Logistic Regression - VISUALIZED!
CodeEmporium
Gradient Descent - THE MATH YOU SHOULD KNOW
CodeEmporium
Boosting - EXPLAINED!
CodeEmporium
Transformer Neural Networks - EXPLAINED! (Attention is all you need)
CodeEmporium
Loss Functions - EXPLAINED!
CodeEmporium
Optimizers - EXPLAINED!
CodeEmporium
NLP with Neural Networks & Transformers
CodeEmporium
Batch Normalization - EXPLAINED!
CodeEmporium
Activation Functions - EXPLAINED!
CodeEmporium
Data Scientist Answers Interview Questions
CodeEmporium
Why use GPU with Neural Networks?
CodeEmporium
How do GPUs speed up Neural Network training?
CodeEmporium
BERT Neural Network - EXPLAINED!
CodeEmporium
ConvNets Scaled Efficiently
CodeEmporium
Transformer Neural Net makes music! (JukeboxAI)
CodeEmporium
What do filters of Convolution Neural Network learn?
CodeEmporium
We're hosting a Machine Learning Conference!
CodeEmporium
MLconfEU 2020: Machine Learning Conference for Software Engineers
CodeEmporium
Are Neural Networks Intelligent?
CodeEmporium
Time Series Forecasting with Machine Learning
CodeEmporium
Few Shot Learning - EXPLAINED!
CodeEmporium
How does a Data Scientist Fight FRAUD?
CodeEmporium
How would a Data Scientist analyze Customer Churn?
CodeEmporium
Expectations with Machine Learning
CodeEmporium
Why Logistic Regression DOESN'T return probabilities?!
CodeEmporium
How you SHOULD code Machine Learning
CodeEmporium
More on: RL Foundations
View skill โRelated AI Lessons
โก
โก
โก
โก
7 Common Java Streams Mistakes and How to Avoid Them
Medium ยท Programming
Implementing an Item-Based Recommendation System from Scratch in Python
Medium ยท Machine Learning
The Threshold Is a Business Decision, Not a Statistical One
Medium ยท Machine Learning
Can Your Stress Level Predict How Much You Sleep?
Medium ยท Machine Learning
๐
Tutor Explanation
DeepCamp AI