OpenAI announces FINETUNING ๐Ÿ‘€ for ChatGPT

Wes Roth ยท Intermediate ยท๐Ÿง  Large Language Models ยท2y ago

Key Takeaways

The video discusses the concept of fine-tuning in LLMs, its advantages and disadvantages, and its applications in various use cases, including legal research, prompt engineering, and cost reduction, with a focus on OpenAI's announcement of fine-tuning for ChatGPT.

Full Transcript

so open AI just announced that users will be able to fine-tune their GPT 3.5 model and looks like the ability to fine-tune gpt4 will be coming in a few months according to open AI early test I've shown that a fine-tuned version of GPT 3.5 turbo can match or even outperform base gpt4 level capabilities on certain narrow tasks so what is fine tuning you remember that movie Rain Man one thing I was really good at math he was really fast at calculating stuff had this like Supernatural ability to remember numbers but he kind of struggled with other tasks I kind of think of fine-tuning as a little bit like that basically fine-tuning means specializing a model to some specific task if you want a customer service chat bot that answers questions about your particular product you can fine tune a model to do that if you want a computer game character a non-player character that's that feels like he's really part of that world you can fine tune a model to do that now fine-tuning can have some downsides basically you should assume that there's going to be some in degradation of some of the abilities outside of what you're fine-tuning for outside of what it's specialized to do for example if you fine-tune a model to answer as if it was an orc chieftain in your Dungeons and Dragons game it may lose its ability to argue the finer points of post-colonial feminism so what are the advantages of fine-tuning a model well there are quite a few one is it can enable a very custom very controlled experience if you need the llm to say specific things to not break character to not say things like as a large language model well then fine-tuning allows you to custom shape what it says and does as openai puts it improved durability reliable output formatting and custom tone this can be extra important places where for example you're doing some sort of code completion or composing API calls you want Chad GPT GPT 3.5 whatever to Output the code in a very specific way you don't want it starting with like oh sure I can help with that and then doing the code you wanted just a code or whatever format you're looking for or if it can't do it you want to throw up some specific air message then you can kind of flag but you don't want it making up something brand new to answer your prompt the other big Advantage is the cost so you can slash the cost of using the llm for your particular tasks now it's important to note that the actual cost per 1000 tokens is going to be bigger than on the base models it's six to eight times higher than the base model but since you no longer have to give it you know let's say multiple examples you don't have to teach it to Output the proper format every single time there's going to be some use cases where using the fine-tune models is going to save a lot of money so where's this going to be used well customer service is going to be huge email chat Bots Etc this can be used very effectively and you have to worry about it making up some offensive crap on the spot and making your customers mad things like language translation you can force it to respond in a specific language so whatever the prompt is it responds in that language and it translates that and translates that prompt into whatever language you want it to so for Education this will be massive for things like tutoring for learning for code searching for therapy stuff like that or gaming like remember our evil orc that's hopelessly stuck in his ways the entire backstory of their own could be given to all the characters and then each would be ready to respond in character individually but they would all have sort of like the same backstory if you want them all to know something like this land is getting invaded they would all know about it so they can respond appropriately to any questions that you might ask fine-tuning can be used in legal research for example remember that Lord I used Chad between chord and it cited a bunch of fake cases he got into a lot of trouble for that fine tuning would be able to help with that one thing where I think personally this is going to be used is where you know if you have certain AI agents where you have multiple instances of Chad gbt that are kind of working together to achieve a mission for example you have one that's making decisions one that's writing little scripts maybe you have one as the base model and one as a fine-tuned model that's specialized in doing that specific thing that he needs to now now that I think about it almost certain that we're going to see a model that is fine-tuned to produce prompts for another model for another base model basically something that you teach to Just Produce like the perfect prompts every single time based on whatever input you have that you needed to take into consideration so two couple other places where fine tuning can help according to open AI high quality results then regular prompting ability to train on more examples that can fit into a prompt so this is interesting because so basically oftentimes when we're using these llm models there's something that is referred to as for example few shot learning or zero shot learning shot meets examples basically so few shot means you give it some examples as what you want the output to look like and sometimes a few will do but sometimes you get better responses when you provide a hundred two hundred a thousand different examples for it to go off of so that's one way to kind of get around it basically you can fine tune a model and use you know a thousand examples to really fine tune exactly what you want the output to be so token saving is due to Shorter prompts again you're training it it's almost like custom instructions but for like the whole model and so so there's you're using less input tokens you're using less output token and lower latency requests so basically you're getting the answer back faster so here they're saying how fine-tuning can be good for specific applications but that's not always worth the time and effort they're saying first get good with prompt engineering prompt chaining breaking complex tasks into multiple prompt multiple prompts excuse me and function calling by the way function calling is not going to be available for training these or fine-tuning these models so their main tasks for which our models May initially appear to not perform well but with better prompting can achieve better results iterating over prompts and other tactics has a much faster feedback loop than iterating with fine tuning and so the use cases where they think it's going to be very good so specifically when you when you have to set the style tone format Etc like kind of we talked about improving reliability so for customer service where you really don't want to say the wrong thing correcting failures to follow complex prompts handling many edge cases in very specific ways and performing a new scalar task that's hard to articulate in a prompt this might be something like trying to copy a certain Style of writing like if somebody has a very specific style you know the more sort of examples you give it the better the outputs is likely going to be but that might be difficult with sort of the base model so something like this you can give it I assume you can probably give it a whole book's worth of writing styles and then train on that one high level way to think about these cases is when it's easier to show not tell and another scenario is where you need to reduce costs in our latency without sacrificing quality and then they have a little bit of a fine-tuning steps kind of like a brief tutorial so basically where we prepare the data we're going to be feeding it upload the files using your API key we're going to create the training file and and then we're going to be using that sort of new model that we created with our with our ID and so I think a lot of this is going to depend on the cost structure of this so you're paying let's say seven times more per thousand tokens but you're saving money on how much tokens are going back and forth potentially so it sounds like there's I mean obviously there's gonna be some use cases where this is going to be great and for a lot of the other stuff might not be so I feel like I don't see a current knee need for me to use anything like this right now but maybe maybe in the future there will be some specific case where this would work obviously if this was much cheaper than running the base model this would be a big deal like if you can train the model to do the thing that you wanted to do and have it cost a lot less than you know let's say gbt4 that would be excellent but that's not in the cards right now unfortunately but I'm sure as we see more and more people testing this thing out and you know publishing their use cases we're going to see specific use cases where we're gonna be like of course this is great for that why didn't we think of that curiously know what you think if you have something that you're going to use it for where you think it's going to be like the perfect use for fine tuning instead of the base model please let me know and again that could be 4gbt 5 3.5 turbo or later for gpt4 and they also have two sort of of the much cheaper models DaVinci Dash 002 babbage002 whereas you can see here the cost per thousand tokens is much much less than sort of like all the other ones so maybe if you have some specific use cases for that I'd love to hear it let me know in the comments and my name is Wes Roth and thank you for watching

Original Description

#openai #gpt4 #ai ๐Ÿ”ฅ The Smartest A.I. Newsletter Ever. https://natural20.com/
Watch on YouTube โ†— (saves to browser)
Sign in to unlock AI tutor explanation ยท โšก30

Playlist

Uploads from Wes Roth ยท Wes Roth ยท 56 of 60

1 Which Vanguard index fund to buy? (hint: it's the one Warren Buffett recommends)
Which Vanguard index fund to buy? (hint: it's the one Warren Buffett recommends)
Wes Roth
2 What does PALANTIR do - Palantir Stock, Founder, Controversy Explained Simply (plus why I'm BUYING).
What does PALANTIR do - Palantir Stock, Founder, Controversy Explained Simply (plus why I'm BUYING).
Wes Roth
3 Paypal misinformation fine ($2,500) - Close Your Accounts ASAP!
Paypal misinformation fine ($2,500) - Close Your Accounts ASAP!
Wes Roth
4 China Was Just Sent Back to the Dark Ages  |  US starts aggressively cutting ties
China Was Just Sent Back to the Dark Ages | US starts aggressively cutting ties
Wes Roth
5 ChatGPT Business Ideas - How I Use ChatGPT to make money
ChatGPT Business Ideas - How I Use ChatGPT to make money
Wes Roth
6 ChatGPT Explained - The AI revolution is happening right now... [ chat gpt ]
ChatGPT Explained - The AI revolution is happening right now... [ chat gpt ]
Wes Roth
7 ChatGPT Banned - New York blocking network access to ChatGPT
ChatGPT Banned - New York blocking network access to ChatGPT
Wes Roth
8 ChatGPT Trading - this [INSANE] tool A.I. built for me
ChatGPT Trading - this [INSANE] tool A.I. built for me
Wes Roth
9 Small Business Grants for ChatGPT and A.I. (similar to PPP and EIDL in 2023) |
Small Business Grants for ChatGPT and A.I. (similar to PPP and EIDL in 2023) |
Wes Roth
10 How to Make Passive Income with ChatGPT AI
How to Make Passive Income with ChatGPT AI
Wes Roth
11 OpenAIโ€™s GPT-4 Artificial Intelligence = AGI? TRILLIONS of Parameters Plus THIS
OpenAIโ€™s GPT-4 Artificial Intelligence = AGI? TRILLIONS of Parameters Plus THIS
Wes Roth
12 How Nvidia AI Robot Trained 42 Years In 32 Hours And Did THIS | Google DeepMind AlphaCode
How Nvidia AI Robot Trained 42 Years In 32 Hours And Did THIS | Google DeepMind AlphaCode
Wes Roth
13 John Carmack | AGI by 2030 | Will John Carmack's AI company be the one to make it?
John Carmack | AGI by 2030 | Will John Carmack's AI company be the one to make it?
Wes Roth
14 AI Small Business Grants
AI Small Business Grants
Wes Roth
15 Elon Musk attacks OpenAI - here's Sam Altman's response
Elon Musk attacks OpenAI - here's Sam Altman's response
Wes Roth
16 Bill Gates on ChatGPT and OpenAI "The Age of AI has begun"
Bill Gates on ChatGPT and OpenAI "The Age of AI has begun"
Wes Roth
17 Sparks of AGI | Microsoft Researchers claim GPT-4 Is showing "Artificial General Intelligence"
Sparks of AGI | Microsoft Researchers claim GPT-4 Is showing "Artificial General Intelligence"
Wes Roth
18 Elon Musk and Others Call for Pause on AI as GPT-4 shows signs of AGI.
Elon Musk and Others Call for Pause on AI as GPT-4 shows signs of AGI.
Wes Roth
19 Comparing GPT-4 and Google's Bard AI - Who is getting closer to AGI?
Comparing GPT-4 and Google's Bard AI - Who is getting closer to AGI?
Wes Roth
20 Sam Altman on UBI, OpenAI to $100 TRILLION and Massive Job Losses from AI Automation
Sam Altman on UBI, OpenAI to $100 TRILLION and Massive Job Losses from AI Automation
Wes Roth
21 25 ChatGPTs play a videogame...
25 ChatGPTs play a videogame...
Wes Roth
22 NVIDIA's new AI: Better Games, Art and... better life?
NVIDIA's new AI: Better Games, Art and... better life?
Wes Roth
23 Google AI Documents Leak about "Google and OpenAI"
Google AI Documents Leak about "Google and OpenAI"
Wes Roth
24 PaLM 2 vs GPT-4 | why Google is having a hard time catching up...
PaLM 2 vs GPT-4 | why Google is having a hard time catching up...
Wes Roth
25 How To Access ChatGPT Plugins | They are LIVE! (but hidden)
How To Access ChatGPT Plugins | They are LIVE! (but hidden)
Wes Roth
26 Sam Altman to Congress "America HAS to lead the world in AI"...
Sam Altman to Congress "America HAS to lead the world in AI"...
Wes Roth
27 Sam Altman Opening Statement to Congress on AI Regulation
Sam Altman Opening Statement to Congress on AI Regulation
Wes Roth
28 Sam Altman Congress Hearing "AI is the Biggest Threat to Human Race"
Sam Altman Congress Hearing "AI is the Biggest Threat to Human Race"
Wes Roth
29 Tree of Thoughts - GPT-4 Reasoning is Improved 900%
Tree of Thoughts - GPT-4 Reasoning is Improved 900%
Wes Roth
30 Governance of Superintelligence | OpenAI proposes measures for safe AI development.
Governance of Superintelligence | OpenAI proposes measures for safe AI development.
Wes Roth
31 Model Evaluation For Extreme Risks of AI | Google DeepMind and OpenAI Paper
Model Evaluation For Extreme Risks of AI | Google DeepMind and OpenAI Paper
Wes Roth
32 Minecraft AI - NVIDIA uses GPT-4 to create a SELF-IMPROVING ๐Ÿคฏ autonomous agent.
Minecraft AI - NVIDIA uses GPT-4 to create a SELF-IMPROVING ๐Ÿคฏ autonomous agent.
Wes Roth
33 AI Human Extinction Risk - Experts Warn of "Serious Risk"
AI Human Extinction Risk - Experts Warn of "Serious Risk"
Wes Roth
34 LoRA - Low-rank Adaption of AI Large Language Models: LoRA and QLoRA Explained Simply
LoRA - Low-rank Adaption of AI Large Language Models: LoRA and QLoRA Explained Simply
Wes Roth
35 99.3% of ChatGPT Performance with OpenSource AI - [QLoRA paper]
99.3% of ChatGPT Performance with OpenSource AI - [QLoRA paper]
Wes Roth
36 AlphaFold2 Explained | Google's DeepMind Solves Protein Folding
AlphaFold2 Explained | Google's DeepMind Solves Protein Folding
Wes Roth
37 Illumina AI - ChatGPT for your genome...
Illumina AI - ChatGPT for your genome...
Wes Roth
38 Text to Video Invasion! Runway AI releases GEN 2 text to video.
Text to Video Invasion! Runway AI releases GEN 2 text to video.
Wes Roth
39 LLMs as Tool Makers [LATM] - GPT-4 *UPGRADES* lower AI Models.
LLMs as Tool Makers [LATM] - GPT-4 *UPGRADES* lower AI Models.
Wes Roth
40 AlphaDev - DeepMind AI Discovers Better Algorithms for Foundational Computing
AlphaDev - DeepMind AI Discovers Better Algorithms for Foundational Computing
Wes Roth
41 OpenAI GPT-4 Function Calling: *HUGE* Potential
OpenAI GPT-4 Function Calling: *HUGE* Potential
Wes Roth
42 GPT-4 leaked! ๐Ÿ”ฅ All details exposed ๐Ÿ”ฅ It is over...
GPT-4 leaked! ๐Ÿ”ฅ All details exposed ๐Ÿ”ฅ It is over...
Wes Roth
43 Elon Musk announced XAI - the answer to OpenAI = X.AI
Elon Musk announced XAI - the answer to OpenAI = X.AI
Wes Roth
44 Andrej Karpathy GPT - Advice for building AI agents
Andrej Karpathy GPT - Advice for building AI agents
Wes Roth
45 TEST TO SEE IF AI CAN MAKE $1,000,000   (modern Turing test)
TEST TO SEE IF AI CAN MAKE $1,000,000 (modern Turing test)
Wes Roth
46 ChatGPT custom instructions are *POWERFUL*  Replace AutoGPT and BabyAGI?
ChatGPT custom instructions are *POWERFUL* Replace AutoGPT and BabyAGI?
Wes Roth
47 WORLDCOIN LAUNCH is starting! Backed by Sam Altman of OpenAI.
WORLDCOIN LAUNCH is starting! Backed by Sam Altman of OpenAI.
Wes Roth
48 WORLDCOIN ORB - I went to L.A. to get my eye scanned for WorldCoin [my experience]
WORLDCOIN ORB - I went to L.A. to get my eye scanned for WorldCoin [my experience]
Wes Roth
49 The Biggest Week of AI News In Months!
The Biggest Week of AI News In Months!
Wes Roth
50 Google Deepmind RT 2 - Using LLMs to Build Thinking, Learning Robots
Google Deepmind RT 2 - Using LLMs to Build Thinking, Learning Robots
Wes Roth
51 AI News is Getting *WEIRD* Human Brain Matter in Chips. OpenAI tutorial. Amazon unleashed it's AI.
AI News is Getting *WEIRD* Human Brain Matter in Chips. OpenAI tutorial. Amazon unleashed it's AI.
Wes Roth
52 GPT 5 release date ๐Ÿ”ฅ might be closer than we think | OpenAI applies for GPT-5 Trademark in the US.
GPT 5 release date ๐Ÿ”ฅ might be closer than we think | OpenAI applies for GPT-5 Trademark in the US.
Wes Roth
53 AI Agents Simulate a Town ๐Ÿคฏ Generative Agents: Interactive Simulacra of Human Behavior.
AI Agents Simulate a Town ๐Ÿคฏ Generative Agents: Interactive Simulacra of Human Behavior.
Wes Roth
54 Proof that AI Understands? ๐Ÿ‘€ Andrew Ng on LLMs building mental models, Othello GPT,  Geoffrey Hinton
Proof that AI Understands? ๐Ÿ‘€ Andrew Ng on LLMs building mental models, Othello GPT, Geoffrey Hinton
Wes Roth
55 OpenAI acquires Biomes ๐Ÿ‘€ an open-source MMORPG. ChatGPT plus Minecraft? ๐Ÿ”ฅ
OpenAI acquires Biomes ๐Ÿ‘€ an open-source MMORPG. ChatGPT plus Minecraft? ๐Ÿ”ฅ
Wes Roth
โ–ถ OpenAI announces FINETUNING ๐Ÿ‘€ for ChatGPT
OpenAI announces FINETUNING ๐Ÿ‘€ for ChatGPT
Wes Roth
57 Autonomous AI Agents - why YOU should be building them... and HOW.
Autonomous AI Agents - why YOU should be building them... and HOW.
Wes Roth
58 ChatGPT Enterprise - OpenAI launches the next BIG thing
ChatGPT Enterprise - OpenAI launches the next BIG thing
Wes Roth
59 HOODWINKED -  AI gets away with MURDER ๐Ÿ‘€ GPT-4 is an effective killer...
HOODWINKED - AI gets away with MURDER ๐Ÿ‘€ GPT-4 is an effective killer...
Wes Roth
60 Install Open Interpreter in 2 min | The free, open source CODE INTERPRETER!
Install Open Interpreter in 2 min | The free, open source CODE INTERPRETER!
Wes Roth

The video teaches the concept of fine-tuning in LLMs and its applications in various use cases, with a focus on OpenAI's announcement of fine-tuning for ChatGPT. Fine-tuning can be used to specialize a model to a specific task, improve durability and reliability, and reduce costs and latency. However, it can also have downsides, such as degradation of abilities outside of the fine-tuned task.

Key Takeaways
  1. Define the task for fine-tuning
  2. Choose the LLM model to fine-tune
  3. Prepare the training data
  4. Fine-tune the model
  5. Test and evaluate the fine-tuned model
  6. Deploy the fine-tuned model in the desired application
  7. Monitor and maintain the fine-tuned model
๐Ÿ’ก Fine-tuning can be used to slash the cost of using an LLM for a particular task, but the cost per 1000 tokens is higher for fine-tuned models.
๐Ÿ”’ Pro feature: Ask AI to explain this lesson โ†’

Related Reads

๐Ÿ“ฐ
Sonnet 5 vs GLM-5.2 vs everyone: how to pick the cheapest LLM API in 2026
Learn how to choose the cheapest LLM API in 2026 by comparing pricing models of Sonnet 5 and GLM-5.2
Dev.to AI
๐Ÿ“ฐ
JSON-Schema masks can block needed tool calls
Learn how JSON-Schema masks can block needed tool calls in LLM agents and how to sidestep the problem with a two-pass inference hack
Dev.to AI
๐Ÿ“ฐ
The Invisible Cage: What the Evolution from Claude Sonnet 4.6
Explore the evolution of AI models like Claude Sonnet 4.6 and its implications on AI development
Medium ยท AI
๐Ÿ“ฐ
The Best Vector Database in 2026: Qdrant vs Pinecone vs Weaviate vs Milvus vs pgvector
Learn how to choose the best vector database for your RAG system in 2026, comparing Qdrant, Pinecone, Weaviate, Milvus, and pgvector
Dev.to ยท Darshit Radadiya
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch โ†’