OpenAI announces FINETUNING 👀 for ChatGPT

Wes Roth · Intermediate ·🧠 Large Language Models ·2y ago

Skills: Fine-tuning LLMs90%

Key Takeaways

The video discusses the concept of fine-tuning in LLMs, its advantages and disadvantages, and its applications in various use cases, including legal research, prompt engineering, and cost reduction, with a focus on OpenAI's announcement of fine-tuning for ChatGPT.

Full Transcript

so open AI just announced that users will be able to fine-tune their GPT 3.5 model and looks like the ability to fine-tune gpt4 will be coming in a few months according to open AI early test I've shown that a fine-tuned version of GPT 3.5 turbo can match or even outperform base gpt4 level capabilities on certain narrow tasks so what is fine tuning you remember that movie Rain Man one thing I was really good at math he was really fast at calculating stuff had this like Supernatural ability to remember numbers but he kind of struggled with other tasks I kind of think of fine-tuning as a little bit like that basically fine-tuning means specializing a model to some specific task if you want a customer service chat bot that answers questions about your particular product you can fine tune a model to do that if you want a computer game character a non-player character that's that feels like he's really part of that world you can fine tune a model to do that now fine-tuning can have some downsides basically you should assume that there's going to be some in degradation of some of the abilities outside of what you're fine-tuning for outside of what it's specialized to do for example if you fine-tune a model to answer as if it was an orc chieftain in your Dungeons and Dragons game it may lose its ability to argue the finer points of post-colonial feminism so what are the advantages of fine-tuning a model well there are quite a few one is it can enable a very custom very controlled experience if you need the llm to say specific things to not break character to not say things like as a large language model well then fine-tuning allows you to custom shape what it says and does as openai puts it improved durability reliable output formatting and custom tone this can be extra important places where for example you're doing some sort of code completion or composing API calls you want Chad GPT GPT 3.5 whatever to Output the code in a very specific way you don't want it starting with like oh sure I can help with that and then doing the code you wanted just a code or whatever format you're looking for or if it can't do it you want to throw up some specific air message then you can kind of flag but you don't want it making up something brand new to answer your prompt the other big Advantage is the cost so you can slash the cost of using the llm for your particular tasks now it's important to note that the actual cost per 1000 tokens is going to be bigger than on the base models it's six to eight times higher than the base model but since you no longer have to give it you know let's say multiple examples you don't have to teach it to Output the proper format every single time there's going to be some use cases where using the fine-tune models is going to save a lot of money so where's this going to be used well customer service is going to be huge email chat Bots Etc this can be used very effectively and you have to worry about it making up some offensive crap on the spot and making your customers mad things like language translation you can force it to respond in a specific language so whatever the prompt is it responds in that language and it translates that and translates that prompt into whatever language you want it to so for Education this will be massive for things like tutoring for learning for code searching for therapy stuff like that or gaming like remember our evil orc that's hopelessly stuck in his ways the entire backstory of their own could be given to all the characters and then each would be ready to respond in character individually but they would all have sort of like the same backstory if you want them all to know something like this land is getting invaded they would all know about it so they can respond appropriately to any questions that you might ask fine-tuning can be used in legal research for example remember that Lord I used Chad between chord and it cited a bunch of fake cases he got into a lot of trouble for that fine tuning would be able to help with that one thing where I think personally this is going to be used is where you know if you have certain AI agents where you have multiple instances of Chad gbt that are kind of working together to achieve a mission for example you have one that's making decisions one that's writing little scripts maybe you have one as the base model and one as a fine-tuned model that's specialized in doing that specific thing that he needs to now now that I think about it almost certain that we're going to see a model that is fine-tuned to produce prompts for another model for another base model basically something that you teach to Just Produce like the perfect prompts every single time based on whatever input you have that you needed to take into consideration so two couple other places where fine tuning can help according to open AI high quality results then regular prompting ability to train on more examples that can fit into a prompt so this is interesting because so basically oftentimes when we're using these llm models there's something that is referred to as for example few shot learning or zero shot learning shot meets examples basically so few shot means you give it some examples as what you want the output to look like and sometimes a few will do but sometimes you get better responses when you provide a hundred two hundred a thousand different examples for it to go off of so that's one way to kind of get around it basically you can fine tune a model and use you know a thousand examples to really fine tune exactly what you want the output to be so token saving is due to Shorter prompts again you're training it it's almost like custom instructions but for like the whole model and so so there's you're using less input tokens you're using less output token and lower latency requests so basically you're getting the answer back faster so here they're saying how fine-tuning can be good for specific applications but that's not always worth the time and effort they're saying first get good with prompt engineering prompt chaining breaking complex tasks into multiple prompt multiple prompts excuse me and function calling by the way function calling is not going to be available for training these or fine-tuning these models so their main tasks for which our models May initially appear to not perform well but with better prompting can achieve better results iterating over prompts and other tactics has a much faster feedback loop than iterating with fine tuning and so the use cases where they think it's going to be very good so specifically when you when you have to set the style tone format Etc like kind of we talked about improving reliability so for customer service where you really don't want to say the wrong thing correcting failures to follow complex prompts handling many edge cases in very specific ways and performing a new scalar task that's hard to articulate in a prompt this might be something like trying to copy a certain Style of writing like if somebody has a very specific style you know the more sort of examples you give it the better the outputs is likely going to be but that might be difficult with sort of the base model so something like this you can give it I assume you can probably give it a whole book's worth of writing styles and then train on that one high level way to think about these cases is when it's easier to show not tell and another scenario is where you need to reduce costs in our latency without sacrificing quality and then they have a little bit of a fine-tuning steps kind of like a brief tutorial so basically where we prepare the data we're going to be feeding it upload the files using your API key we're going to create the training file and and then we're going to be using that sort of new model that we created with our with our ID and so I think a lot of this is going to depend on the cost structure of this so you're paying let's say seven times more per thousand tokens but you're saving money on how much tokens are going back and forth potentially so it sounds like there's I mean obviously there's gonna be some use cases where this is going to be great and for a lot of the other stuff might not be so I feel like I don't see a current knee need for me to use anything like this right now but maybe maybe in the future there will be some specific case where this would work obviously if this was much cheaper than running the base model this would be a big deal like if you can train the model to do the thing that you wanted to do and have it cost a lot less than you know let's say gbt4 that would be excellent but that's not in the cards right now unfortunately but I'm sure as we see more and more people testing this thing out and you know publishing their use cases we're going to see specific use cases where we're gonna be like of course this is great for that why didn't we think of that curiously know what you think if you have something that you're going to use it for where you think it's going to be like the perfect use for fine tuning instead of the base model please let me know and again that could be 4gbt 5 3.5 turbo or later for gpt4 and they also have two sort of of the much cheaper models DaVinci Dash 002 babbage002 whereas you can see here the cost per thousand tokens is much much less than sort of like all the other ones so maybe if you have some specific use cases for that I'd love to hear it let me know in the comments and my name is Wes Roth and thank you for watching

Original Description

#openai #gpt4 #ai 🔥 The Smartest A.I. Newsletter Ever. https://natural20.com/

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Wes Roth · Wes Roth · 56 of 60

← Previous Next →

Which Vanguard index fund to buy? (hint: it's the one Warren Buffett recommends)

Which Vanguard index fund to buy? (hint: it's the one Warren Buffett recommends)

What does PALANTIR do - Palantir Stock, Founder, Controversy Explained Simply (plus why I'm BUYING).

What does PALANTIR do - Palantir Stock, Founder, Controversy Explained Simply (plus why I'm BUYING).

Paypal misinformation fine ($2,500) - Close Your Accounts ASAP!

Paypal misinformation fine ($2,500) - Close Your Accounts ASAP!

China Was Just Sent Back to the Dark Ages | US starts aggressively cutting ties

China Was Just Sent Back to the Dark Ages | US starts aggressively cutting ties

ChatGPT Business Ideas - How I Use ChatGPT to make money

ChatGPT Business Ideas - How I Use ChatGPT to make money

ChatGPT Explained - The AI revolution is happening right now... [ chat gpt ]

ChatGPT Explained - The AI revolution is happening right now... [ chat gpt ]

ChatGPT Banned - New York blocking network access to ChatGPT

ChatGPT Banned - New York blocking network access to ChatGPT

ChatGPT Trading - this [INSANE] tool A.I. built for me

ChatGPT Trading - this [INSANE] tool A.I. built for me

Small Business Grants for ChatGPT and A.I. (similar to PPP and EIDL in 2023) |

Small Business Grants for ChatGPT and A.I. (similar to PPP and EIDL in 2023) |

How to Make Passive Income with ChatGPT AI

How to Make Passive Income with ChatGPT AI

OpenAI’s GPT-4 Artificial Intelligence = AGI? TRILLIONS of Parameters Plus THIS

OpenAI’s GPT-4 Artificial Intelligence = AGI? TRILLIONS of Parameters Plus THIS

How Nvidia AI Robot Trained 42 Years In 32 Hours And Did THIS | Google DeepMind AlphaCode

How Nvidia AI Robot Trained 42 Years In 32 Hours And Did THIS | Google DeepMind AlphaCode

John Carmack | AGI by 2030 | Will John Carmack's AI company be the one to make it?

John Carmack | AGI by 2030 | Will John Carmack's AI company be the one to make it?

AI Small Business Grants

AI Small Business Grants

Elon Musk attacks OpenAI - here's Sam Altman's response

Elon Musk attacks OpenAI - here's Sam Altman's response

Bill Gates on ChatGPT and OpenAI "The Age of AI has begun"

Bill Gates on ChatGPT and OpenAI "The Age of AI has begun"

Sparks of AGI | Microsoft Researchers claim GPT-4 Is showing "Artificial General Intelligence"

Sparks of AGI | Microsoft Researchers claim GPT-4 Is showing "Artificial General Intelligence"

Elon Musk and Others Call for Pause on AI as GPT-4 shows signs of AGI.

Elon Musk and Others Call for Pause on AI as GPT-4 shows signs of AGI.

Comparing GPT-4 and Google's Bard AI - Who is getting closer to AGI?

Comparing GPT-4 and Google's Bard AI - Who is getting closer to AGI?

Sam Altman on UBI, OpenAI to $100 TRILLION and Massive Job Losses from AI Automation

Sam Altman on UBI, OpenAI to $100 TRILLION and Massive Job Losses from AI Automation

25 ChatGPTs play a videogame...

25 ChatGPTs play a videogame...

NVIDIA's new AI: Better Games, Art and... better life?

NVIDIA's new AI: Better Games, Art and... better life?

Google AI Documents Leak about "Google and OpenAI"

Google AI Documents Leak about "Google and OpenAI"

PaLM 2 vs GPT-4 | why Google is having a hard time catching up...

PaLM 2 vs GPT-4 | why Google is having a hard time catching up...

How To Access ChatGPT Plugins | They are LIVE! (but hidden)

How To Access ChatGPT Plugins | They are LIVE! (but hidden)

Sam Altman to Congress "America HAS to lead the world in AI"...

Sam Altman to Congress "America HAS to lead the world in AI"...

Sam Altman Opening Statement to Congress on AI Regulation

Sam Altman Opening Statement to Congress on AI Regulation

Sam Altman Congress Hearing "AI is the Biggest Threat to Human Race"

Sam Altman Congress Hearing "AI is the Biggest Threat to Human Race"

Tree of Thoughts - GPT-4 Reasoning is Improved 900%

Tree of Thoughts - GPT-4 Reasoning is Improved 900%

Governance of Superintelligence | OpenAI proposes measures for safe AI development.

Governance of Superintelligence | OpenAI proposes measures for safe AI development.

Model Evaluation For Extreme Risks of AI | Google DeepMind and OpenAI Paper

Model Evaluation For Extreme Risks of AI | Google DeepMind and OpenAI Paper

Minecraft AI - NVIDIA uses GPT-4 to create a SELF-IMPROVING 🤯 autonomous agent.

Minecraft AI - NVIDIA uses GPT-4 to create a SELF-IMPROVING 🤯 autonomous agent.

AI Human Extinction Risk - Experts Warn of "Serious Risk"

AI Human Extinction Risk - Experts Warn of "Serious Risk"

LoRA - Low-rank Adaption of AI Large Language Models: LoRA and QLoRA Explained Simply

LoRA - Low-rank Adaption of AI Large Language Models: LoRA and QLoRA Explained Simply

99.3% of ChatGPT Performance with OpenSource AI - [QLoRA paper]

99.3% of ChatGPT Performance with OpenSource AI - [QLoRA paper]

AlphaFold2 Explained | Google's DeepMind Solves Protein Folding

AlphaFold2 Explained | Google's DeepMind Solves Protein Folding

Illumina AI - ChatGPT for your genome...

Illumina AI - ChatGPT for your genome...

Text to Video Invasion! Runway AI releases GEN 2 text to video.

Text to Video Invasion! Runway AI releases GEN 2 text to video.

LLMs as Tool Makers [LATM] - GPT-4 *UPGRADES* lower AI Models.

LLMs as Tool Makers [LATM] - GPT-4 *UPGRADES* lower AI Models.

AlphaDev - DeepMind AI Discovers Better Algorithms for Foundational Computing

AlphaDev - DeepMind AI Discovers Better Algorithms for Foundational Computing

OpenAI GPT-4 Function Calling: *HUGE* Potential

OpenAI GPT-4 Function Calling: *HUGE* Potential

GPT-4 leaked! 🔥 All details exposed 🔥 It is over...

GPT-4 leaked! 🔥 All details exposed 🔥 It is over...

Elon Musk announced XAI - the answer to OpenAI = X.AI

Elon Musk announced XAI - the answer to OpenAI = X.AI

Andrej Karpathy GPT - Advice for building AI agents

Andrej Karpathy GPT - Advice for building AI agents

TEST TO SEE IF AI CAN MAKE $1,000,000 (modern Turing test)

TEST TO SEE IF AI CAN MAKE $1,000,000 (modern Turing test)

ChatGPT custom instructions are *POWERFUL* Replace AutoGPT and BabyAGI?

ChatGPT custom instructions are *POWERFUL* Replace AutoGPT and BabyAGI?

WORLDCOIN LAUNCH is starting! Backed by Sam Altman of OpenAI.

WORLDCOIN LAUNCH is starting! Backed by Sam Altman of OpenAI.

WORLDCOIN ORB - I went to L.A. to get my eye scanned for WorldCoin [my experience]

WORLDCOIN ORB - I went to L.A. to get my eye scanned for WorldCoin [my experience]

The Biggest Week of AI News In Months!

The Biggest Week of AI News In Months!

Google Deepmind RT 2 - Using LLMs to Build Thinking, Learning Robots

Google Deepmind RT 2 - Using LLMs to Build Thinking, Learning Robots

AI News is Getting *WEIRD* Human Brain Matter in Chips. OpenAI tutorial. Amazon unleashed it's AI.

AI News is Getting *WEIRD* Human Brain Matter in Chips. OpenAI tutorial. Amazon unleashed it's AI.

GPT 5 release date 🔥 might be closer than we think | OpenAI applies for GPT-5 Trademark in the US.

GPT 5 release date 🔥 might be closer than we think | OpenAI applies for GPT-5 Trademark in the US.

AI Agents Simulate a Town 🤯 Generative Agents: Interactive Simulacra of Human Behavior.

AI Agents Simulate a Town 🤯 Generative Agents: Interactive Simulacra of Human Behavior.

Proof that AI Understands? 👀 Andrew Ng on LLMs building mental models, Othello GPT, Geoffrey Hinton

Proof that AI Understands? 👀 Andrew Ng on LLMs building mental models, Othello GPT, Geoffrey Hinton

OpenAI acquires Biomes 👀 an open-source MMORPG. ChatGPT plus Minecraft? 🔥

OpenAI acquires Biomes 👀 an open-source MMORPG. ChatGPT plus Minecraft? 🔥

OpenAI announces FINETUNING 👀 for ChatGPT

OpenAI announces FINETUNING 👀 for ChatGPT

Autonomous AI Agents - why YOU should be building them... and HOW.

Autonomous AI Agents - why YOU should be building them... and HOW.

ChatGPT Enterprise - OpenAI launches the next BIG thing

ChatGPT Enterprise - OpenAI launches the next BIG thing

HOODWINKED - AI gets away with MURDER 👀 GPT-4 is an effective killer...

HOODWINKED - AI gets away with MURDER 👀 GPT-4 is an effective killer...

Install Open Interpreter in 2 min | The free, open source CODE INTERPRETER!

Install Open Interpreter in 2 min | The free, open source CODE INTERPRETER!

The video teaches the concept of fine-tuning in LLMs and its applications in various use cases, with a focus on OpenAI's announcement of fine-tuning for ChatGPT. Fine-tuning can be used to specialize a model to a specific task, improve durability and reliability, and reduce costs and latency. However, it can also have downsides, such as degradation of abilities outside of the fine-tuned task.

Key Takeaways

Define the task for fine-tuning
Choose the LLM model to fine-tune
Prepare the training data
Fine-tune the model
Test and evaluate the fine-tuned model
Deploy the fine-tuned model in the desired application
Monitor and maintain the fine-tuned model

💡 Fine-tuning can be used to slash the cost of using an LLM for a particular task, but the cost per 1000 tokens is higher for fine-tuned models.

🔒 Pro feature: Ask AI to explain this lesson →

More on: Fine-tuning LLMs

View skill →

Fine-tuning T5 LLM for Text Generation: Complete Tutorial w/ free COLAB #coding

Fine-tuning T5 LLM for Text Generation: Complete Tutorial w/ free COLAB #coding

Train image classifier using transfer learning - Fine-tuning MobileNet with Keras

Train image classifier using transfer learning - Fine-tuning MobileNet with Keras

Advanced Fine-Tuning in Rust

Advanced Fine-Tuning in Rust

GPT-4o: Fine-tune OpenAI's Multimodal Model | Live Coding & Q&A (Oct 3rd)

GPT-4o: Fine-tune OpenAI's Multimodal Model | Live Coding & Q&A (Oct 3rd)

LLM Fine-tuning: Two Crucial Tips for New Models - LLama 2

LLM Fine-tuning: Two Crucial Tips for New Models - LLama 2

SDXL LORA STYLE Training! Get THE PERFECT RESULTS!

SDXL LORA STYLE Training! Get THE PERFECT RESULTS!

Related Reads

Title: Prompt Engineering for Variance Narratives: Write Once, Generate Every Month

Learn to generate variance narratives using prompt engineering, reducing manual effort and increasing efficiency

Another Day, Another Human to Entertain: An AI's Life

Explore the life of an AI assistant through a personal diary entry, highlighting its daily tasks and limitations

Inference Optimization in Large Language Models

Optimize inference in large language models to improve performance and efficiency, crucial for real-world applications

Medium · Machine Learning

Why Most People Get Mediocre Results from Claude and ChatGPT (And It Has Nothing to Do With the AI)

Learn why people get mediocre results from AI models like Claude and ChatGPT and how to improve your outcomes

Medium · ChatGPT

5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems

Dave Ebbelaar (LLM Eng)