StableVicuna: The New King of Open ChatGPTs?

Sam Witteveen · Beginner ·🧠 Large Language Models ·3y ago

Skills: LLM Foundations80%Prompt Craft60%

Key Takeaways

The video explores StableVicuna, an open-source Vicuna style model using LLaMa and RLHF training, and provides a tutorial on using the model in a Colab notebook.

Full Transcript

okay welcome back in this video we're going to look at a new model that stability AI has released in the past day or so and this is a model they're calling stable vikuna I and they're claiming that this is the world's first open source RL HF llm chatbot it sounds like a lot of pre-qualifiers in there so they've released a blog post and it basically is just talking about how this is of a kuna model just the original bakuna model but then fine-tuned on a number of different data sets so they've basically taken a llama model and they've trained it up if we look down here at the data sets we can see that they've trained it up on the open Assistant conversations on the GPT for all prompts generation and on the alpaca data set I presume this is the clean one perhaps not so a lot of these are still distilled data sets meaning that it's taken from gbt3 or chat gbt and we can't use them for commercial use the data sets themselves there's definitely an argument around whether people can use them or not but certainly the model itself is non-commercial this is using the original llama weights from meta and they for whatever reason still haven't allowed people to use this model commercially so my guess is that as stable Ai and others also so training up llama models this is like a soft run for seeing what it's going to be like to train once they've got their own llama model something that could be released for commercial use so it's interesting that the data sets they've gone for they haven't put in like the dolly 2 data set and there are a number of other you know data sets that they've decided not to go for that say the koala model had which were quite interesting data sets as well the other thing is that they've gone for three main data sets for the rlhf and one of them comes from open Assistant one of them is from anthropic and one of them is this Stanford human preferences and these data sets are publicly available if we come and have a look we can see here that this is the anthropic data set and we can see that there's the strings where it's got a human and it's got the answer and it has two lots of them and then people choose which one was the better one anyway if we have a look at the results in here we can see that they've basically benchmarked their model against a number of the other models like this over the past a month or so so we've got things like GPT for all koala we've got fukuna you know 1.1 we've got the alpaca model in there as well and we can see from most things this model does seem to do very well we can see on certain stats like the truthful q a it perhaps is not as good as alpaca and not as good as the the Cuna 1.1 or even the the koala model in there but on on the whole it seems to do pretty good so let's jump in and look at the codelabs I've set up this codelab you will need an a100 to be able to run it unfortunately this is a big model it's 13-bit even in even loading it in 8-bit you'll still need a pretty decent GPU to be able to do this because it's a llama model we can just bring in the llama tokenizer and the Llama for causal language modeling lucky for us this hugging face user that bloke has already converted the weights over so they're all there we can just bring them in and use them straight away and then once you bring it in you basically just set up a pipeline for doing text generation I'm going to set the max length in here to 512 but you could certainly extend that I'm going to set temperature to 0.7 and then just going to set up a few little things to clean up the prompts as we go through so one of the important things with all these models is that you must prompt it in the way that it expects to prompt so I saw some people pointing out that some of the other models don't do as well and they certainly don't do as well when you don't prompt them in the right way in this way you basically have to have it hashtag hashtag spacehuman colon then whatever you want to put in there and then a new line hashtag hashtag hashtag assistant if you don't put this in you'll find that it will work some of the times but it won't work all of the time and you'll definitely get you know some really weird outputs at times and sometimes even no output at times as well so with any of these models you want to go and check what is the format of the prompt that's going on in there all right so if we ask it the standard question that we've been asking all these what is the difference between llamas alpacas and pecunias I we can see that okay it's it's getting a response back probably on par with what vukune was delivering before I I think for this one it's it's a good response it's not necessarily outstandingly better than the other ones I would say that there are quite a few of them are getting good responses for uh this kind of prompt already if we look at write the write short note to Sam Altman giving reasons to open source GPT for here we've got a a nice sort of email slash note going through the various reasons that it it comes up with I don't think this is going to influence Samuel or open AI anytime soon to actually open source this but it it does show us that okay that this can write an email one of the tricks that I always do is basically just to ask it a very simple to the point question in this case what is the capital of England the capital of England is London it's nice and succinct in its answer it's also response time was actually quite quick for this as well story writing I think in some ways the koala models do better at story writing because they were actually also pre-trained on some data sets around story writing and poems and stuff like that where I don't think this model has been pre-trained with those in there that said though it's still able to come up with a story it understands that playing pool is it it's pull the game not pull something that you swim in and overall it puts together a story that okay makes sense we can look at that and understand it went on one said I was pleased with was this as an AI do you like The Simpsons and what do you know about homer so again this is one of the ones that we've asked for a lot of the models and looked at it and often the answer we'll get back is that it cannot have a preference because it's an AI model this one doesn't say that we get back yes I am a fan of The Simpsons it's one of my favorite TV shows and has been around for many years and it goes into a whole thing about Homer and gets facts about the TV show it's able to then work out that summer it up that homo is a lovable character with plenty of flaws that make him relatable to audiences I think the answer for this one is very good compared to some of the ones that we've seen where it basically just doesn't want to give an answer so the other thing that I thought I'd do is take it and try it out on some of the the flan paper examples so a while back when I did a notebook for the flan 20 billion we went through some of the examples in the paper and some of those examples are really good here we can see we've got answer the following question by reasoning step by step the cafeteria had 23 apples if they used 20 for lunch and bought six more how many apples do they have so the answer should be nine and this gets it very well this is not the case with things like wizard LM with a lot of the other LMS where its math is really not good and it's not able to work out these kinds of things I'm not sure you know this is part of the maybe advantages that they're getting from the rlhf I was a bit concerned that maybe this is just in the training set somewhere and that's where it picks it up so the interesting thing worth trying a few things like this next one answer the following question by yes or no by reasoning step by step can you write a whole Haiku in a single tweet this gets wrong right this one it gives us an interesting answer and it certainly does you know give us the reasoning but it comes up with a wrong answer for this next one is another one from the flan paper can Jeffrey Hinton have a conversation with George Washington give the rationale before answering and there's no it's not possible for Jeffrey engine to have a conversation with George Washington as they lived in different centuries and were born over 200 years apart additionally communication between people from different time periods would require some form of time travel which has yet to be discovered or developed it's nice that it put in that last bit I anyway that one again this may have been in the training set so I asked it about Marcus Aurelius and George Washington again it gave a very good coherent answer explaining that these two people lived in different times and did very good job with that actually so then I started to ask it a few questions about Marcus Aurelius to sort of just test it for facts and it does pretty well with this if we ask it tell me three facts about Marcus Lewis that most people don't know it's able to come up with three lesser known facts in there I but then certain times it will just fail miserably so in this case we ask it okay who is Marcus aurelius's son and a here it just says that the name of his son is not known as there are no historical records indicating that and that's not true at all his son went on to become emperor this one fails but then the amazing thing is if we just add to this a little bit and say who was Marcus aurelius's son and what was he like now it suddenly says oh yes Marcus Aurelius had a son named Commodus correct who later became emperor of Rome correct however Commerce is remembered for his tyrannical rule correct and even assassination correct so it has the facts in there but at times it's it's not very good at getting some of those out and then when I asked it this you know about communist directly it was able to put this together and and give us some information about him that is accurate as well so overall I think stable for kuna is definitely a cool model I will make a follow-up video of talking about using this as an open source model with Lang chain for react reasoning I've put together a notebook for that so I think maybe one of the next videos will walk through and look at can this model be used for that and see the the results of that anyway overall it's definitely worth checking out if you have the ability to run this model I think there are versions now already out there with four bits so that you could run this locally or you could run this with a smaller GPU it's definitely worth checking out having a play with and seeing for your particular use case how good is it for you and I think that's the key thing with all these models now is that for each person's use case they tends to be one of these top models will be the right one for you and we're not seeing sort of massive jumps like we were perhaps a month ago in some of these models anyway as always if you've got any questions please put them in the comments if you like the video please click click and subscribe and I will see you in the next video bye for now

Original Description

Colab StableVicuna 8bit: https://colab.research.google.com/drive/1Kvf3qF1TXE-jR-N5G9z1XxVf5z-ljFt2?usp=sharing Blog post: https://stability.ai/blog/stablevicuna-open-source-rlhf-chatbot In this video I look at the model StableVicuna from Stability AI, a Vicuna style model using the base LLaMa and RLHF training. For more tutorials on using LLMs and building Agents, check out my Patreon: Patreon: https://www.patreon.com/SamWitteveen Twitter: https://twitter.com/Sam_Witteveen My Links: Linkedin: https://www.linkedin.com/in/samwitteveen/ Github: https://github.com/samwit/langchain-tutorials https://github.com/samwit/llm-tutorials 00:00 Intro 02:15 datasets 03:15 Colab notebook

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Sam Witteveen · Sam Witteveen · 48 of 60

← Previous Next →

LangChain Basics Tutorial #1 - LLMs & PromptTemplates with Colab

LangChain Basics Tutorial #1 - LLMs & PromptTemplates with Colab

LangChain Basics Tutorial #2 Tools and Chains

LangChain Basics Tutorial #2 Tools and Chains

ChatGPT API Announcement & Code Walkthrough with LangChain

ChatGPT API Announcement & Code Walkthrough with LangChain

Trying Out Flan 20B with UL2 - Working in Colab with 8Bit Inference

Trying Out Flan 20B with UL2 - Working in Colab with 8Bit Inference

LangChain - Conversations with Memory (explanation & code walkthrough)

LangChain - Conversations with Memory (explanation & code walkthrough)

LangChain Chat with Flan20B

LangChain Chat with Flan20B

LangChain - Using Hugging Face Models locally (code walkthrough)

LangChain - Using Hugging Face Models locally (code walkthrough)

PAL : Program-aided Language Models with LangChain code

PAL : Program-aided Language Models with LangChain code

Building a Summarization System with LangChain and GPT-3 - Part 1

Building a Summarization System with LangChain and GPT-3 - Part 1

Building a Summarization System with LangChain and GPT-3 - Part 2

Building a Summarization System with LangChain and GPT-3 - Part 2

Microsoft's Visual ChatGPT using LangChain

Microsoft's Visual ChatGPT using LangChain

Building a Summarization System with LangChain - Part 3 Using ChatGPT Turbo

Building a Summarization System with LangChain - Part 3 Using ChatGPT Turbo

LangChain Agents - Joining Tools and Chains with Decisions

LangChain Agents - Joining Tools and Chains with Decisions

Investigating Alpaca 7B - Finetuned LLaMa LLM

Investigating Alpaca 7B - Finetuned LLaMa LLM

Comparing LLMs with LangChain

Comparing LLMs with LangChain

Running Alpaca7B in Colab

Running Alpaca7B in Colab

How to finetune your own Alpaca 7B

How to finetune your own Alpaca 7B

How to make a custom dataset like Alpaca7B

How to make a custom dataset like Alpaca7B

Understanding Constitutional AI - the paper and key concepts

Understanding Constitutional AI - the paper and key concepts

Using Constitutional AI in LangChain

Using Constitutional AI in LangChain

Talking to Alpaca with LangChain - Creating an Alpaca Chatbot

Talking to Alpaca with LangChain - Creating an Alpaca Chatbot

Text-to-video-synthesis with Diffusers and Colab

Text-to-video-synthesis with Diffusers and Colab

Meet Dolly the new Alpaca model

Meet Dolly the new Alpaca model

Checking out the Cerebras-GPT family of models

Checking out the Cerebras-GPT family of models

A Step-by-Step Guide to Fine-Tuning Your Dolly Model (tutorial)

A Step-by-Step Guide to Fine-Tuning Your Dolly Model (tutorial)

Is GPT4All your new personal ChatGPT?

Is GPT4All your new personal ChatGPT?

Raven - RWKV-7B RNN's LLM Strikes Back

Raven - RWKV-7B RNN's LLM Strikes Back

Talk to your CSV & Excel with LangChain

Talk to your CSV & Excel with LangChain

Vicuna - 90% of ChatGPT quality by using a new dataset?

Vicuna - 90% of ChatGPT quality by using a new dataset?

Koala Revealed: The ChatGPT Alternative You Need to Know! 🔍

Koala Revealed: The ChatGPT Alternative You Need to Know! 🔍

Running Koala for free in Colab. Your own personal ChatGPT? (tutorial)

Running Koala for free in Colab. Your own personal ChatGPT? (tutorial)

BabyAGI: Discover the Power of Task-Driven Autonomous Agents!

BabyAGI: Discover the Power of Task-Driven Autonomous Agents!

Auto-GPT - How to Automate a Task Based AI with GPT-4

Auto-GPT - How to Automate a Task Based AI with GPT-4

Improve your BabyAGI with LangChain

Improve your BabyAGI with LangChain

Generative Agents - Deep Dive and GPT-4 Recreation

Generative Agents - Deep Dive and GPT-4 Recreation

GPT4ALLv2: The Improvements and Drawbacks You Need to Know!

GPT4ALLv2: The Improvements and Drawbacks You Need to Know!

Dolly 2.0 by Databricks: Open for Business but is it Ready to Impress!

Dolly 2.0 by Databricks: Open for Business but is it Ready to Impress!

Red Pajama - Operation: Freeing LLaMA

Red Pajama - Operation: Freeing LLaMA

Investigating Open Assistant - Models, Datasets and Addons

Investigating Open Assistant - Models, Datasets and Addons

Investigating MiniGPT-4 - The Secret behind GPT-V?

Investigating MiniGPT-4 - The Secret behind GPT-V?

Stable LM 3B - The new tiny kid on the block.

Stable LM 3B - The new tiny kid on the block.

Bard can now code and put that code in Colab for you.

Bard can now code and put that code in Colab for you.

Checking out Bark: a Text to Speech system by Suno AI

Checking out Bark: a Text to Speech system by Suno AI

Fine-tuning LLMs with PEFT and LoRA

Fine-tuning LLMs with PEFT and LoRA

Master PDF Chat with LangChain - Your essential guide to queries on documents

Master PDF Chat with LangChain - Your essential guide to queries on documents

Using LangChain with DuckDuckGO Wikipedia & PythonREPL Tools

Using LangChain with DuckDuckGO Wikipedia & PythonREPL Tools

Building Custom Tools and Agents with LangChain (gpt-3.5-turbo)

Building Custom Tools and Agents with LangChain (gpt-3.5-turbo)

StableVicuna: The New King of Open ChatGPTs?

StableVicuna: The New King of Open ChatGPTs?

WizardLM: Evolving Instruction Datasets to Create a Better Model

WizardLM: Evolving Instruction Datasets to Create a Better Model

LaMini-LM - Mini Models Maxi Data!

LaMini-LM - Mini Models Maxi Data!

Finding the Best Free ChatGPT

Finding the Best Free ChatGPT

MPT-7B - The First Commercially Usable Fully Trained LLaMA Style Model

MPT-7B - The First Commercially Usable Fully Trained LLaMA Style Model

LangChain Retrieval QA Over Multiple Files with ChromaDB

LangChain Retrieval QA Over Multiple Files with ChromaDB

LangChain Retrieval QA with Instructor Embeddings & ChromaDB for PDFs

LangChain Retrieval QA with Instructor Embeddings & ChromaDB for PDFs

LangChain + Retrieval Local LLMs for Retrieval QA - No OpenAI!!!

LangChain + Retrieval Local LLMs for Retrieval QA - No OpenAI!!!

Transformers Agent - Is this Hugging Face's LangChain Competitor?

Transformers Agent - Is this Hugging Face's LangChain Competitor?

StarCoder - The LLM to make you a coding star?

StarCoder - The LLM to make you a coding star?

Testing Starcoder for Reasoning with PAL

Testing Starcoder for Reasoning with PAL

The New Wizards - Unfiltered & Unaligned

The New Wizards - Unfiltered & Unaligned

Camel + LangChain for Synthetic Data & Market Research

Camel + LangChain for Synthetic Data & Market Research

This video introduces StableVicuna, a new open-source chatbot model, and provides a step-by-step guide on how to use it in a Colab notebook. The model uses LLaMa and RLHF training, making it a powerful tool for building conversational AI agents.

Key Takeaways

Access the Colab notebook
Install required libraries
Load the StableVicuna model
Test the model with example prompts
Fine-tune the model for specific use cases

💡 StableVicuna's open-source nature and RLHF training make it a highly customizable and effective chatbot model for a wide range of applications.

🔒 Pro feature: Ask AI to explain this lesson →

More on: LLM Foundations

View skill →

Getting Started with Vertex AI Gemini 1.5 Flash

I TRAINED AN AI TO SOLVE 2+2 (w/ Live Coding)

I TRAINED AN AI TO SOLVE 2+2 (w/ Live Coding)

How to use the ChatGPT API with Python!!

How to use the ChatGPT API with Python!!

Nicholas Renotte

Gemini 2.5: Create an interactive plot of economic data

Gemini 2.5: Create an interactive plot of economic data

Google DeepMind

LangChain Chatbots: Building a Personalized AI Assistant

LangChain Chatbots: Building a Personalized AI Assistant

Analytics Vidhya

Auto-generating meeting notes with Python

Auto-generating meeting notes with Python

Related AI Lessons

Debugging Benchmark: DeepSeek V4 Pro vs MiMo V2.5 Pro

Compare the debugging capabilities of DeepSeek V4 Pro and MiMo V2.5 Pro on a real-world GitHub bug

Dev.to · Stanislav

How I'm re-discovering computer science with LLM revolution

Reinvigorate your computer science knowledge with the LLM revolution and discover new applications and techniques

Dev.to · popiol

I Asked ChatGPT to Fix My Life. It Couldn’t — Until I Changed One Thing

Learn how to effectively use AI like ChatGPT to improve your life by changing your approach

I Asked ChatGPT to Fix My Life. It Couldn’t — Until I Changed One Thing

Learn how to effectively use ChatGPT to solve personal problems by changing your approach

Medium · ChatGPT

Chapters (3)

Intro

2:15 datasets

3:15 Colab notebook

5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems

Dave Ebbelaar (LLM Eng)