NEW WizardCoder Python 34B LLM is AMAZING!!!
Key Takeaways
The WizardCoder Python 34B model achieves a pass@1 score of 73.2 on HumanEval. That surpasses the score OpenAI initially reported for GPT-4, but not the 82 that the latest GPT-4 API scored when the WizardCoder team re-ran the benchmark. The model is fine-tuned from Code Llama (itself built on Llama 2) and is available for download on Hugging Face.
Full Transcript
The WizardCoder Python 34 billion parameter model is the latest model to join the family of WizardCoder models. The model has been in the news because there have been claims that it has knocked GPT-4 out of the park. That is partly true and partly not true, and in this video I'm going to explore why. We're also going to explore why this model has been discussed so much.

The first thing to start with: WizardCoder is a family of models. Every time a new model or a new dataset comes in, WizardCoder manages to update their existing model and stay at the top. Now WizardCoder has its latest version, a Python-specific 34 billion parameter model based on Code Llama. They've taken Code Llama and fine-tuned their WizardCoder model on top of it, and that model is said to be beating GPT-4.

How is it beating GPT-4? Before I even get into the model itself, let's first clear this up. The news is that the WizardCoder Python 34 billion parameter model achieved 73.2 on HumanEval pass@1, and that is surpassing GPT-4. But how it is surpassing GPT-4 is something you need to keep in mind.

When GPT-4 was launched, OpenAI reported a particular HumanEval score, and that score was 62. If that is GPT-4's score, then you know for sure that WizardCoder and a lot of other models have beaten GPT-4. But is that the whole story? No. When the WizardCoder team tried to replicate the benchmark with the latest GPT-4 API, it scored 82. So has the WizardCoder Python 34 billion parameter model scored more than 82? No, absolutely not. It has not scored more than 82.
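For readers unfamiliar with the metric, here is a minimal sketch of how a pass@1 figure like 73.2 could be computed. It uses the standard unbiased pass@k estimator, 1 - C(n-c, k) / C(n, k), where n is the number of samples drawn for a problem and c is how many of them pass the unit tests. The function names and the per-problem results are hypothetical, and the 120-of-164 split is only an illustration that happens to round to 73.2 (HumanEval has 164 problems); it is not a figure taken from the video.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem: 1 - C(n-c, k) / C(n, k)."""
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("need 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Every size-k subset of the samples contains at least one passing sample.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_score(results: list[tuple[int, int]], k: int = 1) -> float:
    """Average pass@k over all problems, expressed as a percentage."""
    return 100.0 * sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Hypothetical run: one sample per problem, 120 of 164 problems solved.
greedy_results = [(1, 1)] * 120 + [(1, 0)] * 44
print(round(benchmark_score(greedy_results), 1))  # 73.2
```

With a single sample per problem, pass@1 reduces to the plain fraction of problems solved, which is why a difference of a few points between two models corresponds to only a handful of problems.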
It has surpassed what OpenAI reported at the start. So there are two things here; let me make it clear. When OpenAI launched GPT-4, there was a score that they reported, and WizardCoder and a bunch of other models have overcome it. When these researchers calculated the benchmark score for GPT-4 with the latest API, it scored 82, which is way above every open source model that is available.

Having said this, if you set aside the headline that the WizardCoder 34 billion parameter Python model has beaten OpenAI's GPT-4, WizardCoder Python 34B is still the best open Python or coding model that you could see. The primary reason is that it has been fine-tuned from Code Llama, the Llama 2 based code model that was released a couple of days back.

A much more promising aspect of this entire piece is that this is a 34 billion parameter model. This is not a 70 billion parameter model, and this is not a mixture of experts. It is just a single 34 billion parameter model that is doing extremely well on the HumanEval benchmark.

Now, if you ask me, "Do you trust every single benchmark that is available?", I would say probably not. I don't trust benchmarks. I would expect people to put the models in the hands of other people to try them out, and that is exactly what the team has done. They have uploaded the model to the Hugging Face Model Hub. The checkpoint is available for you to download directly: you can go there and click "Files and versions". There are no terms of service to accept and no form to fill in; you just go there and download. If you have a big enough GPU, and I think you need a really good GPU, you can load this model and try it out.

I've heard from a couple of places like Hacker News that people feel this model and every other Code Llama derivative is doing really well, and that they're happy with the performance they've gotten when they compare it with GPT-4. Unfortunately, I could not make the comparison myself because the Gradio application was running forever. I waited a long time to see if the task would finish, but it didn't. So if you happen to run this model locally, please let me know what you think of it.

The fact that we have an open source model that is way above every other open source model is, first of all, a great advantage. This open source model scored 73.2, which is a really good score. So overall the point is very simple: the WizardCoder Python 34 billion parameter model is one of the best open source models you could use today, and it scores much higher than the Code Llama Python model. I think that's the power of fine-tuning. But does it beat GPT-4? No, it doesn't beat GPT-4 in its current form. The fact that it comes close to GPT-4 is still quite an amazing achievement.

Now that we have cleared up these questions: if you want to use this model yourself, all you have to do is download it from the Hugging Face page. I think this clears up the entire buzz around WizardCoder, or any other model, beating GPT-4. Whenever you see news that some model has beaten GPT-4, make sure you check which version of GPT-4 they are talking about, which benchmark they used, how they trained the model, and whether there has been any data leakage. It is a good thing for you to try it out yourself.

The fact that these researchers have very honestly put out that they got 82 and 72.5 when they tested it themselves with the latest API is, I think, commendable, and I really want to appreciate them for that. The fact that the model is available as open source for you to try out is another cherry on top. I just wanted to release this video to explain why everybody has been talking about beating GPT-4 and why it may not be entirely true.

If you have any questions, let me know in the comment section. I'm definitely looking forward to trying out this model, comparing it with GPT-4, and giving you some insights. See you in another video. Happy prompting!
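For anyone who wants to try the download step described in the transcript, the following is a rough sketch rather than the team's official recipe. It assumes the `transformers`, `torch`, and `accelerate` packages are installed (`device_map="auto"` relies on `accelerate`), and it assumes an Alpaca-style instruction template, which should be checked against the model card. The helper names `build_prompt`, `weight_memory_gb`, and `generate` are hypothetical. The memory helper only estimates the size of the weights, which gives a sense of why a "really good GPU" is needed.

```python
MODEL_ID = "WizardLM/WizardCoder-Python-34B-V1.0"  # repo linked in the video description

# Assumed Alpaca-style template; confirm against the model card before relying on it.
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)


def build_prompt(instruction: str) -> str:
    """Wrap a plain instruction in the assumed prompt template."""
    return PROMPT_TEMPLATE.format(instruction=instruction.strip())


def weight_memory_gb(n_params: float, bytes_per_param: int = 2) -> float:
    """Rough size of the weights alone (fp16 = 2 bytes each), excluding activations and KV cache."""
    return n_params * bytes_per_param / 1e9


def generate(instruction: str, max_new_tokens: int = 256) -> str:
    """Download the checkpoint (tens of GB) and run one greedy completion."""
    # Imported lazily so the helpers above work without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.float16, device_map="auto"
    )
    inputs = tokenizer(build_prompt(instruction), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    # Drop the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


print(weight_memory_gb(34e9))  # 68.0
```

Calling `generate("Write a Python function that reverses a string.")` would trigger the full download and load, so it is left out of the snippet. At roughly 68 GB of fp16 weights, the model does not fit on a single consumer GPU without quantization or sharding across devices.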
Original Description
WizardCoder-Python-34B-V1.0, which achieves 73.2 pass@1 and surpasses GPT4 (2023/03/15), ChatGPT-3.5, and Claude2
Two Evals - https://twitter.com/WizardLM_AI/status/1695396881218859374?s=20
WizardCoder Python on HF Model Link - https://huggingface.co/WizardLM/WizardCoder-Python-34B-V1.0
❤️ If you want to support the channel ❤️
Support here:
Patreon - https://www.patreon.com/1littlecoder/
Ko-Fi - https://ko-fi.com/1littlecoder