Is This The Fastest ASR?
In this video, I dive into IBM's newly released Granite Speech 4.1 models and explore what makes them interesting — particularly the three 2B variants they've dropped and how each one makes a different trade-off between accuracy, richness, and throughput that you'll actually care about for real applications.
🔗 Links:
IBM Research Blog → https://research.ibm.com/blog/granite-4-1-ai-foundation-models
Twitter: https://x.com/Sam_Witteveen
🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: https://drp.li/dIMes
👨💻Github:
https://github.com/samwit/llm-tutorials
⏱️Time Stamps:
00:00 Intro
00:20 IBM Granite Collection
00:27 Granite Docling
00:46 Granite Speech 4.1
01:16 Granite 4.1 Blog
01:38 Granite Speech 4.1 2B
04:02 Granite Speech 4.1 2B Plus
06:15 Granite Speech 4.1 2B NAR
07:30 NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper
07:45 Architecture
09:45 Code Time
12:00 Granite Speech Model Github
#DellProPrecision #DellProMax #Delltech #localai #NVIDIA
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
More on: LLM Engineering
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
LLM Cost Calculator
Dev.to · Codehelper
How to Run Claude Code Locally (100% Free & Fully Private)
Medium · LLM
Stop Blaming Claude Opus 4.7. Your Prompts Were Always Broken — 4.6 Was Just Carrying You.
Medium · LLM
AI Isn’t “Inspired” by Human Writing. It Is Built on Unpaid Intellectual Labor
Dev.to AI
Chapters (12)
Intro
0:20
IBM Granite Collection
0:27
Granite Docling
0:46
Granite Speech 4.1
1:16
Granite 4.1 Blog
1:38
Granite Speech 4.1 2B
4:02
Granite Speech 4.1 2B Plus
6:15
Granite Speech 4.1 2B NAR
7:30
NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper
7:45
Architecture
9:45
Code Time
12:00
Granite Speech Model Github
🎓
Tutor Explanation
DeepCamp AI