Introducing Scribe v2 Realtime
Today we’re introducing a state-of-the-art Speech to Text model.
Scribe v2 Realtime is the most accurate low-latency model, delivering live transcription in under 150ms for voice agents, meeting notetakers, and live apps — across 90+ languages!
Key highlights:
- Negative latency: Next word and punctuation prediction
- Automatic language detection: Speak in any language, switch language mid conversation
- Text conditioning: Scribe v2 Realtime continues the transcription based on the previous batch, useful when restarting a connection
- Voice Activity Detection (VAD)
- Manual commit: Full control over when to finalize transcript segments
- Multiple audio formats: Support for PCM (48kHz) and μ-law encoding
- Enterprise ready with SOC 2, ISO 27001, PCI DSS L1, HIPAA, and GDPR compliance, EU and India data residency options and Zero retention mode for sensitive workloads
You can build with the API.
Docs https://elevenlabs.io/docs/capabilities/speech-to-text?utm_source=youtube&utm_medium=organic&utm_campaign=launch&utm_content=introducing_scribe_v2_&_scribe_v2_realtime
Use Scribe v2 Realtime directly in ElevenLabs Agents to power human-sounding voice agents for support, sales, or in-product experiences.
→ https://elevenlabs.io/agents?utm_source=youtube&utm_medium=organic&utm_campaign=launch&utm_content=introducing_scribe_v2_&_scribe_v2_realtime
Ready to start building?
→ https://elevenlabs.io/speech-to-text?utm_source=youtube&utm_medium=organic&utm_campaign=launch&utm_content=introducing_scribe_v2_&_scribe_v2_realtime
Join the Community
• Discord – https://discord.gg/hPE7yT33Qc
• Reddit – https://www.reddit.com/r/ElevenLabs/
Links & Resources
• ElevenLabs – https://elevenlabs.io/?utm_source=youtube&utm_medium=organic&utm_campaign=launch&utm_content=introducing_scribe_v2_&_scribe_v2_realtime
• Docs & API – https://elevenlabs.io/docs/?utm_source=youtube&utm_medium=organic&utm_campaign=launch&utm_content=introducing_scribe_v2_&_scribe_v2_realtime
• Blog – https://e
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from ElevenLabs · ElevenLabs · 0 of 60
← Previous
Next →
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
AI Multilingual TTS Demo | ElevenLabs
ElevenLabs
Professional Voice Cloning Demo | ElevenLabs
ElevenLabs
Introducing: Projects
ElevenLabs
ElevenLabs Dubbing Studio
ElevenLabs
ElevenLabs Speech to Speech
ElevenLabs
Sound Effects are Coming Soon to ElevenLabs
ElevenLabs
It Started to Sing
ElevenLabs
It Started to Sing (Jazz)
ElevenLabs
Broke my Heart
ElevenLabs
My Love
ElevenLabs
ElevenLabs Dubbing API
ElevenLabs
ElevenLabs Audio Native
ElevenLabs
Increasing reader engagement with Audio Native [June 2024 Webinar]
ElevenLabs
ElevenLabs Text to Sound Effects API Demo
ElevenLabs
ElevenLabs Voiceover Studio
ElevenLabs
ElevenLabs Speech to Speech Tutorial
ElevenLabs
ElevenLabs Voice Isolator API Demo
ElevenLabs
ElevenLabs Turbo v2.5
ElevenLabs
ElevenLabs Reader App - Android
ElevenLabs
ElevenLabs Impact Program
ElevenLabs
We’ve reduced our pricing.
ElevenLabs
Sound Effects Explore
ElevenLabs
Behind the Voice - Michael Marshall
ElevenLabs
X to Voice
ElevenLabs
Huberman Labs now Dubbing with ElevenLabs
ElevenLabs
GenFM, Now Playing on ElevenReader: Smart Podcasts Produced by Generative AI
ElevenLabs
Introducing ElevenLabs Conversational Agents
ElevenLabs
Meet Flash
ElevenLabs
Transforming media production with AI sound effects & dubbing
ElevenLabs
ElevenLabs Conversational AI Webinar
ElevenLabs
ElevenLabs - Text to Speech, Dubbing, Sound Effects and more
ElevenLabs
Talk to Santa
ElevenLabs
Year in Review with special guest TIME's CTO Burhan Hamid
ElevenLabs
Conversational AI Voice Agents that can issue refunds
ElevenLabs
Prenez une longueur d'avance avec ElevenLabs
ElevenLabs
Studio is now available to everyone, with new features (walkthrough)
ElevenLabs
Build Conversational AI agents with Gemini 2.0 Flash
ElevenLabs
Meet Alexis & El – Support Agents Handling 4,000 Calls a Day
ElevenLabs
Transform your Speech with ElevenLabs Voice Changer
ElevenLabs
Personalize conversational AI phone calls with Twilio
ElevenLabs
Spotify is now accepting Audiobooks Narrated by ElevenLabs
ElevenLabs
Build Outbound AI Sales Agents
ElevenLabs
Meet Scribe
ElevenLabs
Build a multilingual speech transcription bot with the ElevenLabs transcriber API
ElevenLabs
Streaming and Caching Speech with Supabase
ElevenLabs
Meet GibberLink, Conversational AI's secret language
ElevenLabs
Building a Personal AI Receptionist
ElevenLabs
Cross-platform AI Voice Agents with Expo React Native
ElevenLabs
Automatic Language Detection for Conversational AI
ElevenLabs
Native Retrieval-Augmented Generation (RAG) in Conversational AI
ElevenLabs
Text to Bark from ElevenLabs
ElevenLabs
Meet KUBI the Conversational Robot Barista
ElevenLabs
Introducing the ElevenLabs MCP server
ElevenLabs
Collect and analyze data with Conversational AI
ElevenLabs
Agent Transfer - Conversational AI
ElevenLabs
Sound Effects are now available in Studio
ElevenLabs
How to Make your Professional Voice Clone.
ElevenLabs
Get unique AI Voiceovers in CapCut
ElevenLabs
Transfer to human - Conversational AI
ElevenLabs
Use HeyGen Avatar IV to Make AI Movie Characters
ElevenLabs
More on: Tool Use & Function Calling
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
ZTE Showcases AI Interactive Flat Panel at the Broadband User Congress in Brazil
The Register
Why the Outsourcing vs GCC Decision Looks Different in the AI Era
Medium · AI
4 perf walls I hit shipping an AI hub on Cloudflare Workers KV
Dev.to · Max
What Google’s UCP Tells Us About Agent-Ready Websites via @sejournal, @slobodanmanic
Search Engine Journal
🎓
Tutor Explanation
DeepCamp AI