We’re introducing three audio models in the API

OpenAI · Beginner · 🧠 Large Language Models · 2mo ago

Skills: LLM Foundations (80%)

Key Takeaways

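OpenAI introduces three audio models in its API for building voice apps: GPT‑Realtime‑2 (voice with GPT‑5‑class reasoning), GPT‑Realtime‑Translate (live speech translation), and GPT‑Realtime‑Whisper (streaming speech-to-text)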

Original Description

We’re introducing three audio models in the API that unlock a new class of voice apps for developers. With these models, developers can build voice experiences that feel more natural, respond more intelligently, and take action in real time:

• GPT‑Realtime‑2, our first voice model with GPT‑5‑class reasoning, which can handle harder requests and carry the conversation forward naturally.
• GPT‑Realtime‑Translate, a new live translation model that translates speech from 70+ input languages into 13 output languages while keeping pace with the speaker.
• GPT‑Realtime‑Whisper, a new streaming speech-to-text model that transcribes speech live as the speaker talks.


