Multimodal Generative AI: Vision, Speech, and Assistants

Coursera Course · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Multimodal Generative AI: Vision, Speech, and Assistants

Coursera · Beginner ·🧠 Large Language Models ·1h ago
We are introducing a new course to replace the "Coding with ChatGPT" course in the Generative AI specialization. This updated course will cover materials, models, and content released in 2024. Some of the new additions include material on using AI for image-to-text (vision), text-to-speech, speech-to-text, and the Assistant API. All these topics come with new labs, lessons, and exercises.
Watch on Coursera ↗ (saves to browser)
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)