Build Multimodal Generative AI Applications

Coursera Courses ↗ · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Build Multimodal Generative AI Applications

Coursera · Intermediate ·🎨 Image & Video AI ·5h ago
Ready to level up your GenAI skills? Step into the exciting world of multimodal AI, where language, images, and speech come together to build smarter, more interactive applications. In this hands-on course, you’ll learn how to build systems that work across multiple modalities, from creating AI-powered storytellers and meeting assistants to developing image captioning tools and video generation apps. You’ll gain experience with real-world tools like IBM’s Granite, OpenAI’s Whisper, Sora and DALL·E, Meta’s Llama, Mistral’s Mixtral, and Gradio. Plus, you'll explore multimodal search, question…
Watch on Coursera ↗ (saves to browser)
Gestão de produtos digitais: Princípios básicos modernos
Next Up
Gestão de produtos digitais: Princípios básicos modernos
Coursera