Programming Generative AI: Unit 3

Free to audit · Opens on Coursera

Coursera · Advanced · 🎨 Image & Video AI · 1mo ago
This advanced course module focuses on state-of-the-art multimodal models and is designed for learners who want to work across images and text using current techniques in AI-driven content generation. You’ll begin with the foundational concepts behind multimodal models, learning how contrastive language-image pre-training (CLIP) maps images and text into a shared embedding space. You’ll then see how these models power applications such as semantic image search, which lets you query image content without manual labeling. From there, the course covers the mechanics of latent diffusion models and the inner workings of Stable Diffusion, so that you can turn text prompts into entirely new images. It also covers strategies for evaluating generative models and introduces efficient methods for fine-tuning and adapting pre-trained models to new styles and subjects. By the end, you’ll be equipped to build, adapt, and optimize text-to-image systems for creative, research, or commercial settings.
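As a rough illustration of the semantic image search idea mentioned in the description, the sketch below ranks images by cosine similarity between a text-query embedding and image embeddings. It is not taken from the course materials. The file names and three-dimensional vectors are made-up stand-ins for the embeddings a contrastive image-text model would produce, and the helper names `cosine_similarity` and `rank_images` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_images(query_embedding, image_embeddings):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [
        (name, cosine_similarity(query_embedding, embedding))
        for name, embedding in image_embeddings.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors (assumption): a real system would obtain these from the
# image encoder and text encoder of a contrastively trained model.
image_embeddings = {
    "beach.jpg": [0.9, 0.1, 0.0],
    "city.jpg": [0.1, 0.9, 0.2],
    "forest.jpg": [0.0, 0.2, 0.9],
}

# Pretend embedding of the text query "a sunny beach" (assumption).
query_embedding = [0.8, 0.2, 0.1]

ranking = rank_images(query_embedding, image_embeddings)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Because both modalities share one embedding space, no image needs a manual label: the query text is embedded once and compared against every stored image vector.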
