Prompt Engineering for Vision Models

Free to audit

Coursera · Beginner · ✍️ Prompt Engineering · 5h ago
Prompt engineering applies not only to text models but also to vision models. Depending on the vision model, a prompt may be text, but it can also be pixel coordinates, bounding boxes, or segmentation masks. In this course, you’ll learn to prompt different vision models: Meta’s Segment Anything Model (SAM), a universal image segmentation model; OWL-ViT, a zero-shot object detection model; and Stable Diffusion 2.0, a widely used diffusion model. You’ll also use a fine-tuning technique called DreamBooth to tune a diffusion model to associate a text label with an object of your pref…