Multi-modal AI

Coursera Courses ↗ · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Multi-modal AI

Coursera · Intermediate ·💻 AI-Assisted Coding ·5h ago
Learn to build production applications by combining visual and textual inputs with AI coding tools. You will explore multi-modal programming where screenshots, images, and text serve as inputs for AI-assisted code generation, and set up development environments configured for visual AI workflows. The course covers prompt engineering with visual context to improve code generation accuracy, and hands-on development with GitHub Copilot in VS Code for inline suggestions and chat-based interactions. You will build a complete project using live reload and browser developer tools for rapid feedback between AI generation and visual output. The iterative development module teaches documentation-driven design where documentation guides AI toward desired outcomes, image-based iteration for refining generated code through visual comparison, and automated checks and validations that maintain quality through development cycles. You will learn to identify and overcome common iteration challenges including regression and context drift. The advanced module covers Model Context Protocol for connecting AI tools with external capabilities, Playwright for browser automation and visual testing, and Playwright MCP for AI-driven browser interactions that validate web applications directly. By completing this course, you will be able to convert screenshots into production code through iterative, automated, multi-modal AI workflows.
Watch on Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Stop Overpaying for Claude — The Advisor Pattern Saves 85% [Hands-On Guide]
Apply the Advisor Pattern to save 85% on Claude costs by using Haiku+Opus for 2x quality, and follow a hands-on guide to implement this production framework
Medium · AI
Why Senior Developers Must Rethink Their Role in the AI Era
Senior developers must adapt to the AI era by rethinking their role and responsibilities to remain relevant
Medium · Machine Learning
Why Senior Developers Must Rethink Their Role in the AI Era
Senior developers must adapt to the AI era by rethinking their role and responsibilities to remain relevant
Medium · Programming
Playwright Chronicles: Crafting Elegant Automation in JS/TS from Scratch — Part 13: Introduction…
Learn to automate tasks in JavaScript/TypeScript using Playwright from scratch
Medium · JavaScript
Up next
ChatGPT Codex Super App + Paperclip + Hermes + Claude Design
Julian Goldie SEO
Watch →