Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
AI Traffic Camera Detects Speed & License Plates🚗
Computer Vision
AI Traffic Camera Detects Speed & License Plates🚗
Techie Sapien Intermediate 2d ago
How AI Builds Marketing Campaigns in Minutes (Not Days)
Computer Vision
How AI Builds Marketing Campaigns in Minutes (Not Days)
BugendaiTech Intermediate 6d ago
Why Selling to a Population is a Huge Mistake
Computer Vision
Why Selling to a Population is a Huge Mistake
Business Growth with Joe Intermediate 1w ago
How to build a custom vision agent
Computer Vision
How to build a custom vision agent
Google Cloud Tech Intermediate 1w ago
SPACEX la Mayor SALIDA a BOLSA Nunca Vista ¿Burbuja o Gran Oportunidad?
Computer Vision
SPACEX la Mayor SALIDA a BOLSA Nunca Vista ¿Burbuja o Gran Oportunidad?
El Banquero del Pueblo Intermediate 2w ago
AI Powered | Face Recognition @FameWorldEducationalHub  #computereducation #facerecognition
Computer Vision
AI Powered | Face Recognition @FameWorldEducationalHub #computereducation #facerecognition
FAME WORLD EDUCATIONAL HUB Intermediate 2w ago
Student Team Designs Predictive AI System to Optimize Port Operations
Computer Vision
Student Team Designs Predictive AI System to Optimize Port Operations
Huawei Intermediate 2w ago
Walking the Fine Line Between YOLO Agents and Trust
Computer Vision
Walking the Fine Line Between YOLO Agents and Trust
Workday Intermediate 2w ago
Google Listens to Your Videos
Computer Vision
Google Listens to Your Videos
Ahrefs Intermediate 4w ago
Rafi Ibn Sultan - WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation..
Computer Vision
Rafi Ibn Sultan - WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation..
Cohere Intermediate 1mo ago
How Whering architects cost efficient multimodal AI apps
Computer Vision
How Whering architects cost efficient multimodal AI apps
Google Cloud Tech Intermediate 1mo ago
AI Dev 26 x SF | Ashwyn Sharma: Every App Needs a Voice UI. Here's How to Build It
Computer Vision
AI Dev 26 x SF | Ashwyn Sharma: Every App Needs a Voice UI. Here's How to Build It
DeepLearningAI Intermediate 1mo ago
[CVPR 2026]: RoMo: A Large-Scale Richly Organized Dataset and Semantic Taxonomy for Human Motion Gen
Computer Vision
[CVPR 2026]: RoMo: A Large-Scale Richly Organized Dataset and Semantic Taxonomy for Human Motion Gen
anucvml Intermediate 1mo ago
PipeGen Demo: Build End-to-End Edge AI Pipelines Automatically | CraftifAI
Computer Vision
PipeGen Demo: Build End-to-End Edge AI Pipelines Automatically | CraftifAI
CraftifAI Intermediate 1mo ago
Invertí en ACCIONES de Dividendo… y Descubrí un Problema Preocupante
Computer Vision
Invertí en ACCIONES de Dividendo… y Descubrí un Problema Preocupante
El Banquero del Pueblo Intermediate 1mo ago
AI Diaries Episode Multimodal Environmental Sensing for Smarter Cities
Computer Vision
AI Diaries Episode Multimodal Environmental Sensing for Smarter Cities
QuickTech Daily Intermediate 1mo ago
AI Diaries Episode Unified Multimodal Sensing for Smart Biotech Labs
Computer Vision
AI Diaries Episode Unified Multimodal Sensing for Smart Biotech Labs
QuickTech Daily Intermediate 1mo ago
Data is hungry for context
Computer Vision
Data is hungry for context
DeepLearningAI Intermediate 1mo ago
DGX Spark Live:  NYC Spark Hack Winner feature - A 3D time machine for every building in NYC
Computer Vision
DGX Spark Live: NYC Spark Hack Winner feature - A 3D time machine for every building in NYC
NVIDIA Developer Intermediate 1mo ago
4 Retirement Income Strategies 💰
Computer Vision
4 Retirement Income Strategies 💰
Money Matters MD Intermediate 2mo ago
From Raw Video to Real Physics: The Google Cloud AI Breakdown
Computer Vision
From Raw Video to Real Physics: The Google Cloud AI Breakdown
Google Cloud Intermediate 2mo ago
He Leído 447 Libros: Estas 5 LECCIONES Impulsarán tu RIQUEZA
Computer Vision
He Leído 447 Libros: Estas 5 LECCIONES Impulsarán tu RIQUEZA
El Club de Inversión Intermediate 2mo ago
Turn Images into Insights with Vision Events
Computer Vision
Turn Images into Insights with Vision Events
Roboflow Intermediate 2mo ago
Real-Time Site Analysis: How to Build Custom Autodesk Forma Extensions
Computer Vision
Real-Time Site Analysis: How to Build Custom Autodesk Forma Extensions
Autodesk Developer Intermediate 2mo ago
Animating the Xenomorph in Alien: Isolation.
Computer Vision
Animating the Xenomorph in Alien: Isolation.
AI and Games Intermediate 2mo ago
The True Origin of Vision Transformers #ai #podcast
Computer Vision
The True Origin of Vision Transformers #ai #podcast
The MAD Podcast with Matt Turck Intermediate 2mo ago
How AI Vision Evolved | Merve Noyan
Computer Vision
How AI Vision Evolved | Merve Noyan
Hugging Face Intermediate 2mo ago
Build Your Own AI Virtual Mouse using Python & OpenCV
Computer Vision
Build Your Own AI Virtual Mouse using Python & OpenCV
REGITE Intermediate 2mo ago
Bird's Eye View Traffic Analysis with YOLO26
Computer Vision
Bird's Eye View Traffic Analysis with YOLO26
Muhammad Moin Intermediate 2mo ago
Alibaba właśnie ogłosiło Qwen3.5-Omni 🔥 AI które widzi, słyszy i mówi naraz
Computer Vision
Alibaba właśnie ogłosiło Qwen3.5-Omni 🔥 AI które widzi, słyszy i mówi naraz
Alchemicy AI Intermediate 3mo ago
How do you build AI products that people actually trust, use, and scale?
Computer Vision
How do you build AI products that people actually trust, use, and scale?
BetterTech Intermediate 3mo ago
✅Webinar — El Líder detrás del Prompt: En quién debes convertirte en la era de la IA.
Computer Vision
✅Webinar — El Líder detrás del Prompt: En quién debes convertirte en la era de la IA.
OKR University Intermediate 3mo ago
🎯 Multimodal AI in 2026: Images, Voice & Video in One Prompt
Computer Vision
🎯 Multimodal AI in 2026: Images, Voice & Video in One Prompt
Digitek Nova Intermediate 3mo ago
Mistral Small 4: One AI Model for Everything? 🤯
Computer Vision ⚡ AI Lesson
Mistral Small 4: One AI Model for Everything? 🤯
Analytics Vidhya Intermediate 3mo ago
Mistral Small 4 in 8 mins!
Computer Vision ⚡ AI Lesson
Mistral Small 4 in 8 mins!
1littlecoder Intermediate 3mo ago
Duolingo English Test 2026 - NEW Full Practice Test with Answers
Computer Vision
Duolingo English Test 2026 - NEW Full Practice Test with Answers
Teacher Luke - Duolingo English Test Intermediate 3mo ago
Deploy Edge AI: Setting Up GigE Cameras
Computer Vision ⚡ AI Lesson
Deploy Edge AI: Setting Up GigE Cameras
Roboflow Intermediate 4mo ago
PyTorch Day India 2026 Exploring Tile based Programming Abstractions for KLA’s Image Processing Work
Computer Vision ⚡ AI Lesson
PyTorch Day India 2026 Exploring Tile based Programming Abstractions for KLA’s Image Processing Work
PyTorch Intermediate 4mo ago
Interactive Speaking Course for 120+ | Duolingo English Test
Computer Vision
Interactive Speaking Course for 120+ | Duolingo English Test
Teacher Luke - Duolingo English Test Intermediate 4mo ago
Rethinking Enterprise Networking, Open Architecture, Managed Operations | Statice Tech
Computer Vision
Rethinking Enterprise Networking, Open Architecture, Managed Operations | Statice Tech
Statice Tech Intermediate 4mo ago
»are they, a/i cartographers, drunk?« »infamous!«
Computer Vision
»are they, a/i cartographers, drunk?« »infamous!«
dmn*1975.1945.1915 Intermediate 4mo ago
X88 Pro 10 TV Box as a distraction-free productivity device in 2026
Computer Vision
X88 Pro 10 TV Box as a distraction-free productivity device in 2026
Cade Edwards Intermediate 6mo ago
Mistral OCR 3 Deep Dive: Document AI Done Right
Computer Vision
Mistral OCR 3 Deep Dive: Document AI Done Right
DataCreator AI Intermediate 6mo ago
Una sola CUENTA para Acceder a las MEJORES OFERTAS de Ahorro | Raisin
Computer Vision
Una sola CUENTA para Acceder a las MEJORES OFERTAS de Ahorro | Raisin
El Banquero del Pueblo Intermediate 4mo ago
Multi-Object Tracking Made Easy | Trackers CLI + RF-DETR | Live Demo + Q&A (Feb 19th)
Computer Vision ⚡ AI Lesson
Multi-Object Tracking Made Easy | Trackers CLI + RF-DETR | Live Demo + Q&A (Feb 19th)
Roboflow Intermediate 4mo ago
RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)
Computer Vision
RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)
Roboflow Intermediate 5mo ago
Unlock data from your files with Agentic Document Extraction
Computer Vision
Unlock data from your files with Agentic Document Extraction
DeepLearningAI Intermediate 5mo ago
New course! Document AI: From OCR to Agentic Doc Extraction
Computer Vision
New course! Document AI: From OCR to Agentic Doc Extraction
DeepLearningAI Intermediate 5mo ago
📚 Continue on Coursera External links · Free to audit
1 / 3 View all →
Intro to Operating Systems 2: Memory Management
📚 External: Coursera ↗
Self-paced
Intro to Operating Systems 2: Memory Management
Opens on Coursera ↗
Infraestructura: Tecnologías Detrás de Recintos Inteligentes
📚 External: Coursera ↗
Self-paced
Infraestructura: Tecnologías Detrás de Recintos Inteligentes
Opens on Coursera ↗
Low Code Image Segmentation
📚 External: Coursera ↗
Self-paced
Low Code Image Segmentation
Opens on Coursera ↗
Market Research Case Study: Apply & Analyze
📚 External: Coursera ↗
Self-paced
Market Research Case Study: Apply & Analyze
Opens on Coursera ↗
The Social Media Landscape
📚 External: Coursera ↗
Self-paced
The Social Media Landscape
Opens on Coursera ↗
Landing.AI for Beginners: Build Data Visualization AI Models
📚 External: Coursera ↗
Self-paced
Landing.AI for Beginners: Build Data Visualization AI Models
Opens on Coursera ↗