Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,538
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
PyTorch Day India 2026 Exploring Tile based Programming Abstractions for KLA’s Image Processing Work
Computer Vision ⚡ AI Lesson
PyTorch Day India 2026 Exploring Tile based Programming Abstractions for KLA’s Image Processing Work
PyTorch Intermediate 4mo ago
One Open AI Model Built My Website, Image & Video
Computer Vision
One Open AI Model Built My Website, Image & Video
Analytics Vidhya Beginner 4mo ago
Every Type of AI is Converging Into One #Shorts #AI #NeuralKeith
Computer Vision
Every Type of AI is Converging Into One #Shorts #AI #NeuralKeith
NeuralKeith Beginner 4mo ago
Interactive Speaking Course for 120+ | Duolingo English Test
Computer Vision
Interactive Speaking Course for 120+ | Duolingo English Test
Teacher Luke - Duolingo English Test Intermediate 4mo ago
Page Match lets you quickly sync your spot in a physical or ebook with an audiobook.
Computer Vision
Page Match lets you quickly sync your spot in a physical or ebook with an audiobook.
The Verge Beginner 4mo ago
Rethinking Enterprise Networking, Open Architecture, Managed Operations | Statice Tech
Computer Vision
Rethinking Enterprise Networking, Open Architecture, Managed Operations | Statice Tech
Statice Tech Intermediate 4mo ago
Helping Sports Teams Improve Decision Making with AI: Interview with PlayVision's Marc Zoghby
Computer Vision ⚡ AI Lesson
Helping Sports Teams Improve Decision Making with AI: Interview with PlayVision's Marc Zoghby
Roboflow Beginner 4mo ago
»are they, a/i cartographers, drunk?« »infamous!«
Computer Vision
»are they, a/i cartographers, drunk?« »infamous!«
dmn*1975.1945.1915 Intermediate 4mo ago
RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)
Computer Vision
RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)
Roboflow Intermediate 5mo ago
Full Duolingo English Test with Answers: January 2026 Format
Computer Vision ⚡ AI Lesson
Full Duolingo English Test with Answers: January 2026 Format
Teacher Luke - Duolingo English Test Beginner 5mo ago
Unlock data from your files with Agentic Document Extraction
Computer Vision
Unlock data from your files with Agentic Document Extraction
DeepLearningAI Intermediate 5mo ago
New course! Document AI: From OCR to Agentic Doc Extraction
Computer Vision
New course! Document AI: From OCR to Agentic Doc Extraction
DeepLearningAI Intermediate 5mo ago
Artem Sevastopolsky and Dmitrii Pozdeev - DenseMarks  Learning Canonical Embeddings for Human Heads
Computer Vision ⚡ AI Lesson
Artem Sevastopolsky and Dmitrii Pozdeev - DenseMarks Learning Canonical Embeddings for Human Heads
Cohere Beginner 5mo ago
On-Device AI Just Leveled Up: Liquid AI’s LFM-2.5 Explained
Computer Vision
On-Device AI Just Leveled Up: Liquid AI’s LFM-2.5 Explained
Analytics Vidhya Beginner 5mo ago
Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models
Computer Vision
Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models
Microsoft Research Advanced 5mo ago
X88 Pro 10 TV Box as a distraction-free productivity device in 2026
Computer Vision
X88 Pro 10 TV Box as a distraction-free productivity device in 2026
Cade Edwards Intermediate 6mo ago
Mistral OCR 3 Deep Dive: Document AI Done Right
Computer Vision
Mistral OCR 3 Deep Dive: Document AI Done Right
DataCreator AI Intermediate 6mo ago
33. What are Multimodal Agents? Definition, Examples & Applications In Hindi
Computer Vision
33. What are Multimodal Agents? Definition, Examples & Applications In Hindi
AI SayI Intermediate 6mo ago
26. What is Hugging Face? | Full Guide to Models, Datasets & NLP In Hindi
Computer Vision
26. What is Hugging Face? | Full Guide to Models, Datasets & NLP In Hindi
AI SayI Advanced 6mo ago
Anthony Fuller & Yousef Yassin - LookWhere? Efficient Visual Recognition by Learning Where to Look
Computer Vision ⚡ AI Lesson
Anthony Fuller & Yousef Yassin - LookWhere? Efficient Visual Recognition by Learning Where to Look
Cohere Advanced 6mo ago
The Next Frontier of AI: Real-Time Multimodal Decision Making
Computer Vision
The Next Frontier of AI: Real-Time Multimodal Decision Making
The Information Intermediate 6mo ago
SAM 3: The Eyes for AI  — Nikhila & Pengchuan (Meta Superintelligence), ft. Joseph Nelson (Roboflow)
Computer Vision
SAM 3: The Eyes for AI — Nikhila & Pengchuan (Meta Superintelligence), ft. Joseph Nelson (Roboflow)
Latent Space Intermediate 6mo ago
AI Paradox: Use Text for Logic, Avatars for Meaning
Computer Vision
AI Paradox: Use Text for Logic, Avatars for Meaning
Discover AI Intermediate 6mo ago
PixelTable: Revolutionizing Multimodal AI Development Simplified #shorts #youtube
Computer Vision ⚡ AI Lesson
PixelTable: Revolutionizing Multimodal AI Development Simplified #shorts #youtube
AI Anytime Intermediate 6mo ago
Grounding DINO: Open Vocabulary Object Detection on Videos
Computer Vision
Grounding DINO: Open Vocabulary Object Detection on Videos
PyImageSearch Intermediate 6mo ago
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Computer Vision ⚡ AI Lesson
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Muhammad Moin Beginner 6mo ago
Insane Results with YOLOv8 & YOLO11 — Detection, Segmentation, Pose & More!
Computer Vision
Insane Results with YOLOv8 & YOLO11 — Detection, Segmentation, Pose & More!
Muhammad Moin Intermediate 6mo ago
Email Segmentation: Getting More Sales With Less Traffic
Computer Vision
Email Segmentation: Getting More Sales With Less Traffic
Social Media Examiner Beginner 6mo ago
Gemini 3 Demo: Building a Music Rhythm Game with Computer Vision
Computer Vision
Gemini 3 Demo: Building a Music Rhythm Game with Computer Vision
Google for Developers Intermediate 7mo ago
Should AI be introduced to kids early?  #podcast #interview
Computer Vision ⚡ AI Lesson
Should AI be introduced to kids early? #podcast #interview
Abhishek Thakur Beginner 7mo ago
Stanford Robotics Seminar ENGR319 | Autumn 2025 | General Compliant Robot Interaction
Computer Vision ⚡ AI Lesson
Stanford Robotics Seminar ENGR319 | Autumn 2025 | General Compliant Robot Interaction
Stanford Online Intermediate 7mo ago
AI Video Editing Hack
Computer Vision ⚡ AI Lesson
AI Video Editing Hack
Matt Wolfe Intermediate 7mo ago
Multimodal and Multi-model AI in Action
Computer Vision
Multimodal and Multi-model AI in Action
Microsoft 365 Developer Beginner 7mo ago
InferenceJS: Real-time computer vision in your browser
Computer Vision
InferenceJS: Real-time computer vision in your browser
Chrome for Developers Intermediate 7mo ago
I Gave This Fish $10,000 to Trade Stocks
Computer Vision
I Gave This Fish $10,000 to Trade Stocks
Coding with Lewis Intermediate 7mo ago
YOLO26 Fine-Tuning | Detection and Instance Segmentation | Live Coding + Q&A (Jan 15th)
Computer Vision ⚡ AI Lesson
YOLO26 Fine-Tuning | Detection and Instance Segmentation | Live Coding + Q&A (Jan 15th)
Roboflow Advanced 5mo ago
Deploy Vision Models to NVIDIA Jetson Orin in Minutes | AI at the Edge
Computer Vision ⚡ AI Lesson
Deploy Vision Models to NVIDIA Jetson Orin in Minutes | AI at the Edge
Roboflow Beginner 6mo ago
Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning
Computer Vision
Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning
Analytics Vidhya Beginner 6mo ago
AI for Occupancy Analytics | Building a Smart Parking System
Computer Vision ⚡ AI Lesson
AI for Occupancy Analytics | Building a Smart Parking System
Roboflow Beginner 6mo ago
Roboflow Rapid Livestream | Use text prompts to train vision models
Computer Vision ⚡ AI Lesson
Roboflow Rapid Livestream | Use text prompts to train vision models
Roboflow Intermediate 6mo ago
Basketball AI: Player Tracking, Team Detection, and Number Recognition with Python
Computer Vision ⚡ AI Lesson
Basketball AI: Player Tracking, Team Detection, and Number Recognition with Python
Roboflow Advanced 6mo ago
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Computer Vision
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Muhammad Moin Beginner 6mo ago
I Took the Duolingo English Test and Here’s What Happened
Computer Vision
I Took the Duolingo English Test and Here’s What Happened
Teacher Luke - Duolingo English Test Beginner 6mo ago
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Computer Vision ⚡ AI Lesson
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Muhammad Moin Beginner 6mo ago
SAM 3: The AI That Lets You “Segment Anything” — Images, Videos & Concepts
Computer Vision
SAM 3: The AI That Lets You “Segment Anything” — Images, Videos & Concepts
Analytics Vidhya Intermediate 7mo ago
How to Deploy Vision AI Models in the Cloud | Serverless, Dedicated, Batch Processing
Computer Vision ⚡ AI Lesson
How to Deploy Vision AI Models in the Cloud | Serverless, Dedicated, Batch Processing
Roboflow Beginner 7mo ago
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Computer Vision ⚡ AI Lesson
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Roboflow Beginner 7mo ago
Segment Anything 3 (SAM 3): Text to Segmentation | Live Coding + Q&A (Nov 20th)
Computer Vision ⚡ AI Lesson
Segment Anything 3 (SAM 3): Text to Segmentation | Live Coding + Q&A (Nov 20th)
Roboflow Intermediate 7mo ago
📚 Continue on Coursera External links · Free to audit
1 / 3 View all →
Automating Image Processing
📚 External: Coursera ↗
Self-paced
Automating Image Processing
Opens on Coursera ↗
Infraestructura: Tecnologías Detrás de Recintos Inteligentes
📚 External: Coursera ↗
Self-paced
Infraestructura: Tecnologías Detrás de Recintos Inteligentes
Opens on Coursera ↗
Custom Document Extraction with Document AI Workbench
📚 External: Coursera ↗
Self-paced
Custom Document Extraction with Document AI Workbench
Opens on Coursera ↗
Customer Relationship Management
📚 External: Coursera ↗
Self-paced
Customer Relationship Management
Opens on Coursera ↗
Finanzas para directivos
📚 External: Coursera ↗
Self-paced
Finanzas para directivos
Opens on Coursera ↗
Using Specialized Processors with Document AI (Python)
📚 External: Coursera ↗
Self-paced
Using Specialized Processors with Document AI (Python)
Opens on Coursera ↗