Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,333
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)
Computer Vision
RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)
Roboflow Intermediate 3mo ago
Full Duolingo English Test with Answers: January 2026 Format
Computer Vision ⚡ AI Lesson
Full Duolingo English Test with Answers: January 2026 Format
Teacher Luke - Duolingo English Test Beginner 3mo ago
Unlock data from your files with Agentic Document Extraction
Computer Vision
Unlock data from your files with Agentic Document Extraction
DeepLearningAI Intermediate 3mo ago
YOLO26 Fine-Tuning | Detection and Instance Segmentation | Live Coding + Q&A (Jan 15th)
Computer Vision ⚡ AI Lesson
YOLO26 Fine-Tuning | Detection and Instance Segmentation | Live Coding + Q&A (Jan 15th)
Roboflow Advanced 3mo ago
New course! Document AI: From OCR to Agentic Doc Extraction
Computer Vision
New course! Document AI: From OCR to Agentic Doc Extraction
DeepLearningAI Intermediate 4mo ago
Artem Sevastopolsky and Dmitrii Pozdeev - DenseMarks  Learning Canonical Embeddings for Human Heads
Computer Vision ⚡ AI Lesson
Artem Sevastopolsky and Dmitrii Pozdeev - DenseMarks Learning Canonical Embeddings for Human Heads
Cohere Beginner 4mo ago
On-Device AI Just Leveled Up: Liquid AI’s LFM-2.5 Explained
Computer Vision
On-Device AI Just Leveled Up: Liquid AI’s LFM-2.5 Explained
Analytics Vidhya Beginner 4mo ago
Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models
Computer Vision
Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models
Microsoft Research Advanced 4mo ago
I Became "Radicalized" About AI
Computer Vision ⚡ AI Lesson
I Became "Radicalized" About AI
Ken Jee Intermediate 4mo ago
Mistral OCR 3 Deep Dive: Document AI Done Right
Computer Vision
Mistral OCR 3 Deep Dive: Document AI Done Right
DataCreator AI Intermediate 4mo ago
Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning
Computer Vision
Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning
Analytics Vidhya Beginner 4mo ago
Anthony Fuller & Yousef Yassin - LookWhere? Efficient Visual Recognition by Learning Where to Look
Computer Vision ⚡ AI Lesson
Anthony Fuller & Yousef Yassin - LookWhere? Efficient Visual Recognition by Learning Where to Look
Cohere Advanced 4mo ago
The Next Frontier of AI: Real-Time Multimodal Decision Making
Computer Vision
The Next Frontier of AI: Real-Time Multimodal Decision Making
The Information Intermediate 4mo ago
What does AI mean for education?
Computer Vision ⚡ AI Lesson
What does AI mean for education?
Anthropic Beginner 4mo ago
Are Humanoid Robots Actually Coming to Your Home? | Nikolaus, Rerun
Computer Vision
Are Humanoid Robots Actually Coming to Your Home? | Nikolaus, Rerun
Weights & Biases Intermediate 4mo ago
AI Paradox: Use Text for Logic, Avatars for Meaning
Computer Vision
AI Paradox: Use Text for Logic, Avatars for Meaning
Discover AI Intermediate 5mo ago
Why Vision Language Models Ignore What They See [Munawar Hayat] - 758
Computer Vision ⚡ AI Lesson
Why Vision Language Models Ignore What They See [Munawar Hayat] - 758
TWIML AI Podcast Beginner 5mo ago
PixelTable: Revolutionizing Multimodal AI Development Simplified #shorts #youtube
Computer Vision ⚡ AI Lesson
PixelTable: Revolutionizing Multimodal AI Development Simplified #shorts #youtube
AI Anytime Intermediate 5mo ago
Grounding DINO: Open Vocabulary Object Detection on Videos
Computer Vision
Grounding DINO: Open Vocabulary Object Detection on Videos
PyImageSearch Intermediate 5mo ago
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Computer Vision ⚡ AI Lesson
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Muhammad Moin Beginner 5mo ago
Insane Results with YOLOv8 & YOLO11 — Detection, Segmentation, Pose & More!
Computer Vision
Insane Results with YOLOv8 & YOLO11 — Detection, Segmentation, Pose & More!
Muhammad Moin Intermediate 5mo ago
I Took the Duolingo English Test and Here’s What Happened
Computer Vision
I Took the Duolingo English Test and Here’s What Happened
Teacher Luke - Duolingo English Test Beginner 5mo ago
Gemini 3 Demo: Building a Music Rhythm Game with Computer Vision
Computer Vision
Gemini 3 Demo: Building a Music Rhythm Game with Computer Vision
Google for Developers Intermediate 5mo ago
Why are Transformers replacing CNNs?
Computer Vision
Why are Transformers replacing CNNs?
Julia Turc Beginner 5mo ago
Should AI be introduced to kids early?  #podcast #interview
Computer Vision ⚡ AI Lesson
Should AI be introduced to kids early? #podcast #interview
Abhishek Thakur Beginner 5mo ago
Stanford Robotics Seminar ENGR319 | Autumn 2025 | General Compliant Robot Interaction
Computer Vision ⚡ AI Lesson
Stanford Robotics Seminar ENGR319 | Autumn 2025 | General Compliant Robot Interaction
Stanford Online Intermediate 5mo ago
AI Video Editing Hack
Computer Vision ⚡ AI Lesson
AI Video Editing Hack
Matt Wolfe Intermediate 5mo ago
Multimodal and Multi-model AI in Action
Computer Vision
Multimodal and Multi-model AI in Action
Microsoft 365 Developer Beginner 5mo ago
InferenceJS: Real-time computer vision in your browser
Computer Vision
InferenceJS: Real-time computer vision in your browser
Chrome for Developers Intermediate 5mo ago
I Gave This Fish $10,000 to Trade Stocks
Computer Vision
I Gave This Fish $10,000 to Trade Stocks
Coding with Lewis Intermediate 5mo ago
A no nonsense intro to BM25
Computer Vision ⚡ AI Lesson
A no nonsense intro to BM25
Abhishek Thakur Beginner 5mo ago
Getting Started With Transformers for Computer Vision - Divya Swaminathan & Tony Reina
Computer Vision ⚡ AI Lesson
Getting Started With Transformers for Computer Vision - Divya Swaminathan & Tony Reina
PyData Beginner 5mo ago
Basic Network Segmentation
Computer Vision ⚡ AI Lesson
Basic Network Segmentation
John Hammond Intermediate 6mo ago
AI for Occupancy Analytics | Building a Smart Parking System
Computer Vision ⚡ AI Lesson
AI for Occupancy Analytics | Building a Smart Parking System
Roboflow Beginner 5mo ago
Roboflow Rapid Livestream | Use text prompts to train vision models
Computer Vision ⚡ AI Lesson
Roboflow Rapid Livestream | Use text prompts to train vision models
Roboflow Intermediate 5mo ago
Basketball AI: Player Tracking, Team Detection, and Number Recognition with Python
Computer Vision ⚡ AI Lesson
Basketball AI: Player Tracking, Team Detection, and Number Recognition with Python
Roboflow Advanced 5mo ago
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Computer Vision
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Muhammad Moin Beginner 5mo ago
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Computer Vision ⚡ AI Lesson
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Muhammad Moin Beginner 5mo ago
SAM 3: The AI That Lets You “Segment Anything” — Images, Videos & Concepts
Computer Vision
SAM 3: The AI That Lets You “Segment Anything” — Images, Videos & Concepts
Analytics Vidhya Intermediate 5mo ago
Duolingo Test SPEAKING Practice! Interactive Speaking - 7 Questions & Answers
Computer Vision
Duolingo Test SPEAKING Practice! Interactive Speaking - 7 Questions & Answers
Teacher Luke - Duolingo English Test Intermediate 5mo ago
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Computer Vision ⚡ AI Lesson
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Roboflow Beginner 5mo ago
Segment Anything 3 (SAM 3): Text to Segmentation | Live Coding + Q&A (Nov 20th)
Computer Vision ⚡ AI Lesson
Segment Anything 3 (SAM 3): Text to Segmentation | Live Coding + Q&A (Nov 20th)
Roboflow Intermediate 5mo ago
Use this Template for Speak About the Photo + 10 Practice Questions | Duolingo English Test
Computer Vision
Use this Template for Speak About the Photo + 10 Practice Questions | Duolingo English Test
Teacher Luke - Duolingo English Test Intermediate 5mo ago
The biggest mistake companies make deploying AI  #podcast #interview #dataanalysis #ai #datascience
Computer Vision ⚡ AI Lesson
The biggest mistake companies make deploying AI #podcast #interview #dataanalysis #ai #datascience
Abhishek Thakur Intermediate 5mo ago
Demystifying AI & Data Science (w/ Luca Massaron) 📱
Computer Vision ⚡ AI Lesson
Demystifying AI & Data Science (w/ Luca Massaron) 📱
Abhishek Thakur Intermediate 6mo ago
Demystifying AI & Data Science (w/ Luca Massaron)
Computer Vision ⚡ AI Lesson
Demystifying AI & Data Science (w/ Luca Massaron)
Abhishek Thakur Intermediate 6mo ago
Vibe + VSCode + Codex = Search UI
Computer Vision ⚡ AI Lesson
Vibe + VSCode + Codex = Search UI
Abhishek Thakur Beginner 6mo ago
Validate Actions with Vision AI | Building a Web App for Real-Time Drinking Detection
Computer Vision ⚡ AI Lesson
Validate Actions with Vision AI | Building a Web App for Real-Time Drinking Detection
Roboflow Beginner 6mo ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Build a DIY Multimodal Question Answering System with Vertex AI
📚 Coursera Course ↗
Self-paced
Build a DIY Multimodal Question Answering System with Vertex AI
Opens on Coursera ↗
Autoscaling TensorFlow Model Deployments with TF Serving and Kubernetes
📚 Coursera Course ↗
Self-paced
Autoscaling TensorFlow Model Deployments with TF Serving and Kubernetes
Opens on Coursera ↗
Multimodal Literacies: Communication and Learning in the Era of Digital Media
📚 Coursera Course ↗
Self-paced
Multimodal Literacies: Communication and Learning in the Era of Digital Media
Opens on Coursera ↗
Explore LiDAR in 3D
📚 Coursera Course ↗
Self-paced
Explore LiDAR in 3D
Opens on Coursera ↗
Build Real-Time Face Recognition with OpenCV
📚 Coursera Course ↗
Self-paced
Build Real-Time Face Recognition with OpenCV
Opens on Coursera ↗
Business Economics and Game Theory for Decision Making
📚 Coursera Course ↗
Self-paced
Business Economics and Game Theory for Decision Making
Opens on Coursera ↗