Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,333
lessons
Skills in this topic
View full skill map โ†’
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
DGX Spark Live:  NYC Spark Hack Winner feature - A 3D time machine for every building in NYC
Computer Vision
DGX Spark Live: NYC Spark Hack Winner feature - A 3D time machine for every building in NYC
NVIDIA Developer Intermediate 1w ago
From Raw Video to Real Physics: The Google Cloud AI Breakdown
Computer Vision
From Raw Video to Real Physics: The Google Cloud AI Breakdown
Google Cloud Intermediate 3w ago
Turn Images into Insights with Vision Events
Computer Vision
Turn Images into Insights with Vision Events
Roboflow Intermediate 3w ago
Animating the Xenomorph in Alien: Isolation.
Computer Vision
Animating the Xenomorph in Alien: Isolation.
AI and Games Intermediate 1mo ago
The True Origin of Vision Transformers #ai #podcast
Computer Vision
The True Origin of Vision Transformers #ai #podcast
The MAD Podcast with Matt Turck Intermediate 1mo ago
How AI Vision Evolved | Merve Noyan
Computer Vision
How AI Vision Evolved | Merve Noyan
Hugging Face Intermediate 1mo ago
Quick Way to Improve your DET Writing Score! Duolingo English Test
Computer Vision
Quick Way to Improve your DET Writing Score! Duolingo English Test
Teacher Luke - Duolingo English Test Intermediate 1mo ago
Nvidia and Disney's Robotic Vision Has a Problem: The Real World โ”‚ Equity Podcast
Computer Vision โšก AI Lesson
Nvidia and Disney's Robotic Vision Has a Problem: The Real World โ”‚ Equity Podcast
TechCrunch Intermediate 1mo ago
Mistral Small 4: One AI Model for Everything? ๐Ÿคฏ
Computer Vision โšก AI Lesson
Mistral Small 4: One AI Model for Everything? ๐Ÿคฏ
Analytics Vidhya Intermediate 1mo ago
Mistral Small 4 in 8 mins!
Computer Vision โšก AI Lesson
Mistral Small 4 in 8 mins!
1littlecoder Intermediate 1mo ago
Duolingo English Test 2026 - NEW Full Practice Test with Answers
Computer Vision
Duolingo English Test 2026 - NEW Full Practice Test with Answers
Teacher Luke - Duolingo English Test Intermediate 2mo ago
PyTorch Day India 2026 Exploring Tile based Programming Abstractions for KLAโ€™s Image Processing Work
Computer Vision โšก AI Lesson
PyTorch Day India 2026 Exploring Tile based Programming Abstractions for KLAโ€™s Image Processing Work
PyTorch Intermediate 2mo ago
Ultimate Data Science API Testing Tool
Computer Vision โšก AI Lesson
Ultimate Data Science API Testing Tool
Krish Naik Intermediate 3mo ago
The Hairy Ball Theorem
Computer Vision โšก AI Lesson
The Hairy Ball Theorem
3Blue1Brown Intermediate 3mo ago
RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)
Computer Vision
RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)
Roboflow Intermediate 3mo ago
Unlock data from your files with Agentic Document Extraction
Computer Vision
Unlock data from your files with Agentic Document Extraction
DeepLearningAI Intermediate 3mo ago
New course! Document AI: From OCR to Agentic Doc Extraction
Computer Vision
New course! Document AI: From OCR to Agentic Doc Extraction
DeepLearningAI Intermediate 4mo ago
I Became "Radicalized" About AI
Computer Vision โšก AI Lesson
I Became "Radicalized" About AI
Ken Jee Intermediate 4mo ago
Mistral OCR 3 Deep Dive: Document AI Done Right
Computer Vision
Mistral OCR 3 Deep Dive: Document AI Done Right
DataCreator AI Intermediate 4mo ago
The Next Frontier of AI: Real-Time Multimodal Decision Making
Computer Vision
The Next Frontier of AI: Real-Time Multimodal Decision Making
The Information Intermediate 4mo ago
Are Humanoid Robots Actually Coming to Your Home? | Nikolaus, Rerun
Computer Vision
Are Humanoid Robots Actually Coming to Your Home? | Nikolaus, Rerun
Weights & Biases Intermediate 4mo ago
AI Paradox: Use Text for Logic, Avatars for Meaning
Computer Vision
AI Paradox: Use Text for Logic, Avatars for Meaning
Discover AI Intermediate 5mo ago
PixelTable: Revolutionizing Multimodal AI Development Simplified #shorts #youtube
Computer Vision โšก AI Lesson
PixelTable: Revolutionizing Multimodal AI Development Simplified #shorts #youtube
AI Anytime Intermediate 5mo ago
Grounding DINO: Open Vocabulary Object Detection on Videos
Computer Vision
Grounding DINO: Open Vocabulary Object Detection on Videos
PyImageSearch Intermediate 5mo ago
Insane Results with YOLOv8 & YOLO11 โ€” Detection, Segmentation, Pose & More!
Computer Vision
Insane Results with YOLOv8 & YOLO11 โ€” Detection, Segmentation, Pose & More!
Muhammad Moin Intermediate 5mo ago
Gemini 3 Demo: Building a Music Rhythm Game with Computer Vision
Computer Vision
Gemini 3 Demo: Building a Music Rhythm Game with Computer Vision
Google for Developers Intermediate 5mo ago
SAM 3: The AI That Lets You โ€œSegment Anythingโ€ โ€” Images, Videos & Concepts
Computer Vision
SAM 3: The AI That Lets You โ€œSegment Anythingโ€ โ€” Images, Videos & Concepts
Analytics Vidhya Intermediate 5mo ago
Stanford Robotics Seminar ENGR319 | Autumn 2025 | General Compliant Robot Interaction
Computer Vision โšก AI Lesson
Stanford Robotics Seminar ENGR319 | Autumn 2025 | General Compliant Robot Interaction
Stanford Online Intermediate 5mo ago
AI Video Editing Hack
Computer Vision โšก AI Lesson
AI Video Editing Hack
Matt Wolfe Intermediate 5mo ago
InferenceJS: Real-time computer vision in your browser
Computer Vision
InferenceJS: Real-time computer vision in your browser
Chrome for Developers Intermediate 5mo ago
I Gave This Fish $10,000 to Trade Stocks
Computer Vision
I Gave This Fish $10,000 to Trade Stocks
Coding with Lewis Intermediate 5mo ago
The biggest mistake companies make deploying AI  #podcast #interview #dataanalysis #ai #datascience
Computer Vision โšก AI Lesson
The biggest mistake companies make deploying AI #podcast #interview #dataanalysis #ai #datascience
Abhishek Thakur Intermediate 5mo ago
Basic Network Segmentation
Computer Vision โšก AI Lesson
Basic Network Segmentation
John Hammond Intermediate 6mo ago
Demystifying AI & Data Science (w/ Luca Massaron) ๐Ÿ“ฑ
Computer Vision โšก AI Lesson
Demystifying AI & Data Science (w/ Luca Massaron) ๐Ÿ“ฑ
Abhishek Thakur Intermediate 6mo ago
Build a RAG Application from Scratch โ€” No LangChain, No LlamaIndex
Computer Vision
Build a RAG Application from Scratch โ€” No LangChain, No LlamaIndex
Muhammad Moin Intermediate 6mo ago
As we outsource more to smart home gadgets, have we thought about how weโ€™d react in their place?
Computer Vision
As we outsource more to smart home gadgets, have we thought about how weโ€™d react in their place?
The Verge Intermediate 6mo ago
Real Time AI Video Object Tracking! ๐Ÿ’ฅEdgeTAM - Sam 2 for On-Device ๐Ÿ”ฅ
Computer Vision
Real Time AI Video Object Tracking! ๐Ÿ’ฅEdgeTAM - Sam 2 for On-Device ๐Ÿ”ฅ
1littlecoder Intermediate 6mo ago
How to Create a Profitable Paid Search Strategy for 2026
Computer Vision
How to Create a Profitable Paid Search Strategy for 2026
Exposure Ninja Intermediate 6mo ago
Where Hazel is at and what we've been up to // October 2025 Hazel Dev Log
Computer Vision โšก AI Lesson
Where Hazel is at and what we've been up to // October 2025 Hazel Dev Log
The Cherno Intermediate 6mo ago
Multimodal Data Analysis with AI
Computer Vision โšก AI Lesson
Multimodal Data Analysis with AI
Latent Space Intermediate 6mo ago
Generate Image Captions That Focus on What You Need
Computer Vision โšก AI Lesson
Generate Image Captions That Focus on What You Need
NVIDIA Developer Intermediate 6mo ago
Interactive Speaking Course for 120+ | Duolingo English Test
Computer Vision
Interactive Speaking Course for 120+ | Duolingo English Test
Teacher Luke - Duolingo English Test Intermediate 3mo ago
Roboflow Rapid Livestream | Use text prompts to train vision models
Computer Vision โšก AI Lesson
Roboflow Rapid Livestream | Use text prompts to train vision models
Roboflow Intermediate 5mo ago
Duolingo Test SPEAKING Practice! Interactive Speaking - 7 Questions & Answers
Computer Vision
Duolingo Test SPEAKING Practice! Interactive Speaking - 7 Questions & Answers
Teacher Luke - Duolingo English Test Intermediate 5mo ago
Segment Anything 3 (SAM 3): Text to Segmentation | Live Coding + Q&A (Nov 20th)
Computer Vision โšก AI Lesson
Segment Anything 3 (SAM 3): Text to Segmentation | Live Coding + Q&A (Nov 20th)
Roboflow Intermediate 5mo ago
Use this Template for Speak About the Photo + 10 Practice Questions | Duolingo English Test
Computer Vision
Use this Template for Speak About the Photo + 10 Practice Questions | Duolingo English Test
Teacher Luke - Duolingo English Test Intermediate 5mo ago
Demystifying AI & Data Science (w/ Luca Massaron)
Computer Vision โšก AI Lesson
Demystifying AI & Data Science (w/ Luca Massaron)
Abhishek Thakur Intermediate 6mo ago
How to Stay Relevant in AI & Data Science (w/ Alexey Grigorev)
Computer Vision
How to Stay Relevant in AI & Data Science (w/ Alexey Grigorev)
Abhishek Thakur Intermediate 6mo ago
๐Ÿ“š Coursera Courses Opens on Coursera ยท Free to audit
1 / 3 View all โ†’
Running Distributed TensorFlow using Vertex AI
๐Ÿ“š Coursera Course โ†—
Self-paced
Running Distributed TensorFlow using Vertex AI
Opens on Coursera โ†—
AI and Disaster Management
๐Ÿ“š Coursera Course โ†—
Self-paced
AI and Disaster Management
Opens on Coursera โ†—
Traitement d'images : segmentation et caractรฉrisation
๐Ÿ“š Coursera Course โ†—
Self-paced
Traitement d'images : segmentation et caractรฉrisation
Opens on Coursera โ†—
Future of data and technology in football
๐Ÿ“š Coursera Course โ†—
Self-paced
Future of data and technology in football
Opens on Coursera โ†—
CompTIA Cloud CV0-003: Unit 3
๐Ÿ“š Coursera Course โ†—
Self-paced
CompTIA Cloud CV0-003: Unit 3
Opens on Coursera โ†—
H2O Cloud AI Developer Services
๐Ÿ“š Coursera Course โ†—
Self-paced
H2O Cloud AI Developer Services
Opens on Coursera โ†—