Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,333 lessons
Skills in this topic
CV Basics (beginner): Classify images with a pre-trained CNN
Modern CV Models (intermediate): Run YOLO for real-time object detection
Generative CV (advanced): Build a Stable Diffusion inference pipeline
Best Mac Mini Alternatives for Running OpenClaw 24/7 in 2026
Computer Vision
Tin Rovic Advanced 4h ago
How Transformers Finally Ate Vision – Isaac Robinson, Roboflow
Computer Vision
AI Engineer Beginner 6d ago
Build an AI Face Recognition Meme Matcher
Computer Vision
DataCamp Beginner 1w ago
FFmpeg: The Incredible Technology Behind Video on the Internet | Lex Fridman Podcast #496
Computer Vision
Lex Fridman Beginner 1w ago
DGX Spark Live: NYC Spark Hack Winner feature - A 3D time machine for every building in NYC
Computer Vision
NVIDIA Developer Intermediate 1w ago
Neural Architecture Search: Train the Right Vision Model for Your Hardware
Computer Vision
Roboflow Beginner 2w ago
Top 5 Beginner Computer Vision Projects to Boost Your AI Portfolio
Computer Vision
Analytics Vidhya Beginner 2w ago
From Raw Video to Real Physics: The Google Cloud AI Breakdown
Computer Vision
Google Cloud Intermediate 3w ago
Turn Images into Insights with Vision Events
Computer Vision
Roboflow Intermediate 3w ago
Gemma, DeepMind's Family of Open Models — Omar Sanseviero, Google DeepMind
Computer Vision
AI Engineer Beginner 3w ago
Why Legacy SIEM Models Are Struggling | Ali Ghodsi at RSAC 2026
Computer Vision
Databricks Advanced 1mo ago
Animating the Xenomorph in Alien: Isolation.
Computer Vision
AI and Games Intermediate 1mo ago
Learn Drone Programming with Python – Tutorial
Computer Vision
freeCodeCamp.org Beginner 1mo ago
The True Origin of Vision Transformers #ai #podcast
Computer Vision
The MAD Podcast with Matt Turck Intermediate 1mo ago
How AI Vision Evolved | Merve Noyan
Computer Vision
Hugging Face Intermediate 1mo ago
Yasser Benigmin - Domain Adaptation in the Era of Foundation Models
Computer Vision
Cohere Advanced 1mo ago
The Future of Vision in ML | Merve Noyan | HF Podcast #1
Computer Vision
Hugging Face Beginner 1mo ago
Quick Way to Improve your DET Writing Score! Duolingo English Test
Computer Vision
Teacher Luke - Duolingo English Test Intermediate 1mo ago
AI Powered Surveillance System for India
Computer Vision ⚡ AI Lesson
AI Anytime Beginner 1mo ago
Nvidia and Disney's Robotic Vision Has a Problem: The Real World │ Equity Podcast
Computer Vision ⚡ AI Lesson
TechCrunch Intermediate 1mo ago
Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023
Computer Vision ⚡ AI Lesson
Moz Beginner 1mo ago
OpenClaw Explained: Create AI Agents Without Coding (Full Intro)
Computer Vision
Muhammad Moin Beginner 1mo ago
V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs
Computer Vision
AI Podcast Series. Byte Goose AI. Beginner 1mo ago
Is Benjamin Netanyahu an AI clone?
Computer Vision ⚡ AI Lesson
The TensorFlow Channel Beginner 1mo ago
Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step
Computer Vision
Muhammad Moin Beginner 1mo ago
Mistral Small 4: One AI Model for Everything? 🤯
Computer Vision ⚡ AI Lesson
Analytics Vidhya Intermediate 1mo ago
Mistral Small 4 in 8 mins!
Computer Vision ⚡ AI Lesson
1littlecoder Intermediate 1mo ago
How Audi Uses AI to Transform Automotive Manufacturing at Scale | Amazon Web Services
Computer Vision ⚡ AI Lesson
Amazon Web Services Advanced 2mo ago
Jueves de Quack con Nerdearla
Computer Vision
GitHub Beginner 2mo ago
Duolingo English Test 2026 - NEW Full Practice Test with Answers
Computer Vision
Teacher Luke - Duolingo English Test Intermediate 2mo ago
Image Search Engine in Python - Multimodal Embeddings
Computer Vision ⚡ AI Lesson
NeuralNine Beginner 2mo ago
Pegasus by TwelveLabs #ai #video #patternrecognition #tech #explained #llm #imagerecognition
Computer Vision
Jessica Wang Beginner 2mo ago
What Is Multimodal AI? Real-World Examples
Computer Vision
Coursera Beginner 2mo ago
TensorFlow: Advanced Techniques Specialization
Computer Vision ⚡ AI Lesson
DeepLearning.AI Advanced 2mo ago
Music AI Sandbox | AI x Creativity: Wyclef Jean
Computer Vision ⚡ AI Lesson
Google DeepMind Beginner 2mo ago
PyTorch Day India 2026 Exploring Tile based Programming Abstractions for KLA’s Image Processing Work
Computer Vision ⚡ AI Lesson
PyTorch Intermediate 2mo ago
Ultimate Data Science API Testing Tool
Computer Vision ⚡ AI Lesson
Krish Naik Intermediate 3mo ago
Page Match lets you quickly sync your spot in a physical or ebook with an audiobook.
Computer Vision
The Verge Beginner 3mo ago
AI Guidance for Physical Work
Computer Vision
Y Combinator Advanced 3mo ago
An image is worth NxN words | Diffusion Transformers (ViT, DiT, MMDiT)
Computer Vision ⚡ AI Lesson
Julia Turc Beginner 3mo ago
The Hairy Ball Theorem
Computer Vision ⚡ AI Lesson
3Blue1Brown Intermediate 3mo ago
How I Built an AI Guitar Teacher | Learn To Use AI with Live Video
Computer Vision
Roboflow Beginner 1mo ago
Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik
Computer Vision
Roboflow Beginner 1mo ago
Is YOLO26 Faster Than YOLO11? Full Comparison & Results
Computer Vision
Muhammad Moin Beginner 2mo ago
Full Speaking Course 2026: Duolingo English Test
Computer Vision ⚡ AI Lesson
Teacher Luke - Duolingo English Test Beginner 2mo ago
One Open AI Model Built My Website, Image & Video
Computer Vision
Analytics Vidhya Beginner 2mo ago
Interactive Speaking Course for 120+ | Duolingo English Test
Computer Vision
Teacher Luke - Duolingo English Test Intermediate 3mo ago
Helping Sports Teams Improve Decision Making with AI: Interview with PlayVision's Marc Zoghby
Computer Vision ⚡ AI Lesson
Roboflow Beginner 3mo ago
📚 Coursera Courses · Opens on Coursera · Free to audit
The "Who" of the Marketing Strategy: Segmentation & Targeting
📚 Coursera Course ↗
Self-paced
Landing.AI for Beginners: Build Data Visualization AI Models
📚 Coursera Course ↗
Self-paced
Marketing Communications: Intro to Consumer Behavior
📚 Coursera Course ↗
Self-paced
AI for Video Production
📚 Coursera Course ↗
Self-paced
Create and Test a Document AI Processor
📚 Coursera Course ↗
Self-paced
Refine Segmentation: Boost Your AI Vision
📚 Coursera Course ↗
Self-paced