Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,538
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
₹12 Lakh AI Fellowship 😳 | Adobe Research 2026 (India) 🚀
Computer Vision
₹12 Lakh AI Fellowship 😳 | Adobe Research 2026 (India) 🚀
hackathonwalebhaiya Beginner 2mo ago
From Raw Video to Real Physics: The Google Cloud AI Breakdown
Computer Vision
From Raw Video to Real Physics: The Google Cloud AI Breakdown
Google Cloud Intermediate 2mo ago
He Leído 447 Libros: Estas 5 LECCIONES Impulsarán tu RIQUEZA
Computer Vision
He Leído 447 Libros: Estas 5 LECCIONES Impulsarán tu RIQUEZA
El Club de Inversión Intermediate 2mo ago
Turn Images into Insights with Vision Events
Computer Vision
Turn Images into Insights with Vision Events
Roboflow Intermediate 2mo ago
Roth vs Traditional 401(k): ¿Qué pasa si suben los impuestos?
Computer Vision
Roth vs Traditional 401(k): ¿Qué pasa si suben los impuestos?
Punto Base Beginner 2mo ago
Gemma, DeepMind's Family of Open Models — Omar Sanseviero, Google DeepMind
Computer Vision
Gemma, DeepMind's Family of Open Models — Omar Sanseviero, Google DeepMind
AI Engineer Beginner 2mo ago
When Your Car Can Reason: An Inside Look at BADAS-Reason Technology. V-JEPA2 and Physical Causality.
Computer Vision
When Your Car Can Reason: An Inside Look at BADAS-Reason Technology. V-JEPA2 and Physical Causality.
Byte Goose AI. Advanced 2mo ago
Real-Time Site Analysis: How to Build Custom Autodesk Forma Extensions
Computer Vision
Real-Time Site Analysis: How to Build Custom Autodesk Forma Extensions
Autodesk Developer Intermediate 2mo ago
Why Legacy SIEM Models Are Struggling | Ali Ghodsi at RSAC 2026
Computer Vision
Why Legacy SIEM Models Are Struggling | Ali Ghodsi at RSAC 2026
Databricks Advanced 2mo ago
Animating the Xenomorph in Alien: Isolation.
Computer Vision
Animating the Xenomorph in Alien: Isolation.
AI and Games Intermediate 2mo ago
From Vision Encoders to Perception Encoders: How Meta's EUPE Perception Encoder Beats the AI Giants.
Computer Vision
From Vision Encoders to Perception Encoders: How Meta's EUPE Perception Encoder Beats the AI Giants.
Byte Goose AI. Advanced 2mo ago
How I Built an AI Guitar Teacher | Learn To Use AI with Live Video
Computer Vision
How I Built an AI Guitar Teacher | Learn To Use AI with Live Video
Roboflow Beginner 2mo ago
Gemma 4 Vision Agent | Object Detection + VLM Pipeline
Computer Vision
Gemma 4 Vision Agent | Object Detection + VLM Pipeline
Prompt Engineering Beginner 2mo ago
Learn Drone Programming with Python – Tutorial
Computer Vision
Learn Drone Programming with Python – Tutorial
freeCodeCamp.org Beginner 2mo ago
The True Origin of Vision Transformers #ai #podcast
Computer Vision
The True Origin of Vision Transformers #ai #podcast
The MAD Podcast with Matt Turck Intermediate 2mo ago
De fundar Privalia a reinventar la construcción | 011h | #422
Computer Vision
De fundar Privalia a reinventar la construcción | 011h | #422
Itnig Beginner 2mo ago
Gemma 4 Explained: Google’s New Open-Source AI Models 🚀
Computer Vision
Gemma 4 Explained: Google’s New Open-Source AI Models 🚀
Analytics Vidhya Beginner 2mo ago
How AI Vision Evolved | Merve Noyan
Computer Vision
How AI Vision Evolved | Merve Noyan
Hugging Face Intermediate 2mo ago
I Tried Gemma 4 + OpenClaw Locally… INSANE Results!
Computer Vision
I Tried Gemma 4 + OpenClaw Locally… INSANE Results!
Muhammad Moin Beginner 2mo ago
Build Your Own AI Virtual Mouse using Python & OpenCV
Computer Vision
Build Your Own AI Virtual Mouse using Python & OpenCV
REGITE Intermediate 2mo ago
Bird's Eye View Traffic Analysis with YOLO26
Computer Vision
Bird's Eye View Traffic Analysis with YOLO26
Muhammad Moin Intermediate 2mo ago
Alibaba właśnie ogłosiło Qwen3.5-Omni 🔥 AI które widzi, słyszy i mówi naraz
Computer Vision
Alibaba właśnie ogłosiło Qwen3.5-Omni 🔥 AI które widzi, słyszy i mówi naraz
Alchemicy AI Intermediate 3mo ago
How do you build AI products that people actually trust, use, and scale?
Computer Vision
How do you build AI products that people actually trust, use, and scale?
BetterTech Intermediate 3mo ago
Yasser Benigmin - Domain Adaptation in the Era of Foundation Models
Computer Vision
Yasser Benigmin - Domain Adaptation in the Era of Foundation Models
Cohere Advanced 3mo ago
The Future of Vision in ML | Merve Noyan | HF Podcast #1
Computer Vision
The Future of Vision in ML | Merve Noyan | HF Podcast #1
Hugging Face Beginner 3mo ago
✅Webinar — El Líder detrás del Prompt: En quién debes convertirte en la era de la IA.
Computer Vision
✅Webinar — El Líder detrás del Prompt: En quién debes convertirte en la era de la IA.
OKR University Intermediate 3mo ago
🎯 Multimodal AI in 2026: Images, Voice & Video in One Prompt
Computer Vision
🎯 Multimodal AI in 2026: Images, Voice & Video in One Prompt
Digitek Nova Intermediate 3mo ago
China’s Secret Combat Robot Revealed at Lunar New Year Gala!
Computer Vision
China’s Secret Combat Robot Revealed at Lunar New Year Gala!
Technology Now Advanced 3mo ago
43 AI BASICS Benchmark datasets and leaderboards Part 1
Computer Vision
43 AI BASICS Benchmark datasets and leaderboards Part 1
Sinsavk AI for beginners Beginner 3mo ago
Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023
Computer Vision ⚡ AI Lesson
Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023
Moz Beginner 3mo ago
V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs
Computer Vision
V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs
AI Podcast Series. Byte Goose AI. Beginner 3mo ago
Mistral Small 4: One AI Model for Everything? 🤯
Computer Vision ⚡ AI Lesson
Mistral Small 4: One AI Model for Everything? 🤯
Analytics Vidhya Intermediate 3mo ago
Mistral Small 4 in 8 mins!
Computer Vision ⚡ AI Lesson
Mistral Small 4 in 8 mins!
1littlecoder Intermediate 3mo ago
Jueves de Quack con Nerdearla
Computer Vision
Jueves de Quack con Nerdearla
GitHub Beginner 3mo ago
Duolingo English Test 2026 - NEW Full Practice Test with Answers
Computer Vision
Duolingo English Test 2026 - NEW Full Practice Test with Answers
Teacher Luke - Duolingo English Test Intermediate 3mo ago
El Gran Colapso del 2028 | Lo Que Está Viendo Citrini Research
Computer Vision
El Gran Colapso del 2028 | Lo Que Está Viendo Citrini Research
Punto Base Beginner 3mo ago
What Is Multimodal AI? Real-World Examples
Computer Vision
What Is Multimodal AI? Real-World Examples
Coursera Beginner 3mo ago
Una sola CUENTA para Acceder a las MEJORES OFERTAS de Ahorro | Raisin
Computer Vision
Una sola CUENTA para Acceder a las MEJORES OFERTAS de Ahorro | Raisin
El Banquero del Pueblo Intermediate 4mo ago
TensorFlow: Advanced Techniques Specialization
Computer Vision ⚡ AI Lesson
TensorFlow: Advanced Techniques Specialization
DeepLearning.AI Advanced 4mo ago
IRPAPERS Explained!
Computer Vision ⚡ AI Lesson
IRPAPERS Explained!
Weaviate vector database Beginner 4mo ago
Music AI Sandbox | AI x Creativity: Wyclef Jean
Computer Vision ⚡ AI Lesson
Music AI Sandbox | AI x Creativity: Wyclef Jean
Google DeepMind Beginner 4mo ago
What is Machine Learning? 3 Types Explained Simply
Computer Vision
What is Machine Learning? 3 Types Explained Simply
NeuralKeith Beginner 4mo ago
Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik
Computer Vision
Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik
Roboflow Beginner 3mo ago
OpenClaw Explained: Create AI Agents Without Coding (Full Intro)
Computer Vision
OpenClaw Explained: Create AI Agents Without Coding (Full Intro)
Muhammad Moin Beginner 3mo ago
Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step
Computer Vision
Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step
Muhammad Moin Beginner 3mo ago
Is YOLO26 Faster Than YOLO11? Full Comparison & Results
Computer Vision
Is YOLO26 Faster Than YOLO11? Full Comparison & Results
Muhammad Moin Beginner 3mo ago
Deploy Edge AI: Setting Up GigE Cameras
Computer Vision ⚡ AI Lesson
Deploy Edge AI: Setting Up GigE Cameras
Roboflow Intermediate 4mo ago
Multi-Object Tracking Made Easy | Trackers CLI + RF-DETR | Live Demo + Q&A (Feb 19th)
Computer Vision ⚡ AI Lesson
Multi-Object Tracking Made Easy | Trackers CLI + RF-DETR | Live Demo + Q&A (Feb 19th)
Roboflow Intermediate 4mo ago
📚 Continue on Coursera External links · Free to audit
1 / 3 View all →
Rural Marketing: Segmentation & Consumer Insights
📚 External: Coursera ↗
Self-paced
Rural Marketing: Segmentation & Consumer Insights
Opens on Coursera ↗
Build a DIY Multimodal Question Answering System with Vertex AI
📚 External: Coursera ↗
Self-paced
Build a DIY Multimodal Question Answering System with Vertex AI
Opens on Coursera ↗
Future of data and technology in football
📚 External: Coursera ↗
Self-paced
Future of data and technology in football
Opens on Coursera ↗
Artificial Vision for Textile quality control
📚 External: Coursera ↗
Self-paced
Artificial Vision for Textile quality control
Opens on Coursera ↗
Introduction to Computer Vision and Image Processing
📚 External: Coursera ↗
Self-paced
Introduction to Computer Vision and Image Processing
Opens on Coursera ↗
Automating Image Processing
📚 External: Coursera ↗
Self-paced
Automating Image Processing
Opens on Coursera ↗