Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,538 lessons
Skills in this topic
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
How an Iris Recognition System works
Computer Vision
Academic Gain Tutorials Beginner 4d ago
How a Biometric Vault Access System works
Computer Vision
Academic Gain Tutorials Beginner 1w ago
De 5.000 a 160.000 usuarios | PROXUS
Computer Vision
Itnig Beginner 1w ago
The Future of Multimodal Artificial Intelligence 🚀 #artificialintelligence #education #deeplearning
Computer Vision
Professor Rahul Jain Beginner 1w ago
How AI Turns Words Into Images — Text-to-Image Explained
Computer Vision
Practical AI Pro Beginner 1w ago
This Is What Happens When You CRUSH An AI Video Model
Computer Vision
Alex Ziskind Beginner 1w ago
Edge Multimodal Forecasting: Real-Time Disaster Insight at The Edge
Computer Vision
QuickTech Daily Beginner 2w ago
TCP b : Additive Increase Multiplicative Decrease & 'Slow Start' - Computerphile
Computer Vision
Computerphile Beginner 2w ago
Social World Models
Computer Vision
Simons Institute for the Theory of Computing Beginner 2w ago
What's New on Everlaw October 29, 2025
Computer Vision
Everlaw Beginner 2w ago
AI: YOLO for Routine, Not Critical Tasks #ai #podcast #futureofwork
Computer Vision
Workday Beginner 2w ago
Manetho: AI-Powered Hieroglyphic Translation for Museums
Computer Vision
Huawei Beginner 2w ago
Rigid adherence to “zero-opioid” targets may inadvertently introduce risks to patient safety.
Computer Vision
Anesthesia Patient Safety Foundation Beginner 3w ago
Are we creating new patient safety risks in the name of opioid reduction?
Computer Vision
Anesthesia Patient Safety Foundation Beginner 3w ago
Edge-Driven Multimodal Hypothesis Testing for Real-Time Research
Computer Vision
QuickTech Daily Beginner 4w ago
Stanford CME296 Diffusion & Large Vision Models | Spring 2026 | Lecture 7 - Evaluation
Computer Vision
Stanford Online Beginner 1mo ago
Neuralink's DJ Seo: Inside the Race to Connect Brains and AI
Computer Vision
Sequoia Capital Beginner 1mo ago
Fraud Is a Full Time Business: Inside the Organised Crime Stealing Crores From India | FWS 113
Computer Vision
Finance With Sharan Beginner 1mo ago
Track objects in video with SORT and OC-SORT
Computer Vision
Roboflow Beginner 1mo ago
Mira Murati’s Thinking Machines: The End of Turn-Based AI! 🤯
Computer Vision
K-Transfer Beginner 1mo ago
This Startup Is Fixing India’s Construction Inefficiencies With AI | ICTDD2026 | RealtyNXT
Computer Vision
RealtyNXT Beginner 1mo ago
Build an AI Face Recognition Meme Matcher
Computer Vision
DataCamp Beginner 1mo ago
Deploy NVIDIA Nemotron 3 Nano Omni on a Single NVIDIA H100: Video, Audio & Document AI
Computer Vision
Hyperstack Beginner 1mo ago
Neural Architecture Search: Train the Right Vision Model for Your Hardware
Computer Vision
Roboflow Beginner 2mo ago
Top 5 Beginner Computer Vision Projects to Boost Your AI Portfolio
Computer Vision
Analytics Vidhya Beginner 2mo ago
₹12 Lakh AI Fellowship 😳 | Adobe Research 2026 (India) 🚀
Computer Vision
hackathonwalebhaiya Beginner 2mo ago
Roth vs Traditional 401(k): ¿Qué pasa si suben los impuestos?
Computer Vision
Punto Base Beginner 2mo ago
Gemma, DeepMind's Family of Open Models — Omar Sanseviero, Google DeepMind
Computer Vision
AI Engineer Beginner 2mo ago
Gemma 4 Vision Agent | Object Detection + VLM Pipeline
Computer Vision
Prompt Engineering Beginner 2mo ago
Learn Drone Programming with Python – Tutorial
Computer Vision
freeCodeCamp.org Beginner 2mo ago
De fundar Privalia a reinventar la construcción | 011h | #422
Computer Vision
Itnig Beginner 2mo ago
Gemma 4 Explained: Google’s New Open-Source AI Models 🚀
Computer Vision
Analytics Vidhya Beginner 2mo ago
I Tried Gemma 4 + OpenClaw Locally… INSANE Results!
Computer Vision
Muhammad Moin Beginner 2mo ago
The Future of Vision in ML | Merve Noyan | HF Podcast #1
Computer Vision
Hugging Face Beginner 3mo ago
43 AI BASICS Benchmark datasets and leaderboards Part 1
Computer Vision
Sinsavk AI for beginners Beginner 3mo ago
Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023
Computer Vision ⚡ AI Lesson
Moz Beginner 3mo ago
OpenClaw Explained: Create AI Agents Without Coding (Full Intro)
Computer Vision
Muhammad Moin Beginner 3mo ago
V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs
Computer Vision
AI Podcast Series. Byte Goose AI. Beginner 3mo ago
Jueves de Quack con Nerdearla
Computer Vision
GitHub Beginner 3mo ago
El Gran Colapso del 2028 | Lo Que Está Viendo Citrini Research
Computer Vision
Punto Base Beginner 3mo ago
What Is Multimodal AI? Real-World Examples
Computer Vision
Coursera Beginner 3mo ago
IRPAPERS Explained!
Computer Vision ⚡ AI Lesson
Weaviate vector database Beginner 4mo ago
Music AI Sandbox | AI x Creativity: Wyclef Jean
Computer Vision ⚡ AI Lesson
Google DeepMind Beginner 4mo ago
What is Machine Learning? 3 Types Explained Simply
Computer Vision
NeuralKeith Beginner 4mo ago
How I Built an AI Guitar Teacher | Learn To Use AI with Live Video
Computer Vision
Roboflow Beginner 2mo ago
Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik
Computer Vision
Roboflow Beginner 3mo ago
Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step
Computer Vision
Muhammad Moin Beginner 3mo ago
Is YOLO26 Faster Than YOLO11? Full Comparison & Results
Computer Vision
Muhammad Moin Beginner 3mo ago
📚 Continue on Coursera External links · Free to audit
Classify Images of Clouds in the Cloud with AutoML Vision
📚 External: Coursera ↗
Self-paced
AI and Disaster Management
📚 External: Coursera ↗
Self-paced
Introduction to Deep Learning for Computer Vision
📚 External: Coursera ↗
Self-paced
Marketing Communications: Intro to Consumer Behavior
📚 External: Coursera ↗
Self-paced
Create Image Captioning Models - Português Brasileiro
📚 External: Coursera ↗
Self-paced
Process Images, Create Captioning AI Models
📚 External: Coursera ↗
Self-paced