Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,538 lessons
Skills in this topic
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
AI Traffic Camera Detects Speed & License Plates🚗
Techie Sapien Intermediate 2d ago
How an Iris Recognition System works
Academic Gain Tutorials Beginner 4d ago
How AI Builds Marketing Campaigns in Minutes (Not Days)
BugendaiTech Intermediate 6d ago
How a Biometric Vault Access System works
Academic Gain Tutorials Beginner 1w ago
From 5,000 to 160,000 users | PROXUS
Itnig Beginner 1w ago
The Future of Multimodal Artificial Intelligence 🚀 #artificialintelligence #education #deeplearning
Professor Rahul Jain Beginner 1w ago
How AI Turns Words Into Images — Text-to-Image Explained
Practical AI Pro Beginner 1w ago
Why Selling to a Population is a Huge Mistake
Business Growth with Joe Intermediate 1w ago
This Is What Happens When You CRUSH An AI Video Model
Alex Ziskind Beginner 1w ago
How to build a custom vision agent
Google Cloud Tech Intermediate 1w ago
Edge Multimodal Forecasting: Real-Time Disaster Insight at The Edge
QuickTech Daily Beginner 2w ago
TCP b : Additive Increase Multiplicative Decrease & 'Slow Start' - Computerphile
Computerphile Beginner 2w ago
SPACEX, the Biggest IPO Ever Seen: Bubble or Great Opportunity?
El Banquero del Pueblo Intermediate 2w ago
Social World Models
Simons Institute for the Theory of Computing Beginner 2w ago
AI Diaries Episode Multimodal Drug Safety at the Edge
QuickTech Daily Advanced 2w ago
What's New on Everlaw October 29, 2025
Everlaw Beginner 2w ago
AI Powered | Face Recognition @FameWorldEducationalHub  #computereducation #facerecognition
FAME WORLD EDUCATIONAL HUB Intermediate 2w ago
Student Team Designs Predictive AI System to Optimize Port Operations
Huawei Intermediate 2w ago
Walking the Fine Line Between YOLO Agents and Trust
Workday Intermediate 2w ago
AI: YOLO for Routine, Not Critical Tasks  #ai #podcast #futureofwork
Workday Beginner 2w ago
Getac and the Future of Rugged Technology and the Deskless Workforce
Neil C. Hughes Advanced 2w ago
Manetho: AI-Powered Hieroglyphic Translation for Museums
Huawei Beginner 2w ago
Rigid adherence to “zero-opioid” targets may inadvertently introduce risks to patient safety.
Anesthesia Patient Safety Foundation Beginner 3w ago
Are we creating new patient safety risks in the name of opioid reduction?
Anesthesia Patient Safety Foundation Beginner 3w ago
Google Listens to Your Videos
Ahrefs Intermediate 4w ago
Rafi Ibn Sultan - WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation..
Cohere Intermediate 1mo ago
Stanford CME296 Diffusion & Large Vision Models | Spring 2026 | Lecture 7 - Evaluation
Stanford Online Beginner 1mo ago
Neuralink's DJ Seo: Inside the Race to Connect Brains and AI
Sequoia Capital Beginner 1mo ago
How Whering architects cost efficient multimodal AI apps
Google Cloud Tech Intermediate 1mo ago
Fraud Is a Full Time Business: Inside the Organised Crime Stealing Crores From India | FWS 113
Finance With Sharan Beginner 1mo ago
AI Dev 26 x SF | Ashwyn Sharma: Every App Needs a Voice UI. Here's How to Build It
DeepLearningAI Intermediate 1mo ago
Track objects in video with SORT and OC-SORT
Roboflow Beginner 1mo ago
[CVPR 2026]: RoMo: A Large-Scale Richly Organized Dataset and Semantic Taxonomy for Human Motion Gen
anucvml Intermediate 1mo ago
PipeGen Demo: Build End-to-End Edge AI Pipelines Automatically | CraftifAI
CraftifAI Intermediate 1mo ago
I Invested in Dividend STOCKS… and Discovered a Worrying Problem
El Banquero del Pueblo Intermediate 1mo ago
Data is hungry for context
DeepLearningAI Intermediate 1mo ago
Mira Murati’s Thinking Machines: The End of Turn-Based AI! 🤯
K-Transfer Beginner 1mo ago
This Startup Is Fixing India’s Construction Inefficiencies With AI | ICTDD2026 | RealtyNXT
RealtyNXT Beginner 1mo ago
KREA.AI: the AI startup with more than 30 million users | itnig podcast
Itnig Advanced 1mo ago
Build an AI Face Recognition Meme Matcher
DataCamp Beginner 1mo ago
Deploy NVIDIA Nemotron 3 Nano Omni on a Single NVIDIA H100: Video, Audio & Document AI
Hyperstack Beginner 1mo ago
DGX Spark Live:  NYC Spark Hack Winner feature - A 3D time machine for every building in NYC
NVIDIA Developer Intermediate 1mo ago
Neural Architecture Search: Train the Right Vision Model for Your Hardware
Roboflow Beginner 2mo ago
4 Retirement Income Strategies 💰
Money Matters MD Intermediate 2mo ago
Top 5 Beginner Computer Vision Projects to Boost Your AI Portfolio
Analytics Vidhya Beginner 2mo ago
Edge-Driven Multimodal Hypothesis Testing for Real-Time Research
QuickTech Daily Beginner 4w ago
AI Diaries Episode Multimodal Environmental Sensing for Smarter Cities
QuickTech Daily Intermediate 1mo ago
AI Diaries Episode Unified Multimodal Sensing for Smart Biotech Labs
QuickTech Daily Intermediate 1mo ago
📚 Continue on Coursera External links · Free to audit
Document AI: Project & API Writing
Self-paced
AI Infrastructure: Cloud GPUs
Self-paced
Packaging Materials
Self-paced
Business Economics and Game Theory for Decision Making
Self-paced
Brand Positioning and Marketing Strategy
Self-paced
Salesforce Data Cloud Mastery: Certified Consultant Skills Path
Self-paced