Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,538

lessons

Skills in this topic

3 skills — Sign in to track your progress

View full skill map →

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,145 Reads 393

Level: All Beginner Intermediate Advanced

Any Length Short (<5m) Medium (5-20m) Long (>20m)

Newest Popular Oldest

How an Iris Recognition System works

Computer Vision

How an Iris Recognition System works

Academic Gain Tutorials Beginner 4d ago

How a Biometric Vault Access System works

Computer Vision

How a Biometric Vault Access System works

Academic Gain Tutorials Beginner 1w ago

De 5.000 a 160.000 usuarios | PROXUS

Computer Vision

De 5.000 a 160.000 usuarios | PROXUS

Itnig Beginner 1w ago

The Future of Multimodal Artificial Intelligence 🚀 #artificialintelligence #education #deeplearning

Computer Vision

The Future of Multimodal Artificial Intelligence 🚀 #artificialintelligence #education #deeplearning

Professor Rahul Jain Beginner 1w ago

How AI Turns Words Into Images — Text-to-Image Explained

Computer Vision

How AI Turns Words Into Images — Text-to-Image Explained

Practical AI Pro Beginner 1w ago

This Is What Happens When You CRUSH An AI Video Model

Computer Vision

This Is What Happens When You CRUSH An AI Video Model

Alex Ziskind Beginner 1w ago

Edge Multimodal Forecasting: Real-Time Disaster Insight at The Edge

Computer Vision

Edge Multimodal Forecasting: Real-Time Disaster Insight at The Edge

QuickTech Daily Beginner 2w ago

TCP b : Additive Increase Multiplicative Decrease & 'Slow Start' - Computerphile

Computer Vision

TCP b : Additive Increase Multiplicative Decrease & 'Slow Start' - Computerphile

Computerphile Beginner 2w ago

Social World Models

Computer Vision

Social World Models

Simons Institute for the Theory of Computing Beginner 2w ago

What's New on Everlaw October 29, 2025

Computer Vision

What's New on Everlaw October 29, 2025

Everlaw Beginner 2w ago

AI: YOLO for Routine, Not Critical Tasks #ai #podcast #futureofwork

Computer Vision

AI: YOLO for Routine, Not Critical Tasks #ai #podcast #futureofwork

Workday Beginner 2w ago

Manetho: AI-Powered Hieroglyphic Translation for Museums

Computer Vision

Manetho: AI-Powered Hieroglyphic Translation for Museums

Huawei Beginner 2w ago

Rigid adherence to “zero-opioid” targets may inadvertently introduce risks to patient safety.

Computer Vision

Rigid adherence to “zero-opioid” targets may inadvertently introduce risks to patient safety.

Anesthesia Patient Safety Foundation Beginner 3w ago

Are we creating new patient safety risks in the name of opioid reduction?

Computer Vision

Are we creating new patient safety risks in the name of opioid reduction?

Anesthesia Patient Safety Foundation Beginner 3w ago

Edge-Driven Multimodal Hypothesis Testing for Real-Time Research

Computer Vision

Edge-Driven Multimodal Hypothesis Testing for Real-Time Research

QuickTech Daily Beginner 4w ago

Stanford CME296 Diffusion & Large Vision Models | Spring 2026 | Lecture 7 - Evaluation

Computer Vision

Stanford CME296 Diffusion & Large Vision Models | Spring 2026 | Lecture 7 - Evaluation

Stanford Online Beginner 1mo ago

Neuralink's DJ Seo: Inside the Race to Connect Brains and AI

Computer Vision

Neuralink's DJ Seo: Inside the Race to Connect Brains and AI

Sequoia Capital Beginner 1mo ago

Fraud Is a Full Time Business: Inside the Organised Crime Stealing Crores From India | FWS 113

Computer Vision

Fraud Is a Full Time Business: Inside the Organised Crime Stealing Crores From India | FWS 113

Finance With Sharan Beginner 1mo ago

Track objects in video with SORT and OC-SORT

Computer Vision

Track objects in video with SORT and OC-SORT

Roboflow Beginner 1mo ago

Mira Murati’s Thinking Machines: The End of Turn-Based AI! 🤯

Computer Vision

Mira Murati’s Thinking Machines: The End of Turn-Based AI! 🤯

K-Transfer Beginner 1mo ago

This Startup Is Fixing India’s Construction Inefficiencies With AI | ICTDD2026 | RealtyNXT

Computer Vision

This Startup Is Fixing India’s Construction Inefficiencies With AI | ICTDD2026 | RealtyNXT

RealtyNXT Beginner 1mo ago

Build an AI Face Recognition Meme Matcher

Computer Vision

Build an AI Face Recognition Meme Matcher

DataCamp Beginner 1mo ago

Deploy NVIDIA Nemotron 3 Nano Omni on a Single NVIDIA H100: Video, Audio & Document AI

Computer Vision

Deploy NVIDIA Nemotron 3 Nano Omni on a Single NVIDIA H100: Video, Audio & Document AI

Hyperstack Beginner 1mo ago

Neural Architecture Search: Train the Right Vision Model for Your Hardware

Computer Vision

Neural Architecture Search: Train the Right Vision Model for Your Hardware

Roboflow Beginner 2mo ago

Top 5 Beginner Computer Vision Projects to Boost Your AI Portfolio

Computer Vision

Top 5 Beginner Computer Vision Projects to Boost Your AI Portfolio

Analytics Vidhya Beginner 2mo ago

₹12 Lakh AI Fellowship 😳 | Adobe Research 2026 (India) 🚀

Computer Vision

₹12 Lakh AI Fellowship 😳 | Adobe Research 2026 (India) 🚀

hackathonwalebhaiya Beginner 2mo ago

Roth vs Traditional 401(k): ¿Qué pasa si suben los impuestos?

Computer Vision

Roth vs Traditional 401(k): ¿Qué pasa si suben los impuestos?

Punto Base Beginner 2mo ago

Gemma, DeepMind's Family of Open Models — Omar Sanseviero, Google DeepMind

Computer Vision

Gemma, DeepMind's Family of Open Models — Omar Sanseviero, Google DeepMind

AI Engineer Beginner 2mo ago

Gemma 4 Vision Agent | Object Detection + VLM Pipeline

Computer Vision

Gemma 4 Vision Agent | Object Detection + VLM Pipeline

Prompt Engineering Beginner 2mo ago

Learn Drone Programming with Python – Tutorial

Computer Vision

Learn Drone Programming with Python – Tutorial

freeCodeCamp.org Beginner 2mo ago

De fundar Privalia a reinventar la construcción | 011h | #422

Computer Vision

De fundar Privalia a reinventar la construcción | 011h | #422

Itnig Beginner 2mo ago

Gemma 4 Explained: Google’s New Open-Source AI Models 🚀

Computer Vision

Gemma 4 Explained: Google’s New Open-Source AI Models 🚀

Analytics Vidhya Beginner 2mo ago

I Tried Gemma 4 + OpenClaw Locally… INSANE Results!

Computer Vision

I Tried Gemma 4 + OpenClaw Locally… INSANE Results!

Muhammad Moin Beginner 2mo ago

The Future of Vision in ML | Merve Noyan | HF Podcast #1

Computer Vision

The Future of Vision in ML | Merve Noyan | HF Podcast #1

Hugging Face Beginner 3mo ago

43 AI BASICS Benchmark datasets and leaderboards Part 1

Computer Vision

43 AI BASICS Benchmark datasets and leaderboards Part 1

Sinsavk AI for beginners Beginner 3mo ago

Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023

Computer Vision ⚡ AI Lesson

Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023

Moz Beginner 3mo ago

OpenClaw Explained: Create AI Agents Without Coding (Full Intro)

Computer Vision

OpenClaw Explained: Create AI Agents Without Coding (Full Intro)

Muhammad Moin Beginner 3mo ago

V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs

Computer Vision

V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs

AI Podcast Series. Byte Goose AI. Beginner 3mo ago

Jueves de Quack con Nerdearla

Computer Vision

Jueves de Quack con Nerdearla

GitHub Beginner 3mo ago

El Gran Colapso del 2028 | Lo Que Está Viendo Citrini Research

Computer Vision

El Gran Colapso del 2028 | Lo Que Está Viendo Citrini Research

Punto Base Beginner 3mo ago

What Is Multimodal AI? Real-World Examples

Computer Vision

What Is Multimodal AI? Real-World Examples

Coursera Beginner 3mo ago

IRPAPERS Explained!

Computer Vision ⚡ AI Lesson

IRPAPERS Explained!

Weaviate vector database Beginner 4mo ago

Music AI Sandbox | AI x Creativity: Wyclef Jean

Computer Vision ⚡ AI Lesson

Music AI Sandbox | AI x Creativity: Wyclef Jean

Google DeepMind Beginner 4mo ago

What is Machine Learning? 3 Types Explained Simply

Computer Vision

What is Machine Learning? 3 Types Explained Simply

NeuralKeith Beginner 4mo ago

How I Built an AI Guitar Teacher | Learn To Use AI with Live Video

Computer Vision

How I Built an AI Guitar Teacher | Learn To Use AI with Live Video

Roboflow Beginner 2mo ago

Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik

Computer Vision

Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik

Roboflow Beginner 3mo ago

Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step

Computer Vision

Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step

Muhammad Moin Beginner 3mo ago

Is YOLO26 Faster Than YOLO11? Full Comparison & Results

Computer Vision

Is YOLO26 Faster Than YOLO11? Full Comparison & Results

Muhammad Moin Beginner 3mo ago

📚 Continue on Coursera External links · Free to audit

View all →

📚 External: Coursera ↗

Classify Images of Clouds in the Cloud with AutoML Vision

Opens on Coursera ↗

AI and Disaster Management

📚 External: Coursera ↗

AI and Disaster Management

Opens on Coursera ↗

Introduction to Deep Learning for Computer Vision

📚 External: Coursera ↗

Introduction to Deep Learning for Computer Vision

Opens on Coursera ↗

📚 External: Coursera ↗

Marketing Communications: Intro to Consumer Behavior

Opens on Coursera ↗

📚 External: Coursera ↗

Create Image Captioning Models - Português Brasileiro

Opens on Coursera ↗

📚 External: Coursera ↗

Process Images, Create Captioning AI Models

Opens on Coursera ↗

📚 External: Coursera ↗

Uptraining with Document AI Workbench

Opens on Coursera ↗

Energía para campeones: Nutrición deportiva para el fútbol

📚 External: Coursera ↗

Energía para campeones: Nutrición deportiva para el fútbol

Opens on Coursera ↗

📚 External: Coursera ↗

Image and Video Processing: From Mars to Hollywood with a Stop at the Hospital

Opens on Coursera ↗

📚 External: Coursera ↗

Form Parsing Using Document AI

Opens on Coursera ↗

AI for Video Production

📚 External: Coursera ↗

AI for Video Production

Opens on Coursera ↗

📚 External: Coursera ↗

Process Images & Extract Motion Features

Opens on Coursera ↗

Applied Machine Learning: Techniques and Applications

📚 External: Coursera ↗

Applied Machine Learning: Techniques and Applications

Opens on Coursera ↗

Salesforce Data Cloud Mastery: Certified Consultant Skills Path

📚 External: Coursera ↗

Salesforce Data Cloud Mastery: Certified Consultant Skills Path

Opens on Coursera ↗

6G Evolution: Blockchain, Semantic Communications & Radar

📚 External: Coursera ↗

6G Evolution: Blockchain, Semantic Communications & Radar

Opens on Coursera ↗

Event Management and Promotion Strategies

📚 External: Coursera ↗

Event Management and Promotion Strategies

Opens on Coursera ↗

H2O Cloud AI Developer Services

📚 External: Coursera ↗

H2O Cloud AI Developer Services

Opens on Coursera ↗

Supply Market Analysis

📚 External: Coursera ↗

Supply Market Analysis

Opens on Coursera ↗