Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,538

lessons

Skills in this topic

3 skills — Sign in to track your progress

View full skill map →

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,145 Reads 393

Level: All Beginner Intermediate Advanced

Any Length Short (<5m) Medium (5-20m) Long (>20m)

Newest Popular Oldest

₹12 Lakh AI Fellowship 😳 | Adobe Research 2026 (India) 🚀

Computer Vision

₹12 Lakh AI Fellowship 😳 | Adobe Research 2026 (India) 🚀

hackathonwalebhaiya Beginner 2mo ago

From Raw Video to Real Physics: The Google Cloud AI Breakdown

Computer Vision

From Raw Video to Real Physics: The Google Cloud AI Breakdown

Google Cloud Intermediate 2mo ago

He Leído 447 Libros: Estas 5 LECCIONES Impulsarán tu RIQUEZA

Computer Vision

He Leído 447 Libros: Estas 5 LECCIONES Impulsarán tu RIQUEZA

El Club de Inversión Intermediate 2mo ago

Turn Images into Insights with Vision Events

Computer Vision

Turn Images into Insights with Vision Events

Roboflow Intermediate 2mo ago

Roth vs Traditional 401(k): ¿Qué pasa si suben los impuestos?

Computer Vision

Roth vs Traditional 401(k): ¿Qué pasa si suben los impuestos?

Punto Base Beginner 2mo ago

Gemma, DeepMind's Family of Open Models — Omar Sanseviero, Google DeepMind

Computer Vision

Gemma, DeepMind's Family of Open Models — Omar Sanseviero, Google DeepMind

AI Engineer Beginner 2mo ago

When Your Car Can Reason: An Inside Look at BADAS-Reason Technology. V-JEPA2 and Physical Causality.

Computer Vision

When Your Car Can Reason: An Inside Look at BADAS-Reason Technology. V-JEPA2 and Physical Causality.

Byte Goose AI. Advanced 2mo ago

Real-Time Site Analysis: How to Build Custom Autodesk Forma Extensions

Computer Vision

Real-Time Site Analysis: How to Build Custom Autodesk Forma Extensions

Autodesk Developer Intermediate 2mo ago

Why Legacy SIEM Models Are Struggling | Ali Ghodsi at RSAC 2026

Computer Vision

Why Legacy SIEM Models Are Struggling | Ali Ghodsi at RSAC 2026

Databricks Advanced 2mo ago

Animating the Xenomorph in Alien: Isolation.

Computer Vision

Animating the Xenomorph in Alien: Isolation.

AI and Games Intermediate 2mo ago

From Vision Encoders to Perception Encoders: How Meta's EUPE Perception Encoder Beats the AI Giants.

Computer Vision

From Vision Encoders to Perception Encoders: How Meta's EUPE Perception Encoder Beats the AI Giants.

Byte Goose AI. Advanced 2mo ago

How I Built an AI Guitar Teacher | Learn To Use AI with Live Video

Computer Vision

How I Built an AI Guitar Teacher | Learn To Use AI with Live Video

Roboflow Beginner 2mo ago

Gemma 4 Vision Agent | Object Detection + VLM Pipeline

Computer Vision

Gemma 4 Vision Agent | Object Detection + VLM Pipeline

Prompt Engineering Beginner 2mo ago

Learn Drone Programming with Python – Tutorial

Computer Vision

Learn Drone Programming with Python – Tutorial

freeCodeCamp.org Beginner 2mo ago

The True Origin of Vision Transformers #ai #podcast

Computer Vision

The True Origin of Vision Transformers #ai #podcast

The MAD Podcast with Matt Turck Intermediate 2mo ago

De fundar Privalia a reinventar la construcción | 011h | #422

Computer Vision

De fundar Privalia a reinventar la construcción | 011h | #422

Itnig Beginner 2mo ago

Gemma 4 Explained: Google’s New Open-Source AI Models 🚀

Computer Vision

Gemma 4 Explained: Google’s New Open-Source AI Models 🚀

Analytics Vidhya Beginner 2mo ago

How AI Vision Evolved | Merve Noyan

Computer Vision

How AI Vision Evolved | Merve Noyan

Hugging Face Intermediate 2mo ago

I Tried Gemma 4 + OpenClaw Locally… INSANE Results!

Computer Vision

I Tried Gemma 4 + OpenClaw Locally… INSANE Results!

Muhammad Moin Beginner 2mo ago

Build Your Own AI Virtual Mouse using Python & OpenCV

Computer Vision

Build Your Own AI Virtual Mouse using Python & OpenCV

REGITE Intermediate 2mo ago

Bird's Eye View Traffic Analysis with YOLO26

Computer Vision

Bird's Eye View Traffic Analysis with YOLO26

Muhammad Moin Intermediate 2mo ago

Alibaba właśnie ogłosiło Qwen3.5-Omni 🔥 AI które widzi, słyszy i mówi naraz

Computer Vision

Alibaba właśnie ogłosiło Qwen3.5-Omni 🔥 AI które widzi, słyszy i mówi naraz

Alchemicy AI Intermediate 3mo ago

How do you build AI products that people actually trust, use, and scale?

Computer Vision

How do you build AI products that people actually trust, use, and scale?

BetterTech Intermediate 3mo ago

Yasser Benigmin - Domain Adaptation in the Era of Foundation Models

Computer Vision

Yasser Benigmin - Domain Adaptation in the Era of Foundation Models

Cohere Advanced 3mo ago

The Future of Vision in ML | Merve Noyan | HF Podcast #1

Computer Vision

The Future of Vision in ML | Merve Noyan | HF Podcast #1

Hugging Face Beginner 3mo ago

✅Webinar — El Líder detrás del Prompt: En quién debes convertirte en la era de la IA.

Computer Vision

✅Webinar — El Líder detrás del Prompt: En quién debes convertirte en la era de la IA.

OKR University Intermediate 3mo ago

🎯 Multimodal AI in 2026: Images, Voice & Video in One Prompt

Computer Vision

🎯 Multimodal AI in 2026: Images, Voice & Video in One Prompt

Digitek Nova Intermediate 3mo ago

China’s Secret Combat Robot Revealed at Lunar New Year Gala!

Computer Vision

China’s Secret Combat Robot Revealed at Lunar New Year Gala!

Technology Now Advanced 3mo ago

43 AI BASICS Benchmark datasets and leaderboards Part 1

Computer Vision

43 AI BASICS Benchmark datasets and leaderboards Part 1

Sinsavk AI for beginners Beginner 3mo ago

Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023

Computer Vision ⚡ AI Lesson

Why SEOs Need To Start Playing Offense Instead Of Defense by Chris Long | MozCon 2023

Moz Beginner 3mo ago

V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs

Computer Vision

V-JEPA 2.1 Explained: Dense Predictive Loss and Multi-Modal Tokenization. V-JEPA World Models. EBMs

AI Podcast Series. Byte Goose AI. Beginner 3mo ago

Mistral Small 4: One AI Model for Everything? 🤯

Computer Vision ⚡ AI Lesson

Mistral Small 4: One AI Model for Everything? 🤯

Analytics Vidhya Intermediate 3mo ago

Mistral Small 4 in 8 mins!

Computer Vision ⚡ AI Lesson

Mistral Small 4 in 8 mins!

1littlecoder Intermediate 3mo ago

Jueves de Quack con Nerdearla

Computer Vision

Jueves de Quack con Nerdearla

GitHub Beginner 3mo ago

Duolingo English Test 2026 - NEW Full Practice Test with Answers

Computer Vision

Duolingo English Test 2026 - NEW Full Practice Test with Answers

Teacher Luke - Duolingo English Test Intermediate 3mo ago

El Gran Colapso del 2028 | Lo Que Está Viendo Citrini Research

Computer Vision

El Gran Colapso del 2028 | Lo Que Está Viendo Citrini Research

Punto Base Beginner 3mo ago

What Is Multimodal AI? Real-World Examples

Computer Vision

What Is Multimodal AI? Real-World Examples

Coursera Beginner 3mo ago

Una sola CUENTA para Acceder a las MEJORES OFERTAS de Ahorro | Raisin

Computer Vision

Una sola CUENTA para Acceder a las MEJORES OFERTAS de Ahorro | Raisin

El Banquero del Pueblo Intermediate 4mo ago

TensorFlow: Advanced Techniques Specialization

Computer Vision ⚡ AI Lesson

TensorFlow: Advanced Techniques Specialization

DeepLearning.AI Advanced 4mo ago

IRPAPERS Explained!

Computer Vision ⚡ AI Lesson

IRPAPERS Explained!

Weaviate vector database Beginner 4mo ago

Music AI Sandbox | AI x Creativity: Wyclef Jean

Computer Vision ⚡ AI Lesson

Music AI Sandbox | AI x Creativity: Wyclef Jean

Google DeepMind Beginner 4mo ago

What is Machine Learning? 3 Types Explained Simply

Computer Vision

What is Machine Learning? 3 Types Explained Simply

NeuralKeith Beginner 4mo ago

Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik

Computer Vision

Bringing Visual Intelligence to AMRs: Peer Robotic’s Vishrut Kaushik

Roboflow Beginner 3mo ago

OpenClaw Explained: Create AI Agents Without Coding (Full Intro)

Computer Vision

OpenClaw Explained: Create AI Agents Without Coding (Full Intro)

Muhammad Moin Beginner 3mo ago

Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step

Computer Vision

Build a WhatsApp AI Agent (Auto Replies) Using OpenClaw – Step-by-Step

Muhammad Moin Beginner 3mo ago

Is YOLO26 Faster Than YOLO11? Full Comparison & Results

Computer Vision

Is YOLO26 Faster Than YOLO11? Full Comparison & Results

Muhammad Moin Beginner 3mo ago

Deploy Edge AI: Setting Up GigE Cameras

Computer Vision ⚡ AI Lesson

Deploy Edge AI: Setting Up GigE Cameras

Roboflow Intermediate 4mo ago

Multi-Object Tracking Made Easy | Trackers CLI + RF-DETR | Live Demo + Q&A (Feb 19th)

Computer Vision ⚡ AI Lesson

Multi-Object Tracking Made Easy | Trackers CLI + RF-DETR | Live Demo + Q&A (Feb 19th)

Roboflow Intermediate 4mo ago

📚 Continue on Coursera External links · Free to audit

View all →

Rural Marketing: Segmentation & Consumer Insights

📚 External: Coursera ↗

Rural Marketing: Segmentation & Consumer Insights

Opens on Coursera ↗

📚 External: Coursera ↗

Build a DIY Multimodal Question Answering System with Vertex AI

Opens on Coursera ↗

Future of data and technology in football

📚 External: Coursera ↗

Future of data and technology in football

Opens on Coursera ↗

Artificial Vision for Textile quality control

📚 External: Coursera ↗

Artificial Vision for Textile quality control

Opens on Coursera ↗

Introduction to Computer Vision and Image Processing

📚 External: Coursera ↗

Introduction to Computer Vision and Image Processing

Opens on Coursera ↗

📚 External: Coursera ↗

Automating Image Processing

Opens on Coursera ↗

📚 External: Coursera ↗

Create Image Captioning Models - Português Brasileiro

Opens on Coursera ↗

Introduction to Transformer Models for NLP: Unit 3

📚 External: Coursera ↗

Introduction to Transformer Models for NLP: Unit 3

Opens on Coursera ↗

📚 External: Coursera ↗

Form Parsing with Document AI (Python)

Opens on Coursera ↗

The "Who" of the Marketing Strategy:Segmentation & Targeting

📚 External: Coursera ↗

The "Who" of the Marketing Strategy:Segmentation & Targeting

Opens on Coursera ↗

Low Code Image Segmentation

📚 External: Coursera ↗

Low Code Image Segmentation

Opens on Coursera ↗

Market Research Case Study: Apply & Analyze

📚 External: Coursera ↗

Market Research Case Study: Apply & Analyze

Opens on Coursera ↗

Global Marketing and International Trade Strategies

📚 External: Coursera ↗

Global Marketing and International Trade Strategies

Opens on Coursera ↗

📚 External: Coursera ↗

Image and Video Processing: From Mars to Hollywood with a Stop at the Hospital

Opens on Coursera ↗

📚 External: Coursera ↗

Classify Images of Clouds in the Cloud with AutoML Vision

Opens on Coursera ↗

Energía para campeones: Nutrición deportiva para el fútbol

📚 External: Coursera ↗

Energía para campeones: Nutrición deportiva para el fútbol

Opens on Coursera ↗

Introduction to Deep Learning for Computer Vision

📚 External: Coursera ↗

Introduction to Deep Learning for Computer Vision

Opens on Coursera ↗

📚 External: Coursera ↗

Infraestructura de IA: GPU de Cloud

Opens on Coursera ↗