Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539

lessons

Skills in this topic

3 skills — Sign in to track your progress

View full skill map →

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,145 Reads 394

Level: All Beginner Intermediate Advanced

Any Length Short (<5m) Medium (5-20m) Long (>20m)

Newest Popular Oldest

AI Traffic Camera Detects Speed & License Plates🚗

Computer Vision

AI Traffic Camera Detects Speed & License Plates🚗

Techie Sapien Intermediate 2d ago

How AI Builds Marketing Campaigns in Minutes (Not Days)

Computer Vision

How AI Builds Marketing Campaigns in Minutes (Not Days)

BugendaiTech Intermediate 6d ago

Why Selling to a Population is a Huge Mistake

Computer Vision

Why Selling to a Population is a Huge Mistake

Business Growth with Joe Intermediate 1w ago

How to build a custom vision agent

Computer Vision

How to build a custom vision agent

Google Cloud Tech Intermediate 1w ago

SPACEX la Mayor SALIDA a BOLSA Nunca Vista ¿Burbuja o Gran Oportunidad?

Computer Vision

SPACEX la Mayor SALIDA a BOLSA Nunca Vista ¿Burbuja o Gran Oportunidad?

El Banquero del Pueblo Intermediate 2w ago

AI Powered | Face Recognition @FameWorldEducationalHub #computereducation #facerecognition

Computer Vision

AI Powered | Face Recognition @FameWorldEducationalHub #computereducation #facerecognition

FAME WORLD EDUCATIONAL HUB Intermediate 2w ago

Student Team Designs Predictive AI System to Optimize Port Operations

Computer Vision

Student Team Designs Predictive AI System to Optimize Port Operations

Huawei Intermediate 2w ago

Walking the Fine Line Between YOLO Agents and Trust

Computer Vision

Walking the Fine Line Between YOLO Agents and Trust

Workday Intermediate 2w ago

Google Listens to Your Videos

Computer Vision

Google Listens to Your Videos

Ahrefs Intermediate 4w ago

Rafi Ibn Sultan - WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation..

Computer Vision

Rafi Ibn Sultan - WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation..

Cohere Intermediate 1mo ago

How Whering architects cost efficient multimodal AI apps

Computer Vision

How Whering architects cost efficient multimodal AI apps

Google Cloud Tech Intermediate 1mo ago

AI Dev 26 x SF | Ashwyn Sharma: Every App Needs a Voice UI. Here's How to Build It

Computer Vision

AI Dev 26 x SF | Ashwyn Sharma: Every App Needs a Voice UI. Here's How to Build It

DeepLearningAI Intermediate 1mo ago

[CVPR 2026]: RoMo: A Large-Scale Richly Organized Dataset and Semantic Taxonomy for Human Motion Gen

Computer Vision

[CVPR 2026]: RoMo: A Large-Scale Richly Organized Dataset and Semantic Taxonomy for Human Motion Gen

anucvml Intermediate 1mo ago

PipeGen Demo: Build End-to-End Edge AI Pipelines Automatically | CraftifAI

Computer Vision

PipeGen Demo: Build End-to-End Edge AI Pipelines Automatically | CraftifAI

CraftifAI Intermediate 1mo ago

Invertí en ACCIONES de Dividendo… y Descubrí un Problema Preocupante

Computer Vision

Invertí en ACCIONES de Dividendo… y Descubrí un Problema Preocupante

El Banquero del Pueblo Intermediate 1mo ago

AI Diaries Episode Multimodal Environmental Sensing for Smarter Cities

Computer Vision

AI Diaries Episode Multimodal Environmental Sensing for Smarter Cities

QuickTech Daily Intermediate 1mo ago

AI Diaries Episode Unified Multimodal Sensing for Smart Biotech Labs

Computer Vision

AI Diaries Episode Unified Multimodal Sensing for Smart Biotech Labs

QuickTech Daily Intermediate 1mo ago

Data is hungry for context

Computer Vision

Data is hungry for context

DeepLearningAI Intermediate 1mo ago

DGX Spark Live: NYC Spark Hack Winner feature - A 3D time machine for every building in NYC

Computer Vision

DGX Spark Live: NYC Spark Hack Winner feature - A 3D time machine for every building in NYC

NVIDIA Developer Intermediate 1mo ago

4 Retirement Income Strategies 💰

Computer Vision

4 Retirement Income Strategies 💰

Money Matters MD Intermediate 2mo ago

From Raw Video to Real Physics: The Google Cloud AI Breakdown

Computer Vision

From Raw Video to Real Physics: The Google Cloud AI Breakdown

Google Cloud Intermediate 2mo ago

He Leído 447 Libros: Estas 5 LECCIONES Impulsarán tu RIQUEZA

Computer Vision

He Leído 447 Libros: Estas 5 LECCIONES Impulsarán tu RIQUEZA

El Club de Inversión Intermediate 2mo ago

Turn Images into Insights with Vision Events

Computer Vision

Turn Images into Insights with Vision Events

Roboflow Intermediate 2mo ago

Real-Time Site Analysis: How to Build Custom Autodesk Forma Extensions

Computer Vision

Real-Time Site Analysis: How to Build Custom Autodesk Forma Extensions

Autodesk Developer Intermediate 2mo ago

Animating the Xenomorph in Alien: Isolation.

Computer Vision

Animating the Xenomorph in Alien: Isolation.

AI and Games Intermediate 2mo ago

The True Origin of Vision Transformers #ai #podcast

Computer Vision

The True Origin of Vision Transformers #ai #podcast

The MAD Podcast with Matt Turck Intermediate 2mo ago

How AI Vision Evolved | Merve Noyan

Computer Vision

How AI Vision Evolved | Merve Noyan

Hugging Face Intermediate 2mo ago

Build Your Own AI Virtual Mouse using Python & OpenCV

Computer Vision

Build Your Own AI Virtual Mouse using Python & OpenCV

REGITE Intermediate 2mo ago

Bird's Eye View Traffic Analysis with YOLO26

Computer Vision

Bird's Eye View Traffic Analysis with YOLO26

Muhammad Moin Intermediate 2mo ago

Alibaba właśnie ogłosiło Qwen3.5-Omni 🔥 AI które widzi, słyszy i mówi naraz

Computer Vision

Alibaba właśnie ogłosiło Qwen3.5-Omni 🔥 AI które widzi, słyszy i mówi naraz

Alchemicy AI Intermediate 3mo ago

How do you build AI products that people actually trust, use, and scale?

Computer Vision

How do you build AI products that people actually trust, use, and scale?

BetterTech Intermediate 3mo ago

✅Webinar — El Líder detrás del Prompt: En quién debes convertirte en la era de la IA.

Computer Vision

✅Webinar — El Líder detrás del Prompt: En quién debes convertirte en la era de la IA.

OKR University Intermediate 3mo ago

🎯 Multimodal AI in 2026: Images, Voice & Video in One Prompt

Computer Vision

🎯 Multimodal AI in 2026: Images, Voice & Video in One Prompt

Digitek Nova Intermediate 3mo ago

Mistral Small 4: One AI Model for Everything? 🤯

Computer Vision ⚡ AI Lesson

Mistral Small 4: One AI Model for Everything? 🤯

Analytics Vidhya Intermediate 3mo ago

Mistral Small 4 in 8 mins!

Computer Vision ⚡ AI Lesson

Mistral Small 4 in 8 mins!

1littlecoder Intermediate 3mo ago

Duolingo English Test 2026 - NEW Full Practice Test with Answers

Computer Vision

Duolingo English Test 2026 - NEW Full Practice Test with Answers

Teacher Luke - Duolingo English Test Intermediate 3mo ago

Deploy Edge AI: Setting Up GigE Cameras

Computer Vision ⚡ AI Lesson

Deploy Edge AI: Setting Up GigE Cameras

Roboflow Intermediate 4mo ago

PyTorch Day India 2026 Exploring Tile based Programming Abstractions for KLA’s Image Processing Work

Computer Vision ⚡ AI Lesson

PyTorch Day India 2026 Exploring Tile based Programming Abstractions for KLA’s Image Processing Work

PyTorch Intermediate 4mo ago

Interactive Speaking Course for 120+ | Duolingo English Test

Computer Vision

Interactive Speaking Course for 120+ | Duolingo English Test

Teacher Luke - Duolingo English Test Intermediate 4mo ago

Rethinking Enterprise Networking, Open Architecture, Managed Operations | Statice Tech

Computer Vision

Rethinking Enterprise Networking, Open Architecture, Managed Operations | Statice Tech

Statice Tech Intermediate 4mo ago

»are they, a/i cartographers, drunk?« »infamous!«

Computer Vision

»are they, a/i cartographers, drunk?« »infamous!«

dmn*1975.1945.1915 Intermediate 4mo ago

X88 Pro 10 TV Box as a distraction-free productivity device in 2026

Computer Vision

X88 Pro 10 TV Box as a distraction-free productivity device in 2026

Cade Edwards Intermediate 6mo ago

Mistral OCR 3 Deep Dive: Document AI Done Right

Computer Vision

Mistral OCR 3 Deep Dive: Document AI Done Right

DataCreator AI Intermediate 6mo ago

Una sola CUENTA para Acceder a las MEJORES OFERTAS de Ahorro | Raisin

Computer Vision

Una sola CUENTA para Acceder a las MEJORES OFERTAS de Ahorro | Raisin

El Banquero del Pueblo Intermediate 4mo ago

Multi-Object Tracking Made Easy | Trackers CLI + RF-DETR | Live Demo + Q&A (Feb 19th)

Computer Vision ⚡ AI Lesson

Multi-Object Tracking Made Easy | Trackers CLI + RF-DETR | Live Demo + Q&A (Feb 19th)

Roboflow Intermediate 4mo ago

RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)

Computer Vision

RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)

Roboflow Intermediate 5mo ago

Unlock data from your files with Agentic Document Extraction

Computer Vision

Unlock data from your files with Agentic Document Extraction

DeepLearningAI Intermediate 5mo ago

New course! Document AI: From OCR to Agentic Doc Extraction

Computer Vision

New course! Document AI: From OCR to Agentic Doc Extraction

DeepLearningAI Intermediate 5mo ago

📚 Continue on Coursera External links · Free to audit

View all →

📚 External: Coursera ↗

Intro to Operating Systems 2: Memory Management

Opens on Coursera ↗

Infraestructura: Tecnologías Detrás de Recintos Inteligentes

📚 External: Coursera ↗

Infraestructura: Tecnologías Detrás de Recintos Inteligentes

Opens on Coursera ↗

Low Code Image Segmentation

📚 External: Coursera ↗

Low Code Image Segmentation

Opens on Coursera ↗

Market Research Case Study: Apply & Analyze

📚 External: Coursera ↗

Market Research Case Study: Apply & Analyze

Opens on Coursera ↗

The Social Media Landscape

📚 External: Coursera ↗

The Social Media Landscape

Opens on Coursera ↗

Landing.AI for Beginners: Build Data Visualization AI Models

📚 External: Coursera ↗

Landing.AI for Beginners: Build Data Visualization AI Models

Opens on Coursera ↗

📚 External: Coursera ↗

Uptraining with Document AI Workbench

Opens on Coursera ↗

📚 External: Coursera ↗

Videojuegos: ¿de qué hablamos?

Opens on Coursera ↗

Create video, audio and infographics for online learning

📚 External: Coursera ↗

Create video, audio and infographics for online learning

Opens on Coursera ↗

📚 External: Coursera ↗

Using Specialized Processors with Document AI (Python)

Opens on Coursera ↗

Python Project: Software Engineering and Image Manipulation

📚 External: Coursera ↗

Python Project: Software Engineering and Image Manipulation

Opens on Coursera ↗

Análisis de fútbol aplicado - Mirando a los casos reales

📚 External: Coursera ↗

Análisis de fútbol aplicado - Mirando a los casos reales

Opens on Coursera ↗

📚 External: Coursera ↗

Process Documents with Python Using the Document AI API

Opens on Coursera ↗

AI Applications: Computer Vision and Speech Recognition

📚 External: Coursera ↗

AI Applications: Computer Vision and Speech Recognition

Opens on Coursera ↗

Aspectos conceptuales y operativos de la Telesalud

📚 External: Coursera ↗

Aspectos conceptuales y operativos de la Telesalud

Opens on Coursera ↗

📚 External: Coursera ↗

Build a DIY Multimodal Question Answering System with Vertex AI

Opens on Coursera ↗

Deep Learning for Object Detection

📚 External: Coursera ↗

Deep Learning for Object Detection

Opens on Coursera ↗

Artificial Vision for Textile quality control

📚 External: Coursera ↗

Artificial Vision for Textile quality control

Opens on Coursera ↗