Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,541

lessons

Skills in this topic

3 skills — Sign in to track your progress

View full skill map →

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,145 Reads 396

Level: All Beginner Intermediate Advanced

Any Length Short (<5m) Medium (5-20m) Long (>20m)

Newest Popular Oldest

AI Traffic Camera Detects Speed & License Plates🚗

Computer Vision

AI Traffic Camera Detects Speed & License Plates🚗

Techie Sapien Intermediate 2d ago

How an Iris Recognition System works

Computer Vision

How an Iris Recognition System works

Academic Gain Tutorials Beginner 4d ago

How AI Builds Marketing Campaigns in Minutes (Not Days)

Computer Vision

How AI Builds Marketing Campaigns in Minutes (Not Days)

BugendaiTech Intermediate 1w ago

How a Biometric Vault Access System works

Computer Vision

How a Biometric Vault Access System works

Academic Gain Tutorials Beginner 1w ago

De 5.000 a 160.000 usuarios | PROXUS

Computer Vision

De 5.000 a 160.000 usuarios | PROXUS

Itnig Beginner 1w ago

The Future of Multimodal Artificial Intelligence 🚀 #artificialintelligence #education #deeplearning

Computer Vision

The Future of Multimodal Artificial Intelligence 🚀 #artificialintelligence #education #deeplearning

Professor Rahul Jain Beginner 1w ago

How AI Turns Words Into Images — Text-to-Image Explained

Computer Vision

How AI Turns Words Into Images — Text-to-Image Explained

Practical AI Pro Beginner 1w ago

Why Selling to a Population is a Huge Mistake

Computer Vision

Why Selling to a Population is a Huge Mistake

Business Growth with Joe Intermediate 1w ago

This Is What Happens When You CRUSH An AI Video Model

Computer Vision

This Is What Happens When You CRUSH An AI Video Model

Alex Ziskind Beginner 1w ago

How to build a custom vision agent

Computer Vision

How to build a custom vision agent

Google Cloud Tech Intermediate 1w ago

Edge Multimodal Forecasting: Real-Time Disaster Insight at The Edge

Computer Vision

Edge Multimodal Forecasting: Real-Time Disaster Insight at The Edge

QuickTech Daily Beginner 2w ago

TCP b : Additive Increase Multiplicative Decrease & 'Slow Start' - Computerphile

Computer Vision

TCP b : Additive Increase Multiplicative Decrease & 'Slow Start' - Computerphile

Computerphile Beginner 2w ago

SPACEX la Mayor SALIDA a BOLSA Nunca Vista ¿Burbuja o Gran Oportunidad?

Computer Vision

SPACEX la Mayor SALIDA a BOLSA Nunca Vista ¿Burbuja o Gran Oportunidad?

El Banquero del Pueblo Intermediate 2w ago

Social World Models

Computer Vision

Social World Models

Simons Institute for the Theory of Computing Beginner 2w ago

AI Diaries Episode Multimodal Drug Safety at the Edge

Computer Vision

AI Diaries Episode Multimodal Drug Safety at the Edge

QuickTech Daily Advanced 2w ago

What's New on Everlaw October 29, 2025

Computer Vision

What's New on Everlaw October 29, 2025

Everlaw Beginner 2w ago

AI Powered | Face Recognition @FameWorldEducationalHub #computereducation #facerecognition

Computer Vision

AI Powered | Face Recognition @FameWorldEducationalHub #computereducation #facerecognition

FAME WORLD EDUCATIONAL HUB Intermediate 2w ago

Student Team Designs Predictive AI System to Optimize Port Operations

Computer Vision

Student Team Designs Predictive AI System to Optimize Port Operations

Huawei Intermediate 2w ago

Walking the Fine Line Between YOLO Agents and Trust

Computer Vision

Walking the Fine Line Between YOLO Agents and Trust

Workday Intermediate 2w ago

AI: YOLO for Routine, Not Critical Tasks #ai #podcast #futureofwork

Computer Vision

AI: YOLO for Routine, Not Critical Tasks #ai #podcast #futureofwork

Workday Beginner 2w ago

Getac and the Future of Rugged Technology and the Deskless Workforce

Computer Vision

Getac and the Future of Rugged Technology and the Deskless Workforce

Neil C. Hughes Advanced 2w ago

Manetho: AI-Powered Hieroglyphic Translation for Museums

Computer Vision

Manetho: AI-Powered Hieroglyphic Translation for Museums

Huawei Beginner 2w ago

Rigid adherence to “zero-opioid” targets may inadvertently introduce risks to patient safety.

Computer Vision

Rigid adherence to “zero-opioid” targets may inadvertently introduce risks to patient safety.

Anesthesia Patient Safety Foundation Beginner 3w ago

Are we creating new patient safety risks in the name of opioid reduction?

Computer Vision

Are we creating new patient safety risks in the name of opioid reduction?

Anesthesia Patient Safety Foundation Beginner 3w ago

Google Listens to Your Videos

Computer Vision

Google Listens to Your Videos

Ahrefs Intermediate 4w ago

Rafi Ibn Sultan - WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation..

Computer Vision

Rafi Ibn Sultan - WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation..

Cohere Intermediate 1mo ago

Stanford CME296 Diffusion & Large Vision Models | Spring 2026 | Lecture 7 - Evaluation

Computer Vision

Stanford CME296 Diffusion & Large Vision Models | Spring 2026 | Lecture 7 - Evaluation

Stanford Online Beginner 1mo ago

Neuralink's DJ Seo: Inside the Race to Connect Brains and AI

Computer Vision

Neuralink's DJ Seo: Inside the Race to Connect Brains and AI

Sequoia Capital Beginner 1mo ago

How Whering architects cost efficient multimodal AI apps

Computer Vision

How Whering architects cost efficient multimodal AI apps

Google Cloud Tech Intermediate 1mo ago

Fraud Is a Full Time Business: Inside the Organised Crime Stealing Crores From India | FWS 113

Computer Vision

Fraud Is a Full Time Business: Inside the Organised Crime Stealing Crores From India | FWS 113

Finance With Sharan Beginner 1mo ago

AI Dev 26 x SF | Ashwyn Sharma: Every App Needs a Voice UI. Here's How to Build It

Computer Vision

AI Dev 26 x SF | Ashwyn Sharma: Every App Needs a Voice UI. Here's How to Build It

DeepLearningAI Intermediate 1mo ago

Track objects in video with SORT and OC-SORT

Computer Vision

Track objects in video with SORT and OC-SORT

Roboflow Beginner 1mo ago

[CVPR 2026]: RoMo: A Large-Scale Richly Organized Dataset and Semantic Taxonomy for Human Motion Gen

Computer Vision

[CVPR 2026]: RoMo: A Large-Scale Richly Organized Dataset and Semantic Taxonomy for Human Motion Gen

anucvml Intermediate 1mo ago

PipeGen Demo: Build End-to-End Edge AI Pipelines Automatically | CraftifAI

Computer Vision

PipeGen Demo: Build End-to-End Edge AI Pipelines Automatically | CraftifAI

CraftifAI Intermediate 1mo ago

Invertí en ACCIONES de Dividendo… y Descubrí un Problema Preocupante

Computer Vision

Invertí en ACCIONES de Dividendo… y Descubrí un Problema Preocupante

El Banquero del Pueblo Intermediate 1mo ago

Data is hungry for context

Computer Vision

Data is hungry for context

DeepLearningAI Intermediate 1mo ago

Mira Murati’s Thinking Machines: The End of Turn-Based AI! 🤯

Computer Vision

Mira Murati’s Thinking Machines: The End of Turn-Based AI! 🤯

K-Transfer Beginner 1mo ago

This Startup Is Fixing India’s Construction Inefficiencies With AI | ICTDD2026 | RealtyNXT

Computer Vision

This Startup Is Fixing India’s Construction Inefficiencies With AI | ICTDD2026 | RealtyNXT

RealtyNXT Beginner 1mo ago

KREA.AI: la startup de IA con más de 30 millones de usuarios | itnig podcast

Computer Vision

KREA.AI: la startup de IA con más de 30 millones de usuarios | itnig podcast

Itnig Advanced 1mo ago

Build an AI Face Recognition Meme Matcher

Computer Vision

Build an AI Face Recognition Meme Matcher

DataCamp Beginner 1mo ago

Deploy NVIDIA Nemotron 3 Nano Omni on a Single NVIDIA H100: Video, Audio & Document AI

Computer Vision

Deploy NVIDIA Nemotron 3 Nano Omni on a Single NVIDIA H100: Video, Audio & Document AI

Hyperstack Beginner 1mo ago

DGX Spark Live: NYC Spark Hack Winner feature - A 3D time machine for every building in NYC

Computer Vision

DGX Spark Live: NYC Spark Hack Winner feature - A 3D time machine for every building in NYC

NVIDIA Developer Intermediate 1mo ago

Neural Architecture Search: Train the Right Vision Model for Your Hardware

Computer Vision

Neural Architecture Search: Train the Right Vision Model for Your Hardware

Roboflow Beginner 2mo ago

4 Retirement Income Strategies 💰

Computer Vision

4 Retirement Income Strategies 💰

Money Matters MD Intermediate 2mo ago

Top 5 Beginner Computer Vision Projects to Boost Your AI Portfolio

Computer Vision

Top 5 Beginner Computer Vision Projects to Boost Your AI Portfolio

Analytics Vidhya Beginner 2mo ago

Edge-Driven Multimodal Hypothesis Testing for Real-Time Research

Computer Vision

Edge-Driven Multimodal Hypothesis Testing for Real-Time Research

QuickTech Daily Beginner 1mo ago

AI Diaries Episode Multimodal Environmental Sensing for Smarter Cities

Computer Vision

AI Diaries Episode Multimodal Environmental Sensing for Smarter Cities

QuickTech Daily Intermediate 1mo ago

AI Diaries Episode Unified Multimodal Sensing for Smart Biotech Labs

Computer Vision

AI Diaries Episode Unified Multimodal Sensing for Smart Biotech Labs

QuickTech Daily Intermediate 1mo ago

📚 Continue on Coursera External links · Free to audit

View all →

Networking and Security Architecture with VMware NSX

📚 External: Coursera ↗

Networking and Security Architecture with VMware NSX

Opens on Coursera ↗

📚 External: Coursera ↗

Create Image Captioning Models - Português Brasileiro

Opens on Coursera ↗

Entendiendo la depresión a lo largo del ciclo vital

📚 External: Coursera ↗

Entendiendo la depresión a lo largo del ciclo vital

Opens on Coursera ↗

📚 External: Coursera ↗

Create and Test a Document AI Processor

Opens on Coursera ↗

Azure Practical - Cognitive Services

📚 External: Coursera ↗

Azure Practical - Cognitive Services

Opens on Coursera ↗

Marketing Fundamentals Mastery: Apply, Analyze & Evaluate

📚 External: Coursera ↗

Marketing Fundamentals Mastery: Apply, Analyze & Evaluate

Opens on Coursera ↗

Advanced Algorithms and Complexity

📚 External: Coursera ↗

Advanced Algorithms and Complexity

Opens on Coursera ↗

Analyze Video Data Using OpenCV and Python

📚 External: Coursera ↗

Analyze Video Data Using OpenCV and Python

Opens on Coursera ↗

6G Evolution: Blockchain, Semantic Communications & Radar

📚 External: Coursera ↗

6G Evolution: Blockchain, Semantic Communications & Radar

Opens on Coursera ↗

Network Visualization and Intervention

📚 External: Coursera ↗

Network Visualization and Intervention

Opens on Coursera ↗

The Social Media Landscape

📚 External: Coursera ↗

The Social Media Landscape

Opens on Coursera ↗

CompTIA Cloud CV0-003: Unit 3

📚 External: Coursera ↗

CompTIA Cloud CV0-003: Unit 3

Opens on Coursera ↗

📚 External: Coursera ↗

Custom Document Extraction with Document AI Workbench

Opens on Coursera ↗

Energía para campeones: Nutrición deportiva para el fútbol

📚 External: Coursera ↗

Energía para campeones: Nutrición deportiva para el fútbol

Opens on Coursera ↗

Jetson Nano Starter to Pro - A Computer Vision Course

📚 External: Coursera ↗

Jetson Nano Starter to Pro - A Computer Vision Course

Opens on Coursera ↗

Deep Learning for Object Detection

📚 External: Coursera ↗

Deep Learning for Object Detection

Opens on Coursera ↗

Interdisciplinarity in Thought and Practice

📚 External: Coursera ↗

Interdisciplinarity in Thought and Practice

Opens on Coursera ↗

Sales Transformation Fundamentals

📚 External: Coursera ↗

Sales Transformation Fundamentals

Opens on Coursera ↗