Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
Drowsiness Detection with Vision AI | Improve Safety with AI
Computer Vision
Drowsiness Detection with Vision AI | Improve Safety with AI
Roboflow Intermediate 1y ago
Multimodal Open Source at Kyutai, From Online Demos to On-Device - Alexandre Défossez
Computer Vision
Multimodal Open Source at Kyutai, From Online Demos to On-Device - Alexandre Défossez
PyTorch Intermediate 1y ago
Instagram Profile Scraper using Apify and Google Sheets in n8n
Computer Vision
Instagram Profile Scraper using Apify and Google Sheets in n8n
Muhammad Moin Beginner 1y ago
MedGemma LLM: Doctors, Meet Your AI Assistant 🧠
Computer Vision ⚡ AI Lesson
MedGemma LLM: Doctors, Meet Your AI Assistant 🧠
AI Anytime Intermediate 1y ago
Introduction to AI Agents: LLMs, Workflows, and AI Agents
Computer Vision
Introduction to AI Agents: LLMs, Workflows, and AI Agents
Muhammad Moin Beginner 1y ago
[CVPR 2025] Pos3R: 6D Pose Estimation for Unseen Objects Made Easy
Computer Vision
[CVPR 2025] Pos3R: 6D Pose Estimation for Unseen Objects Made Easy
anucvml Intermediate 1y ago
Convolutional Neural Networks (CNN) - Face Recognition Case Study - Algorithm & Full Code Explained
Computer Vision
Convolutional Neural Networks (CNN) - Face Recognition Case Study - Algorithm & Full Code Explained
Thinking Neuron Beginner 1y ago
FastVLM brings advanced computer vision to your phone...
Computer Vision ⚡ AI Lesson
FastVLM brings advanced computer vision to your phone...
NeuralNine Advanced 1y ago
Building a Vision Transformer Model from Scratch with PyTorch
Computer Vision ⚡ AI Lesson
Building a Vision Transformer Model from Scratch with PyTorch
freeCodeCamp.org Beginner 1y ago
China’s ByteDance Just Dropped BAGEL — Multimodal AI Beast!
Computer Vision
China’s ByteDance Just Dropped BAGEL — Multimodal AI Beast!
Analytics Vidhya Intermediate 1y ago
Seulki Park - Visually Consistent Hierarchical Image Classification
Computer Vision
Seulki Park - Visually Consistent Hierarchical Image Classification
Cohere Beginner 1y ago
AI Personal Tutor for Everyone
Computer Vision
AI Personal Tutor for Everyone
Y Combinator Beginner 1y ago
OpenAI Multimodal CLIP Architecture in 60 Seconds
Computer Vision
OpenAI Multimodal CLIP Architecture in 60 Seconds
HowCanAIHelp Beginner 1y ago
Computer Vision in 100 Seconds
Computer Vision
Computer Vision in 100 Seconds
Infinite Codes Beginner 1y ago
How to Segment Your Audience in Mailchimp
9:16
Computer Vision ⚡ AI Lesson
How to Segment Your Audience in Mailchimp
Intuit Mailchimp Intermediate 1y ago
Build an AI/ML NBA Basketball Analysis system with YOLO, OpenCV, and Python
Computer Vision
Build an AI/ML NBA Basketball Analysis system with YOLO, OpenCV, and Python
Code In a Jiffy Beginner 1y ago
How to Detect People in Danger Zones with AI
Computer Vision
How to Detect People in Danger Zones with AI
Roboflow Beginner 1y ago
Multimodal AI with Logan Kilpatrick
Computer Vision
Multimodal AI with Logan Kilpatrick
Google Cloud Beginner 1y ago
DETR Explained | End-to-End Object Detection with Transformers | DETR Tutorial Part 1
Computer Vision
DETR Explained | End-to-End Object Detection with Transformers | DETR Tutorial Part 1
ExplainingAI Beginner 1y ago
Find out how Nevada DETR achieved 4x faster approvals with Vertex AI
Computer Vision
Find out how Nevada DETR achieved 4x faster approvals with Vertex AI
Google Cloud Advanced 1y ago
Visual RAG Unleashed: Harnessing ColQwen2.5 & Qwen2.5-VL-3B-Instruct for Next-Level AI
Computer Vision
Visual RAG Unleashed: Harnessing ColQwen2.5 & Qwen2.5-VL-3B-Instruct for Next-Level AI
Bytes of AI Beginner 1y ago
Multimodal AI & Next Gen Databases | Data Brew | Episode 42
Computer Vision ⚡ AI Lesson
Multimodal AI & Next Gen Databases | Data Brew | Episode 42
Databricks Intermediate 1y ago
PaliGemma – Making Gemma 2 see by adding a vision encoder
Computer Vision
PaliGemma – Making Gemma 2 see by adding a vision encoder
Google for Developers Advanced 1y ago
Aya Vision Challenge, Ep. 3
Computer Vision
Aya Vision Challenge, Ep. 3
Cohere Beginner 1y ago
Nurturing Customer Relationships - Behind the Keynotes - Season 3 Episode 8
Computer Vision
Nurturing Customer Relationships - Behind the Keynotes - Season 3 Episode 8
Nordic Business Forum Beginner 1y ago
Seminar: Segment Anything - Meta AI (15-03-2025)
Computer Vision
Seminar: Segment Anything - Meta AI (15-03-2025)
IEC Seminar Intermediate 1y ago
Building a travel buddy with Gemma
Computer Vision
Building a travel buddy with Gemma
Google for Developers Intermediate 1y ago
George Hotz | mixture of experts (like deepseek) on tinygrad sovereign AMD stack | AMD YOLO
Computer Vision
George Hotz | mixture of experts (like deepseek) on tinygrad sovereign AMD stack | AMD YOLO
george hotz archive Advanced 1y ago
Le meilleur OCR au monde : Mistral AI
Computer Vision
Le meilleur OCR au monde : Mistral AI
LAW I.A. Avocat & intelligence artificielle Lexvox Advanced 1y ago
Microsoft's Phi-4 Multimodal : NEW Opensource LLM is a TINY BEAST! (Full Test & Review)
Computer Vision
Microsoft's Phi-4 Multimodal : NEW Opensource LLM is a TINY BEAST! (Full Test & Review)
Codewello Beginner 1y ago
Open-source AI models are surpassing closed source (fast) | AI/ML Monthly
Computer Vision ⚡ AI Lesson
Open-source AI models are surpassing closed source (fast) | AI/ML Monthly
Daniel Bourke Beginner 1y ago
⚙️ How Does a Capacitive Proximity Sensor Work? #automation #sensor #proximitysensors #basics
Computer Vision
⚙️ How Does a Capacitive Proximity Sensor Work? #automation #sensor #proximitysensors #basics
Mr. SMART Engineering Beginner 1y ago
Train Foundation Models Better with LightlyTrain – Achieve Better Accuracy with Less Effort
Computer Vision
Train Foundation Models Better with LightlyTrain – Achieve Better Accuracy with Less Effort
Muhammad Moin Beginner 1y ago
Intuit uses Google Cloud Document AI to further simplify tax prep for millions
Computer Vision
Intuit uses Google Cloud Document AI to further simplify tax prep for millions
Google Cloud Intermediate 1y ago
RF-DETR Architecture & How it Works | Why is DETR Better Than YOLO?
Computer Vision
RF-DETR Architecture & How it Works | Why is DETR Better Than YOLO?
Roboflow Beginner 1y ago
RF-DETR, Batch Processing, Instant Training, Serverless Inference, and More | What's New in Roboflow
Computer Vision
RF-DETR, Batch Processing, Instant Training, Serverless Inference, and More | What's New in Roboflow
Roboflow Intermediate 1y ago
Expedition Aya Kick Off Event
Computer Vision
Expedition Aya Kick Off Event
Cohere Intermediate 1y ago
RF-DETR Beat YOLOs on Real-time Object Detection | Fine-Tuning | Live Coding & Q&A (Mar 27th)
Computer Vision
RF-DETR Beat YOLOs on Real-time Object Detection | Fine-Tuning | Live Coding & Q&A (Mar 27th)
Roboflow Advanced 1y ago
How to Train RF-DETR Object Detection Transformer on Custom Dataset for Potholes Detection
Computer Vision
How to Train RF-DETR Object Detection Transformer on Custom Dataset for Potholes Detection
Muhammad Moin Beginner 1y ago
RF-DETR: Real-Time Object Detection in Images and Videos | A Step-by-Step Guide
Computer Vision
RF-DETR: Real-Time Object Detection in Images and Videos | A Step-by-Step Guide
Muhammad Moin Beginner 1y ago
Build a Football Analysis System Using YOLO11 and Supervision
Computer Vision
Build a Football Analysis System Using YOLO11 and Supervision
Muhammad Moin Intermediate 1y ago
Aya Vision Challenge, Ep. 2
Computer Vision ⚡ AI Lesson
Aya Vision Challenge, Ep. 2
Cohere Beginner 1y ago
YOLOE: Real-Time Zero-Shot Object Detection and Segmentation Explained | Visual Prompting
Computer Vision
YOLOE: Real-Time Zero-Shot Object Detection and Segmentation Explained | Visual Prompting
Muhammad Moin Advanced 1y ago
Build a Tennis Analysis System with YOLOv12 and OpenCV
Computer Vision
Build a Tennis Analysis System with YOLOv12 and OpenCV
Muhammad Moin Beginner 1y ago
New Course: YOLOv12 – Custom Object Detection, Tracking & Web Apps
Computer Vision
New Course: YOLOv12 – Custom Object Detection, Tracking & Web Apps
Muhammad Moin Intermediate 1y ago
How to Train YOLOv12 Models on Your Custom Dataset in Google Colab
Computer Vision
How to Train YOLOv12 Models on Your Custom Dataset in Google Colab
Muhammad Moin Beginner 1y ago
YOLOE: Real-time Zero-shot Object Detection | Visual Prompting | Live Coding & Q&A (Mar 14th)
Computer Vision
YOLOE: Real-time Zero-shot Object Detection | Visual Prompting | Live Coding & Q&A (Mar 14th)
Roboflow Advanced 1y ago
Measure Objects with AI | Identifying Common Pitfalls and Increasing Precision
Computer Vision
Measure Objects with AI | Identifying Common Pitfalls and Increasing Precision
Roboflow Beginner 1y ago
📚 Continue on Coursera External links · Free to audit
1 / 3 View all →
Create video, audio and infographics for online learning
📚 External: Coursera ↗
Self-paced
Create video, audio and infographics for online learning
Opens on Coursera ↗
AI and Disaster Management
📚 External: Coursera ↗
Self-paced
AI and Disaster Management
Opens on Coursera ↗
Craft Sales Strategy
📚 External: Coursera ↗
Self-paced
Craft Sales Strategy
Opens on Coursera ↗
Business Economics and Game Theory for Decision Making
📚 External: Coursera ↗
Self-paced
Business Economics and Game Theory for Decision Making
Opens on Coursera ↗
Artificial Vision for Textile quality control
📚 External: Coursera ↗
Self-paced
Artificial Vision for Textile quality control
Opens on Coursera ↗
Process Images, Create Captioning AI Models
📚 External: Coursera ↗
Self-paced
Process Images, Create Captioning AI Models
Opens on Coursera ↗