Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,332
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Computer Vision ⚡ AI Lesson
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Muhammad Moin Beginner 6mo ago
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
Computer Vision ⚡ AI Lesson
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
The Information Beginner 6mo ago
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
Computer Vision ⚡ AI Lesson
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
PyTorch Beginner 6mo ago
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Computer Vision
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Roboflow Beginner 6mo ago
How The Field Museum Unlocks New Research Possibilities with Vision AI
Computer Vision
How The Field Museum Unlocks New Research Possibilities with Vision AI
Roboflow Beginner 6mo ago
OneDrive’s AI is scanning your PHOTOS
Computer Vision ⚡ AI Lesson
OneDrive’s AI is scanning your PHOTOS
David Bombal Beginner 7mo ago
How Jesai Scored a Perfect 160 on the Duolingo English Test (DET)!
Computer Vision
How Jesai Scored a Perfect 160 on the Duolingo English Test (DET)!
Teacher Luke - Duolingo English Test Beginner 7mo ago
Salary of Computer Vision Engineer | How Much does a Computer Vision Engineer Make?
Computer Vision
Salary of Computer Vision Engineer | How Much does a Computer Vision Engineer Make?
Simplilearn Beginner 7mo ago
🚨 Smart AI for Wildlife & Traffic Safety! 🐘🚦
Computer Vision
🚨 Smart AI for Wildlife & Traffic Safety! 🐘🚦
Arivi by HCL GUVI Beginner 7mo ago
Discover Web AI: Client side Agents, Gen AI, and machine learning in the browser
Computer Vision
Discover Web AI: Client side Agents, Gen AI, and machine learning in the browser
Chrome for Developers Beginner 7mo ago
Mistral AI Models on Amazon Bedrock: When to Use Pixtral Large vs Mistral Small 3.0
Computer Vision ⚡ AI Lesson
Mistral AI Models on Amazon Bedrock: When to Use Pixtral Large vs Mistral Small 3.0
AWS Developers Beginner 7mo ago
Build an Agentic RAG with LangGraph | Step-by-Step Guide + Code
Computer Vision
Build an Agentic RAG with LangGraph | Step-by-Step Guide + Code
Muhammad Moin Beginner 7mo ago
What is multimodality? A deep dive on multimodality in Gemma 3
Computer Vision
What is multimodality? A deep dive on multimodality in Gemma 3
Google for Developers Beginner 7mo ago
Stanford CS231N | Spring 2025 | Lecture 2: Image Classification with Linear Classifiers
Computer Vision
Stanford CS231N | Spring 2025 | Lecture 2: Image Classification with Linear Classifiers
Stanford Online Beginner 8mo ago
Stanford CS231N | Spring 2025 | Lecture 5: Image Classification with CNNs
Computer Vision
Stanford CS231N | Spring 2025 | Lecture 5: Image Classification with CNNs
Stanford Online Beginner 8mo ago
Computer Vision with Arduino Tutorial – 2 Projects
Computer Vision ⚡ AI Lesson
Computer Vision with Arduino Tutorial – 2 Projects
freeCodeCamp.org Beginner 8mo ago
(Self-Supervised Learning 🤔) with Code Implementation in Tamil | AI Coach John
Computer Vision
(Self-Supervised Learning 🤔) with Code Implementation in Tamil | AI Coach John
AI Coach John (Tamil) Beginner 8mo ago
Almost All Email Campaigns Are Doing This Wrong
Computer Vision
Almost All Email Campaigns Are Doing This Wrong
Neil Patel Beginner 9mo ago
YOLOv5 Tutorial | Architecture, Assigning Targets & Loss Function Explained
Computer Vision
YOLOv5 Tutorial | Architecture, Assigning Targets & Loss Function Explained
ExplainingAI Beginner 9mo ago
Timothée Darcet - Scaling Self Supervised Learning for Vision  An Introduction to DINOv2
Computer Vision ⚡ AI Lesson
Timothée Darcet - Scaling Self Supervised Learning for Vision An Introduction to DINOv2
Cohere Beginner 10mo ago
3 Insane Algorithms Netflix Uses to Scan BILLIONS of Frames
Computer Vision
3 Insane Algorithms Netflix Uses to Scan BILLIONS of Frames
Coding with Lewis Beginner 10mo ago
What is Computer Vision
Computer Vision
What is Computer Vision
AI Simplified Beginner 10mo ago
Multimodal Document Intelligence with NVIDIA Llama Nemotron Nano VL
Computer Vision ⚡ AI Lesson
Multimodal Document Intelligence with NVIDIA Llama Nemotron Nano VL
NVIDIA Developer Beginner 10mo ago
Why More Researchers Should become Content Creators
Computer Vision
Why More Researchers Should become Content Creators
Jia-Bin Huang Beginner 10mo ago
Convolutional Neural Networks (CNN) - Face Recognition Case Study - Algorithm & Full Code Explained
Computer Vision
Convolutional Neural Networks (CNN) - Face Recognition Case Study - Algorithm & Full Code Explained
Thinking Neuron Beginner 11mo ago
Building a Vision Transformer Model from Scratch with PyTorch
Computer Vision ⚡ AI Lesson
Building a Vision Transformer Model from Scratch with PyTorch
freeCodeCamp.org Beginner 11mo ago
Seulki Park - Visually Consistent Hierarchical Image Classification
Computer Vision
Seulki Park - Visually Consistent Hierarchical Image Classification
Cohere Beginner 11mo ago
AI Personal Tutor for Everyone
Computer Vision
AI Personal Tutor for Everyone
Y Combinator Beginner 1y ago
Computer Vision in 100 Seconds
Computer Vision
Computer Vision in 100 Seconds
Infinite Codes Beginner 1y ago
Build an AI/ML NBA Basketball Analysis system with YOLO, OpenCV, and Python
Computer Vision
Build an AI/ML NBA Basketball Analysis system with YOLO, OpenCV, and Python
Code In a Jiffy Beginner 1y ago
Multimodal AI with Logan Kilpatrick
Computer Vision
Multimodal AI with Logan Kilpatrick
Google Cloud Beginner 1y ago
DETR Explained | End-to-End Object Detection with Transformers | DETR Tutorial Part 1
Computer Vision
DETR Explained | End-to-End Object Detection with Transformers | DETR Tutorial Part 1
ExplainingAI Beginner 1y ago
Meta's Daniel Bolya on Perception Encoder and Improving Visual Understanding
Computer Vision ⚡ AI Lesson
Meta's Daniel Bolya on Perception Encoder and Improving Visual Understanding
Roboflow Beginner 7mo ago
Audi Reader: Reinventing the Car User Manual with Vision AI
Computer Vision
Audi Reader: Reinventing the Car User Manual with Vision AI
Roboflow Beginner 7mo ago
AI for Robotics: How Almond Uses Computer Vision with Manufacturing Robots
Computer Vision ⚡ AI Lesson
AI for Robotics: How Almond Uses Computer Vision with Manufacturing Robots
Roboflow Beginner 7mo ago
AI for Food Processing: How FloVision Uses Computer Vision to Reduce Waste and Improve Efficiency
Computer Vision
AI for Food Processing: How FloVision Uses Computer Vision to Reduce Waste and Improve Efficiency
Roboflow Beginner 7mo ago
How to Automate Quality Inspections with ResNet Classification Models
Computer Vision
How to Automate Quality Inspections with ResNet Classification Models
Roboflow Beginner 8mo ago
Agentic RAG Explained: The Future of AI Agents & Retrieval Augmented Generation
Computer Vision
Agentic RAG Explained: The Future of AI Agents & Retrieval Augmented Generation
Muhammad Moin Beginner 8mo ago
Logo Recognition and Brand Analysis with AI | Learn to Use Vision Language Models
Computer Vision
Logo Recognition and Brand Analysis with AI | Learn to Use Vision Language Models
Roboflow Beginner 8mo ago
RMFG Factory Tour: Automating Sheet Metal Operations with Vision AI
Computer Vision ⚡ AI Lesson
RMFG Factory Tour: Automating Sheet Metal Operations with Vision AI
Roboflow Beginner 8mo ago
Stanford CS231N | Spring 2025 | Lecture 9: Object Detection, Image Segmentation, Visualizing
Computer Vision
Stanford CS231N | Spring 2025 | Lecture 9: Object Detection, Image Segmentation, Visualizing
Stanford Online Beginner 8mo ago
Stanford CS231N Deep Learning for Computer Vision | Spring 2025 | Lecture 10: Video Understanding
Computer Vision
Stanford CS231N Deep Learning for Computer Vision | Spring 2025 | Lecture 10: Video Understanding
Stanford Online Beginner 8mo ago
Testing DeepSeek V3.1 – The BEST Open Source AI Model?
Computer Vision ⚡ AI Lesson
Testing DeepSeek V3.1 – The BEST Open Source AI Model?
Muhammad Moin Beginner 8mo ago
Introducing CodeSpy.ai – Detect AI-Generated Code with Confidence
Computer Vision ⚡ AI Lesson
Introducing CodeSpy.ai – Detect AI-Generated Code with Confidence
Muhammad Moin Beginner 9mo ago
Control PTZ Cameras with AI | ONVIF Integration with Object Tracking
Computer Vision
Control PTZ Cameras with AI | ONVIF Integration with Object Tracking
Roboflow Beginner 9mo ago
Getting Started with Google Gemini 2.5 Pro: Detect Objects, Generate Captions & OCR
Computer Vision
Getting Started with Google Gemini 2.5 Pro: Detect Objects, Generate Captions & OCR
Muhammad Moin Beginner 9mo ago
Auto Labeling Image Data | How to Annotate a Dataset and Train a Vision AI Model
Computer Vision
Auto Labeling Image Data | How to Annotate a Dataset and Train a Vision AI Model
Roboflow Beginner 10mo ago
How to Detect People in Danger Zones with AI
Computer Vision
How to Detect People in Danger Zones with AI
Roboflow Beginner 1y ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Marketing Communications: Intro to Consumer Behavior
📚 Coursera Course ↗
Self-paced
Marketing Communications: Intro to Consumer Behavior
Opens on Coursera ↗
Camera and Imaging
📚 Coursera Course ↗
Self-paced
Camera and Imaging
Opens on Coursera ↗
Marketing Management
📚 Coursera Course ↗
Self-paced
Marketing Management
Opens on Coursera ↗
AutoML: Build ML Models without Code
📚 Coursera Course ↗
Self-paced
AutoML: Build ML Models without Code
Opens on Coursera ↗
IoT Networking
📚 Coursera Course ↗
Self-paced
IoT Networking
Opens on Coursera ↗
Sync CRM Contacts
📚 Coursera Course ↗
Self-paced
Sync CRM Contacts
Opens on Coursera ↗