Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,332 lessons
Skills in this topic
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
Vision AI in a Web Browser: Creating a Scavenger Hunt App with Inference.js
Computer Vision
Roboflow Beginner 6mo ago
Choosing Your Path: AI Professional Program Course Selection Guide
Computer Vision ⚡ AI Lesson
Stanford Online Beginner 6mo ago
As we outsource more to smart home gadgets, have we thought about how we’d react in their place?
Computer Vision
The Verge Intermediate 6mo ago
Real Time AI Video Object Tracking! 💥EdgeTAM - Sam 2 for On-Device 🔥
Computer Vision
1littlecoder Intermediate 6mo ago
How to Create a Profitable Paid Search Strategy for 2026
Computer Vision
Exposure Ninja Intermediate 6mo ago
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Computer Vision ⚡ AI Lesson
Muhammad Moin Beginner 6mo ago
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
Computer Vision ⚡ AI Lesson
The Information Beginner 6mo ago
Genomcore impulsa la investigación biomédica con AWS e IA (Genomcore drives biomedical research with AWS and AI) | Amazon Web Services
Computer Vision
Amazon Web Services Advanced 6mo ago
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
Computer Vision ⚡ AI Lesson
PyTorch Beginner 6mo ago
How to Stay Relevant in AI & Data Science (w/ Alexey Grigorev)
Computer Vision
Abhishek Thakur Intermediate 6mo ago
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Computer Vision
Roboflow Beginner 6mo ago
Where Hazel is at and what we've been up to // October 2025 Hazel Dev Log
Computer Vision ⚡ AI Lesson
The Cherno Intermediate 6mo ago
Multimodal Data Analysis with AI
Computer Vision ⚡ AI Lesson
Latent Space Intermediate 6mo ago
Generate Image Captions That Focus on What You Need
Computer Vision ⚡ AI Lesson
NVIDIA Developer Intermediate 6mo ago
Meta Engineer on Industrial Computer Vision systems
Computer Vision
MLOps.community Intermediate 6mo ago
Duolingo English Test - NEW Complete Practice Test with Answers
Computer Vision
Teacher Luke - Duolingo English Test Intermediate 6mo ago
Ashmal Vayani - Seeing the World as It Speaks Multilingual, Culturally Aware Multimodal AI
Computer Vision ⚡ AI Lesson
Cohere Advanced 6mo ago
OneDrive’s AI is scanning your PHOTOS
Computer Vision ⚡ AI Lesson
David Bombal Beginner 7mo ago
The SECRET to Hyper Segmentation (and Sales)
Computer Vision ⚡ AI Lesson
Optimum7 Intermediate 7mo ago
How Jesai Scored a Perfect 160 on the Duolingo English Test (DET)!
Computer Vision
Teacher Luke - Duolingo English Test Beginner 7mo ago
Salary of Computer Vision Engineer | How Much does a Computer Vision Engineer Make?
Computer Vision
Simplilearn Beginner 7mo ago
🚨 Smart AI for Wildlife & Traffic Safety! 🐘🚦
Computer Vision
Arivi by HCL GUVI Beginner 7mo ago
Discover Web AI: Client side Agents, Gen AI, and machine learning in the browser
Computer Vision
Chrome for Developers Beginner 7mo ago
Shashanka Venkataramana and Valentinos Pariza - Franca Nested Matryoshka Clustering for Scalable Vi
Computer Vision ⚡ AI Lesson
Cohere Advanced 7mo ago
Industrial AI Machine Vision in Action with Databricks & Crosser
Computer Vision ⚡ AI Lesson
Databricks Intermediate 7mo ago
Mistral AI Models on Amazon Bedrock: When to Use Pixtral Large vs Mistral Small 3.0
Computer Vision ⚡ AI Lesson
AWS Developers Beginner 7mo ago
Qwen3-Omni: The First Open All-in-One AI?
Computer Vision
What's AI by Louis-François Bouchard Advanced 7mo ago
"Smartest" VISION AI in Cars Do Reasoning?
Computer Vision
Discover AI Intermediate 7mo ago
Build an Agentic RAG with LangGraph | Step-by-Step Guide + Code
Computer Vision
Muhammad Moin Beginner 7mo ago
What is multimodality? A deep dive on multimodality in Gemma 3
Computer Vision
Google for Developers Beginner 7mo ago
Stanford CS231N | Spring 2025 | Lecture 2: Image Classification with Linear Classifiers
Computer Vision
Stanford Online Beginner 8mo ago
Computer Vision with Arduino Tutorial – 2 Projects
Computer Vision ⚡ AI Lesson
freeCodeCamp.org Beginner 8mo ago
(Self-Supervised Learning 🤔) with Code Implementation in Tamil | AI Coach John
Computer Vision
AI Coach John (Tamil) Beginner 8mo ago
New Way Now: Simbe's AI robotic vision tech improves retail sales and margin with Google Cloud
Computer Vision
Google Cloud Intermediate 8mo ago
How The Field Museum Unlocks New Research Possibilities with Vision AI
Computer Vision
Roboflow Beginner 6mo ago
Meta's Daniel Bolya on Perception Encoder and Improving Visual Understanding
Computer Vision ⚡ AI Lesson
Roboflow Beginner 7mo ago
Audi Reader: Reinventing the Car User Manual with Vision AI
Computer Vision
Roboflow Beginner 7mo ago
AI for Robotics: How Almond Uses Computer Vision with Manufacturing Robots
Computer Vision ⚡ AI Lesson
Roboflow Beginner 7mo ago
AI for Food Processing: How FloVision Uses Computer Vision to Reduce Waste and Improve Efficiency
Computer Vision
Roboflow Beginner 7mo ago
How to Automate Quality Inspections with ResNet Classification Models
Computer Vision
Roboflow Beginner 8mo ago
Agentic RAG Explained: The Future of AI Agents & Retrieval Augmented Generation
Computer Vision
Muhammad Moin Beginner 8mo ago
Logo Recognition and Brand Analysis with AI | Learn to Use Vision Language Models
Computer Vision
Roboflow Beginner 8mo ago
RMFG Factory Tour: Automating Sheet Metal Operations with Vision AI
Computer Vision ⚡ AI Lesson
Roboflow Beginner 8mo ago
Stanford CS231N | Spring 2025 | Lecture 5: Image Classification with CNNs
Computer Vision
Stanford Online Beginner 8mo ago
Stanford CS231N | Spring 2025 | Lecture 9: Object Detection, Image Segmentation, Visualizing
Computer Vision
Stanford Online Beginner 8mo ago
Stanford CS231N Deep Learning for Computer Vision | Spring 2025 | Lecture 10: Video Understanding
Computer Vision
Stanford Online Beginner 8mo ago
RF-DETR: How to Train SOTA for Object Detection on a Custom Dataset | Step-by-step guide
Computer Vision
Roboflow Intermediate 8mo ago
Testing DeepSeek V3.1 – The BEST Open Source AI Model?
Computer Vision ⚡ AI Lesson
Muhammad Moin Beginner 8mo ago
📚 Coursera Courses · Opens on Coursera · Free to audit
Autoscaling TensorFlow Model Deployments with TF Serving and Kubernetes
📚 Coursera Course
Self-paced
Cisco Software-Defined Wan for Enterprise & Cloud: Unit 1
📚 Coursera Course
Self-paced
Introduction to Image Processing
📚 Coursera Course
Self-paced
Analyze Video Data Using OpenCV and Python
📚 Coursera Course
Self-paced
Positioning: What you need for a successful Marketing Strategy
📚 Coursera Course
Self-paced
Machine Learning in Python: Analyze & Apply
📚 Coursera Course
Self-paced