Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,538 lessons
Skills in this topic
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
A no nonsense intro to BM25
Computer Vision ⚡ AI Lesson
Abhishek Thakur Beginner 7mo ago
Use this Template for Speak About the Photo + 10 Practice Questions | Duolingo English Test
Computer Vision
Teacher Luke - Duolingo English Test Intermediate 7mo ago
Getting Started With Transformers for Computer Vision - Divya Swaminathan & Tony Reina
Computer Vision ⚡ AI Lesson
PyData Beginner 7mo ago
The biggest mistake companies make deploying AI #podcast #interview #dataanalysis #ai #datascience
Computer Vision ⚡ AI Lesson
Abhishek Thakur Intermediate 7mo ago
Basic Network Segmentation
Computer Vision ⚡ AI Lesson
John Hammond Intermediate 7mo ago
Validate Actions with Vision AI | Building a Web App for Real-Time Drinking Detection
Computer Vision ⚡ AI Lesson
Roboflow Beginner 7mo ago
Build a RAG Application from Scratch — No LangChain, No LlamaIndex
Computer Vision
Muhammad Moin Intermediate 7mo ago
Vision AI in a Web Browser: Creating a Scavenger Hunt App with Inference.js
Computer Vision
Roboflow Beginner 7mo ago
Choosing Your Path: AI Professional Program Course Selection Guide
Computer Vision ⚡ AI Lesson
Stanford Online Beginner 7mo ago
Real Time AI Video Object Tracking! 💥EdgeTAM - Sam 2 for On-Device 🔥
Computer Vision
1littlecoder Intermediate 7mo ago
How to Create a Profitable Paid Search Strategy for 2026
Computer Vision
Exposure Ninja Intermediate 7mo ago
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Computer Vision ⚡ AI Lesson
Muhammad Moin Beginner 7mo ago
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
Computer Vision ⚡ AI Lesson
The Information Beginner 7mo ago
Build DIY Home Security With Computer Vision and a Raspberry Pi
Computer Vision
The Dividor Daily Intermediate 7mo ago
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
Computer Vision ⚡ AI Lesson
PyTorch Beginner 7mo ago
Zhiwen Fan - VLM 3R Vision Language Models Augmented with Instruction Aligned 3D Reconstruction
Computer Vision ⚡ AI Lesson
Cohere Advanced 8mo ago
Multimodal Data Analysis with AI
Computer Vision ⚡ AI Lesson
Latent Space Intermediate 8mo ago
Stop Losing Luggage: AI Computer Vision for Global Bag Tracking
Computer Vision
The Dividor Daily Intermediate 8mo ago
Generate Image Captions That Focus on What You Need
Computer Vision ⚡ AI Lesson
NVIDIA Developer Intermediate 8mo ago
Meta Engineer on Industrial Computer Vision systems
Computer Vision
MLOps.community Intermediate 8mo ago
Duolingo English Test - NEW Complete Practice Test with Answers
Computer Vision
Teacher Luke - Duolingo English Test Intermediate 8mo ago
Ashmal Vayani - Seeing the World as It Speaks Multilingual, Culturally Aware Multimodal AI
Computer Vision ⚡ AI Lesson
Cohere Advanced 8mo ago
Innovations in Neurodevelopmental Sensory Processing Research (insp!re) Lab Overview
Computer Vision
USC Chan Division of Occupational Science and Occupational Therapy Beginner 8mo ago
OneDrive’s AI is scanning your PHOTOS
Computer Vision ⚡ AI Lesson
David Bombal Beginner 8mo ago
The SECRET to Hyper Segmentation (and Sales)
Computer Vision ⚡ AI Lesson
Optimum7 Intermediate 8mo ago
Salary of Computer Vision Engineer | How Much does a Computer Vision Engineer Make?
Computer Vision
Simplilearn Beginner 8mo ago
🚨 Smart AI for Wildlife & Traffic Safety! 🐘🚦
Computer Vision
Arivi by HCL GUVI Beginner 8mo ago
Discover Web AI: Client side Agents, Gen AI, and machine learning in the browser
Computer Vision
Chrome for Developers Beginner 9mo ago
Mistral AI Models on Amazon Bedrock: When to Use Pixtral Large vs Mistral Small 3.0
Computer Vision ⚡ AI Lesson
AWS Developers Beginner 9mo ago
Qwen3-Omni: The First Open All-in-One AI?
Computer Vision
What's AI by Louis-François Bouchard Advanced 9mo ago
"Smartest" VISION AI in Cars Do Reasoning?
Computer Vision
Discover AI Intermediate 9mo ago
What is multimodality? A deep dive on multimodality in Gemma 3
Computer Vision
Google for Developers Beginner 9mo ago
How to focus on building your skills when everything's so distracting with Ania Kubów [Podcast #187]
Computer Vision ⚡ AI Lesson
freeCodeCamp.org Intermediate 9mo ago
Stanford CS231N | Spring 2025 | Lecture 2: Image Classification with Linear Classifiers
Computer Vision
Stanford Online Beginner 10mo ago
Demystifying AI & Data Science (w/ Luca Massaron) 📱
Computer Vision ⚡ AI Lesson
Abhishek Thakur Intermediate 7mo ago
Demystifying AI & Data Science (w/ Luca Massaron)
Computer Vision ⚡ AI Lesson
Abhishek Thakur Intermediate 7mo ago
How to Stay Relevant in AI & Data Science (w/ Alexey Grigorev)
Computer Vision
Abhishek Thakur Intermediate 8mo ago
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Computer Vision
Roboflow Beginner 8mo ago
How The Field Museum Unlocks New Research Possibilities with Vision AI
Computer Vision
Roboflow Beginner 8mo ago
Meta's Daniel Bolya on Perception Encoder and Improving Visual Understanding
Computer Vision ⚡ AI Lesson
Roboflow Beginner 9mo ago
Audi Reader: Reinventing the Car User Manual with Vision AI
Computer Vision
Roboflow Beginner 9mo ago
AI for Robotics: How Almond Uses Computer Vision with Manufacturing Robots
Computer Vision ⚡ AI Lesson
Roboflow Beginner 9mo ago
AI for Food Processing: How FloVision Uses Computer Vision to Reduce Waste and Improve Efficiency
Computer Vision
Roboflow Beginner 9mo ago
Build an Agentic RAG with LangGraph | Step-by-Step Guide + Code
Computer Vision
Muhammad Moin Beginner 9mo ago
How to Automate Quality Inspections with ResNet Classification Models
Computer Vision
Roboflow Beginner 9mo ago
Agentic RAG Explained: The Future of AI Agents & Retrieval Augmented Generation
Computer Vision
Muhammad Moin Beginner 9mo ago
Logo Recognition and Brand Analysis with AI | Learn to Use Vision Language Models
Computer Vision
Roboflow Beginner 9mo ago
RMFG Factory Tour: Automating Sheet Metal Operations with Vision AI
Computer Vision ⚡ AI Lesson
Roboflow Beginner 10mo ago
📚 Continue on Coursera External links · Free to audit
Using Specialized Processors with Document AI (Python)
📚 External: Coursera ↗
Self-paced
Analyze Video Data Using OpenCV and Python
📚 External: Coursera ↗
Self-paced
Fine-Tuning and Evaluating Vision AI Models
📚 External: Coursera ↗
Self-paced
Materiales para envase y embalaje (Packaging Materials)
📚 External: Coursera ↗
Self-paced
UiPath Automation Developer Professional
📚 External: Coursera ↗
Self-paced
Introduction to Computer Vision with TensorFlow
📚 External: Coursera ↗
Self-paced