Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

2,353
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
One Open AI Model Built My Website, Image & Video
Computer Vision
One Open AI Model Built My Website, Image & Video
Analytics Vidhya Beginner 4mo ago
Every Type of AI is Converging Into One #Shorts #AI #NeuralKeith
Computer Vision
Every Type of AI is Converging Into One #Shorts #AI #NeuralKeith
NeuralKeith Beginner 4mo ago
Page Match lets you quickly sync your spot in a physical or ebook with an audiobook.
Computer Vision
Page Match lets you quickly sync your spot in a physical or ebook with an audiobook.
The Verge Beginner 4mo ago
Helping Sports Teams Improve Decision Making with AI: Interview with PlayVision's Marc Zoghby
Computer Vision ⚡ AI Lesson
Helping Sports Teams Improve Decision Making with AI: Interview with PlayVision's Marc Zoghby
Roboflow Beginner 5mo ago
Full Duolingo English Test with Answers: January 2026 Format
Computer Vision ⚡ AI Lesson
Full Duolingo English Test with Answers: January 2026 Format
Teacher Luke - Duolingo English Test Beginner 5mo ago
Artem Sevastopolsky and Dmitrii Pozdeev - DenseMarks  Learning Canonical Embeddings for Human Heads
Computer Vision ⚡ AI Lesson
Artem Sevastopolsky and Dmitrii Pozdeev - DenseMarks Learning Canonical Embeddings for Human Heads
Cohere Beginner 5mo ago
On-Device AI Just Leveled Up: Liquid AI’s LFM-2.5 Explained
Computer Vision
On-Device AI Just Leveled Up: Liquid AI’s LFM-2.5 Explained
Analytics Vidhya Beginner 5mo ago
Deploy Vision Models to NVIDIA Jetson Orin in Minutes | AI at the Edge
Computer Vision ⚡ AI Lesson
Deploy Vision Models to NVIDIA Jetson Orin in Minutes | AI at the Edge
Roboflow Beginner 6mo ago
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Computer Vision ⚡ AI Lesson
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Muhammad Moin Beginner 6mo ago
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Computer Vision
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Muhammad Moin Beginner 7mo ago
Email Segmentation: Getting More Sales With Less Traffic
Computer Vision
Email Segmentation: Getting More Sales With Less Traffic
Social Media Examiner Beginner 7mo ago
I Took the Duolingo English Test and Here’s What Happened
Computer Vision
I Took the Duolingo English Test and Here’s What Happened
Teacher Luke - Duolingo English Test Beginner 7mo ago
Should AI be introduced to kids early?  #podcast #interview
Computer Vision ⚡ AI Lesson
Should AI be introduced to kids early? #podcast #interview
Abhishek Thakur Beginner 7mo ago
Multimodal and Multi-model AI in Action
Computer Vision
Multimodal and Multi-model AI in Action
Microsoft 365 Developer Beginner 7mo ago
A no nonsense intro to BM25
Computer Vision ⚡ AI Lesson
A no nonsense intro to BM25
Abhishek Thakur Beginner 7mo ago
Getting Started With Transformers for Computer Vision - Divya Swaminathan & Tony Reina
Computer Vision ⚡ AI Lesson
Getting Started With Transformers for Computer Vision - Divya Swaminathan & Tony Reina
PyData Beginner 7mo ago
Choosing Your Path: AI Professional Program Course Selection Guide
Computer Vision ⚡ AI Lesson
Choosing Your Path: AI Professional Program Course Selection Guide
Stanford Online Beginner 7mo ago
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
Computer Vision ⚡ AI Lesson
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
The Information Beginner 7mo ago
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
Computer Vision ⚡ AI Lesson
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
PyTorch Beginner 8mo ago
Innovations in Neurodevelopmental Sensory Processing Research (insp!re) Lab Overview
Computer Vision
Innovations in Neurodevelopmental Sensory Processing Research (insp!re) Lab Overview
USC Chan Division of Occupational Science and Occupational Therapy Beginner 8mo ago
OneDrive’s AI is scanning your PHOTOS
Computer Vision ⚡ AI Lesson
OneDrive’s AI is scanning your PHOTOS
David Bombal Beginner 8mo ago
Salary of Computer Vision Engineer | How Much does a Computer Vision Engineer Make?
Computer Vision
Salary of Computer Vision Engineer | How Much does a Computer Vision Engineer Make?
Simplilearn Beginner 8mo ago
🚨 Smart AI for Wildlife & Traffic Safety! 🐘🚦
Computer Vision
🚨 Smart AI for Wildlife & Traffic Safety! 🐘🚦
Arivi by HCL GUVI Beginner 8mo ago
Discover Web AI: Client side Agents, Gen AI, and machine learning in the browser
Computer Vision
Discover Web AI: Client side Agents, Gen AI, and machine learning in the browser
Chrome for Developers Beginner 9mo ago
Mistral AI Models on Amazon Bedrock: When to Use Pixtral Large vs Mistral Small 3.0
Computer Vision ⚡ AI Lesson
Mistral AI Models on Amazon Bedrock: When to Use Pixtral Large vs Mistral Small 3.0
AWS Developers Beginner 9mo ago
What is multimodality? A deep dive on multimodality in Gemma 3
Computer Vision
What is multimodality? A deep dive on multimodality in Gemma 3
Google for Developers Beginner 9mo ago
Stanford CS231N | Spring 2025 | Lecture 2: Image Classification with Linear Classifiers
Computer Vision
Stanford CS231N | Spring 2025 | Lecture 2: Image Classification with Linear Classifiers
Stanford Online Beginner 10mo ago
Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning
Computer Vision
Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning
Analytics Vidhya Beginner 6mo ago
AI for Occupancy Analytics | Building a Smart Parking System
Computer Vision ⚡ AI Lesson
AI for Occupancy Analytics | Building a Smart Parking System
Roboflow Beginner 6mo ago
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Computer Vision ⚡ AI Lesson
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Muhammad Moin Beginner 7mo ago
How to Deploy Vision AI Models in the Cloud | Serverless, Dedicated, Batch Processing
Computer Vision ⚡ AI Lesson
How to Deploy Vision AI Models in the Cloud | Serverless, Dedicated, Batch Processing
Roboflow Beginner 7mo ago
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Computer Vision ⚡ AI Lesson
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Roboflow Beginner 7mo ago
Validate Actions with Vision AI | Building a Web App for Real-Time Drinking Detection
Computer Vision ⚡ AI Lesson
Validate Actions with Vision AI | Building a Web App for Real-Time Drinking Detection
Roboflow Beginner 7mo ago
Vision AI in a Web Browser: Creating a Scavenger Hunt App with Inference.js
Computer Vision
Vision AI in a Web Browser: Creating a Scavenger Hunt App with Inference.js
Roboflow Beginner 7mo ago
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Computer Vision ⚡ AI Lesson
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Muhammad Moin Beginner 7mo ago
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Computer Vision
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Roboflow Beginner 8mo ago
How The Field Museum Unlocks New Research Possibilities with Vision AI
Computer Vision
How The Field Museum Unlocks New Research Possibilities with Vision AI
Roboflow Beginner 8mo ago
Meta's Daniel Bolya on Perception Encoder and Improving Visual Understanding
Computer Vision ⚡ AI Lesson
Meta's Daniel Bolya on Perception Encoder and Improving Visual Understanding
Roboflow Beginner 9mo ago
Audi Reader: Reinventing the Car User Manual with Vision AI
Computer Vision
Audi Reader: Reinventing the Car User Manual with Vision AI
Roboflow Beginner 9mo ago
AI for Robotics: How Almond Uses Computer Vision with Manufacturing Robots
Computer Vision ⚡ AI Lesson
AI for Robotics: How Almond Uses Computer Vision with Manufacturing Robots
Roboflow Beginner 9mo ago
AI for Food Processing: How FloVision Uses Computer Vision to Reduce Waste and Improve Efficiency
Computer Vision
AI for Food Processing: How FloVision Uses Computer Vision to Reduce Waste and Improve Efficiency
Roboflow Beginner 9mo ago
Build an Agentic RAG with LangGraph | Step-by-Step Guide + Code
Computer Vision
Build an Agentic RAG with LangGraph | Step-by-Step Guide + Code
Muhammad Moin Beginner 9mo ago
How to Automate Quality Inspections with ResNet Classification Models
Computer Vision
How to Automate Quality Inspections with ResNet Classification Models
Roboflow Beginner 9mo ago
Agentic RAG Explained: The Future of AI Agents & Retrieval Augmented Generation
Computer Vision
Agentic RAG Explained: The Future of AI Agents & Retrieval Augmented Generation
Muhammad Moin Beginner 9mo ago
Logo Recognition and Brand Analysis with AI | Learn to Use Vision Language Models
Computer Vision
Logo Recognition and Brand Analysis with AI | Learn to Use Vision Language Models
Roboflow Beginner 9mo ago
RMFG Factory Tour: Automating Sheet Metal Operations with Vision AI
Computer Vision ⚡ AI Lesson
RMFG Factory Tour: Automating Sheet Metal Operations with Vision AI
Roboflow Beginner 10mo ago
Stanford CS231N | Spring 2025 | Lecture 5: Image Classification with CNNs
Computer Vision
Stanford CS231N | Spring 2025 | Lecture 5: Image Classification with CNNs
Stanford Online Beginner 10mo ago
Stanford CS231N | Spring 2025 | Lecture 9: Object Detection, Image Segmentation, Visualizing
Computer Vision
Stanford CS231N | Spring 2025 | Lecture 9: Object Detection, Image Segmentation, Visualizing
Stanford Online Beginner 10mo ago
📚 Continue on Coursera External links · Free to audit
1 / 3 View all →
Landing.AI for Beginners: Build Data Visualization AI Models
📚 External: Coursera ↗
Self-paced
Landing.AI for Beginners: Build Data Visualization AI Models
Opens on Coursera ↗
Sync CRM Contacts
📚 External: Coursera ↗
Self-paced
Sync CRM Contacts
Opens on Coursera ↗
Brand Management and Brand Equity Strategy
📚 External: Coursera ↗
Self-paced
Brand Management and Brand Equity Strategy
Opens on Coursera ↗
Applied Machine Learning: Techniques and Applications
📚 External: Coursera ↗
Self-paced
Applied Machine Learning: Techniques and Applications
Opens on Coursera ↗
Introduction to Computer Vision and Image Processing
📚 External: Coursera ↗
Self-paced
Introduction to Computer Vision and Image Processing
Opens on Coursera ↗
Sales Transformation Fundamentals
📚 External: Coursera ↗
Self-paced
Sales Transformation Fundamentals
Opens on Coursera ↗