Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
Stanford CS231N | Spring 2025 | Lecture 5: Image Classification with CNNs
Computer Vision
Stanford CS231N | Spring 2025 | Lecture 5: Image Classification with CNNs
Stanford Online Beginner 10mo ago
Stanford CS231N | Spring 2025 | Lecture 9: Object Detection, Image Segmentation, Visualizing
Computer Vision
Stanford CS231N | Spring 2025 | Lecture 9: Object Detection, Image Segmentation, Visualizing
Stanford Online Beginner 10mo ago
RF-DETR: How to Train SOTA for Object Detection on a Custom Dataset | Step-by-step guide
Computer Vision
RF-DETR: How to Train SOTA for Object Detection on a Custom Dataset | Step-by-step guide
Roboflow Intermediate 10mo ago
Testing DeepSeek V3.1 – The BEST Open Source AI Model?
Computer Vision ⚡ AI Lesson
Testing DeepSeek V3.1 – The BEST Open Source AI Model?
Muhammad Moin Beginner 10mo ago
Computer Vision with Arduino Tutorial – 2 Projects
Computer Vision ⚡ AI Lesson
Computer Vision with Arduino Tutorial – 2 Projects
freeCodeCamp.org Beginner 10mo ago
Business Strategy Discussion Developing Business Solutions A Comprehensive Guide.
Computer Vision
Business Strategy Discussion Developing Business Solutions A Comprehensive Guide.
Strategic Marketing Beginner 10mo ago
Discover the Future of AI: Multimodal AI Revolution!
Computer Vision
Discover the Future of AI: Multimodal AI Revolution!
AIHub101 Intermediate 10mo ago
New Way Now: Simbe's AI robotic vision tech improves retail sales and margin with Google Cloud
Computer Vision
New Way Now: Simbe's AI robotic vision tech improves retail sales and margin with Google Cloud
Google Cloud Intermediate 10mo ago
EV Pickups Are a Bust for US Carmakers
Computer Vision
EV Pickups Are a Bust for US Carmakers
Bloomberg Technology Intermediate 10mo ago
I trained a Sign Language Detection Transformer (here's how you can do it too!)
Computer Vision ⚡ AI Lesson
I trained a Sign Language Detection Transformer (here's how you can do it too!)
Nicholas Renotte Beginner 10mo ago
Almost All Email Campaigns Are Doing This Wrong
Computer Vision
Almost All Email Campaigns Are Doing This Wrong
Neil Patel Beginner 10mo ago
David Fan & Peter Tong  - Scaling Language Free Visual Representation Learning
Computer Vision ⚡ AI Lesson
David Fan & Peter Tong - Scaling Language Free Visual Representation Learning
Cohere Advanced 10mo ago
Introducing CodeSpy.ai – Detect AI-Generated Code with Confidence
Computer Vision ⚡ AI Lesson
Introducing CodeSpy.ai – Detect AI-Generated Code with Confidence
Muhammad Moin Beginner 10mo ago
Vision AI in 2025 — Peter Robicheaux, Roboflow
Computer Vision
Vision AI in 2025 — Peter Robicheaux, Roboflow
AI Engineer Intermediate 11mo ago
The Segmentation Tweak That Quietly BOOSTS Klaviyo Revenue #shorts #emailmarketing
1:26
Computer Vision ⚡ AI Lesson
The Segmentation Tweak That Quietly BOOSTS Klaviyo Revenue #shorts #emailmarketing
Emissary 2.0 Intermediate 11mo ago
YOLOv5 Tutorial | Architecture, Assigning Targets & Loss Function Explained
Computer Vision
YOLOv5 Tutorial | Architecture, Assigning Targets & Loss Function Explained
ExplainingAI Beginner 11mo ago
Architecture, Engineering & Construction Industry - Trends
Computer Vision
Architecture, Engineering & Construction Industry - Trends
Primerli Beginner 11mo ago
Control PTZ Cameras with AI | ONVIF Integration with Object Tracking
Computer Vision
Control PTZ Cameras with AI | ONVIF Integration with Object Tracking
Roboflow Beginner 11mo ago
Top-Ranked RAG: NeMo Retriever Leads Visual Document Retrieval Leaderboards
Computer Vision ⚡ AI Lesson
Top-Ranked RAG: NeMo Retriever Leads Visual Document Retrieval Leaderboards
NVIDIA Developer Intermediate 11mo ago
DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
Computer Vision ⚡ AI Lesson
DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
Microsoft Research Intermediate 11mo ago
2025 EC3 & CIB W78 - Partl, Rainer - Deep Neural Networks for Object-detection and Instance Seg...
Computer Vision
2025 EC3 & CIB W78 - Partl, Rainer - Deep Neural Networks for Object-detection and Instance Seg...
European Council on Computing in Construction Advanced 11mo ago
Is Your Business Running on Empty? 🤖
Computer Vision
Is Your Business Running on Empty? 🤖
imFORZA Intermediate 11mo ago
​Productionizing Prompts: How Pinterest Turned Every Team into GenAI Power Users
Computer Vision
​Productionizing Prompts: How Pinterest Turned Every Team into GenAI Power Users
Predibase by Rubrik Intermediate 11mo ago
Timothée Darcet - Scaling Self Supervised Learning for Vision  An Introduction to DINOv2
Computer Vision ⚡ AI Lesson
Timothée Darcet - Scaling Self Supervised Learning for Vision An Introduction to DINOv2
Cohere Beginner 11mo ago
Transforming Guest Experiences: GoTo Foods’ Data Journey with Amperity & Databricks
Computer Vision ⚡ AI Lesson
Transforming Guest Experiences: GoTo Foods’ Data Journey with Amperity & Databricks
Databricks Advanced 11mo ago
Transforming Data Governance for Multimodal Data at Amgen With Databricks
Computer Vision ⚡ AI Lesson
Transforming Data Governance for Multimodal Data at Amgen With Databricks
Databricks Intermediate 11mo ago
3 Insane Algorithms Netflix Uses to Scan BILLIONS of Frames
Computer Vision
3 Insane Algorithms Netflix Uses to Scan BILLIONS of Frames
Coding with Lewis Beginner 12mo ago
What is Computer Vision
Computer Vision
What is Computer Vision
AI Simplified Beginner 1y ago
Multimodal Document Intelligence with NVIDIA Llama Nemotron Nano VL
Computer Vision ⚡ AI Lesson
Multimodal Document Intelligence with NVIDIA Llama Nemotron Nano VL
NVIDIA Developer Beginner 1y ago
Train YOLO on Custom Dataset | Object Detection Step-by-Step Tutorial
Computer Vision
Train YOLO on Custom Dataset | Object Detection Step-by-Step Tutorial
Samin Learns AI Advanced 1y ago
Why More Researchers Should become Content Creators
Computer Vision
Why More Researchers Should become Content Creators
Jia-Bin Huang Beginner 1y ago
LLMs for Equities Feature Forecasting at Two Sigma [Ben Wellington] - 736
Computer Vision ⚡ AI Lesson
LLMs for Equities Feature Forecasting at Two Sigma [Ben Wellington] - 736
The TWIML AI Podcast with Sam Charrington Beginner 1y ago
Unsupervised Learning: Uncover Hidden Patterns & Data Secrets!
Computer Vision
Unsupervised Learning: Uncover Hidden Patterns & Data Secrets!
The AI Standard Beginner 1y ago
Stanford CS231N Deep Learning for Computer Vision | Spring 2025 | Lecture 10: Video Understanding
Computer Vision
Stanford CS231N Deep Learning for Computer Vision | Spring 2025 | Lecture 10: Video Understanding
Stanford Online Beginner 10mo ago
How to Build a Smart Football Analysis System Using YOLO11 #computervision #yolo11 #objectdetection
Computer Vision
How to Build a Smart Football Analysis System Using YOLO11 #computervision #yolo11 #objectdetection
Muhammad Moin Beginner 11mo ago
Getting Started with Google Gemini 2.5 Pro: Detect Objects, Generate Captions & OCR
Computer Vision
Getting Started with Google Gemini 2.5 Pro: Detect Objects, Generate Captions & OCR
Muhammad Moin Beginner 11mo ago
Auto Labeling Image Data | How to Annotate a Dataset and Train a Vision AI Model
Computer Vision
Auto Labeling Image Data | How to Annotate a Dataset and Train a Vision AI Model
Roboflow Beginner 11mo ago
YOLO11 + SAHI = Better Detection for Small Objects! (Step-by-Step Guide)
Computer Vision
YOLO11 + SAHI = Better Detection for Small Objects! (Step-by-Step Guide)
Muhammad Moin Beginner 11mo ago
Improve Tiny Object Detection with YOLO11 + SAHI 🔍
Computer Vision
Improve Tiny Object Detection with YOLO11 + SAHI 🔍
Muhammad Moin Intermediate 11mo ago
Kimi K2 Coder: NEW Best Free AI Coding Tool? (Open-Source Review)
Computer Vision
Kimi K2 Coder: NEW Best Free AI Coding Tool? (Open-Source Review)
Muhammad Moin Beginner 11mo ago
Build a Car & License Plate Recognition System with YOLO11 + PaddleOCR
Computer Vision
Build a Car & License Plate Recognition System with YOLO11 + PaddleOCR
Muhammad Moin Beginner 11mo ago
How to Fine-Tune SmolVLM2 | Convert Documents into JSON
Computer Vision
How to Fine-Tune SmolVLM2 | Convert Documents into JSON
Roboflow Intermediate 11mo ago
Build a Local RAG App with DeepSeek R1 & Ollama in Streamlit – Step-by-Step Tutorial
Computer Vision
Build a Local RAG App with DeepSeek R1 & Ollama in Streamlit – Step-by-Step Tutorial
Muhammad Moin Intermediate 11mo ago
Gemini Code Assist - AI Coding Agents: A Step-by-Step Tutorial
Computer Vision
Gemini Code Assist - AI Coding Agents: A Step-by-Step Tutorial
Muhammad Moin Beginner 11mo ago
Build a PDF Text Extractor App with Streamlit, n8n & Mistral OCR API – Step-by-Step Tutorial
Computer Vision
Build a PDF Text Extractor App with Streamlit, n8n & Mistral OCR API – Step-by-Step Tutorial
Muhammad Moin Beginner 11mo ago
Gemini CLI + MCP Server: A Step-by-Step Tutorial
Computer Vision
Gemini CLI + MCP Server: A Step-by-Step Tutorial
Muhammad Moin Intermediate 12mo ago
Building MCP Servers with LangChain in Python
Computer Vision
Building MCP Servers with LangChain in Python
Muhammad Moin Intermediate 1y ago
Build an AI Agent in n8n to Analyze YouTube Comments & Report Insights
Computer Vision
Build an AI Agent in n8n to Analyze YouTube Comments & Report Insights
Muhammad Moin Beginner 1y ago
📚 Continue on Coursera External links · Free to audit
1 / 3 View all →
Document AI: Project & API Writing
📚 External: Coursera ↗
Self-paced
Document AI: Project & API Writing
Opens on Coursera ↗
AI Applications: Computer Vision and Speech Recognition
📚 External: Coursera ↗
Self-paced
AI Applications: Computer Vision and Speech Recognition
Opens on Coursera ↗
Uptraining with Document AI Workbench
📚 External: Coursera ↗
Self-paced
Uptraining with Document AI Workbench
Opens on Coursera ↗
Create Image Captioning Models - Português Brasileiro
📚 External: Coursera ↗
Self-paced
Create Image Captioning Models - Português Brasileiro
Opens on Coursera ↗
Build Real-Time Face Recognition with OpenCV
📚 External: Coursera ↗
Self-paced
Build Real-Time Face Recognition with OpenCV
Opens on Coursera ↗
Market Analysis
📚 External: Coursera ↗
Self-paced
Market Analysis
Opens on Coursera ↗