Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,542
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
Multimodal AI Business Companions
Computer Vision
Multimodal AI Business Companions
Daniel Finkenstadt Intermediate 2y ago
Enrich Scenario Planning with Multimodal Wargames
Computer Vision
Enrich Scenario Planning with Multimodal Wargames
Daniel Finkenstadt Intermediate 2y ago
Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni
2:29
Computer Vision ⚡ AI Lesson
Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni
Burned Guitarist Intermediate 2y ago
Google's New PaliGemma-Open Vision Language Model
Computer Vision
Google's New PaliGemma-Open Vision Language Model
Krish Naik Beginner 2y ago
Getting started With Google's PaliGemma: Open Vision-Language Model
Computer Vision ⚡ AI Lesson
Getting started With Google's PaliGemma: Open Vision-Language Model
Krish Naik Beginner 2y ago
What is Document AI?
Computer Vision
What is Document AI?
Google Cloud Beginner 2y ago
New2Cyber en Espanol | El final de la era del profesional de seguridad
Computer Vision ⚡ AI Lesson
New2Cyber en Espanol | El final de la era del profesional de seguridad
SANS Institute Intermediate 2y ago
How To Fine-tune LLaVA Model (From Your Laptop!)
Computer Vision
How To Fine-tune LLaVA Model (From Your Laptop!)
Brev Intermediate 2y ago
Build computer vision applications easily with Roboflow and Google Cloud
Computer Vision
Build computer vision applications easily with Roboflow and Google Cloud
Google Cloud Advanced 2y ago
Pose landmark detection - ML on Web with MediaPipe: Episode 8
Computer Vision
Pose landmark detection - ML on Web with MediaPipe: Episode 8
Google for Developers Beginner 2y ago
Build an AI/ML Football Analysis system with YOLO, OpenCV, and Python
Computer Vision
Build an AI/ML Football Analysis system with YOLO, OpenCV, and Python
Code In a Jiffy Beginner 2y ago
YOLO V9 Tutorial - How to use Kaggle 30 hrs GPU with Roboflow
Computer Vision
YOLO V9 Tutorial - How to use Kaggle 30 hrs GPU with Roboflow
عمرو عبداللطيف Beginner 2y ago
Dwell Time Analysis | Real-Time Stream Processing | Community Q&A (April 11)
Computer Vision
Dwell Time Analysis | Real-Time Stream Processing | Community Q&A (April 11)
Roboflow Beginner 2y ago
The Longevity Expert: Is There A Link Between Milk & Cancer? + Ozempic Can Really Mess You Up!
Computer Vision
The Longevity Expert: Is There A Link Between Milk & Cancer? + Ozempic Can Really Mess You Up!
The Diary Of A CEO Beginner 2y ago
The lies that sell fast fashion
Computer Vision ⚡ AI Lesson
The lies that sell fast fashion
Vox Beginner 2y ago
Mean Average Precision (mAP) | Explanation and Implementation for Object Detection
Computer Vision
Mean Average Precision (mAP) | Explanation and Implementation for Object Detection
ExplainingAI Intermediate 2y ago
Dwell Time Analysis with Computer Vision | Real-Time Stream Processing
Computer Vision
Dwell Time Analysis with Computer Vision | Real-Time Stream Processing
Roboflow Beginner 2y ago
Is synthetic data from generative models ready for image recognition? (ICLR 2023, spotlight)
Computer Vision
Is synthetic data from generative models ready for image recognition? (ICLR 2023, spotlight)
MIPAL-SNU Intermediate 2y ago
Real-Time Car Speed Tracking & Object Classification Revealed
Computer Vision
Real-Time Car Speed Tracking & Object Classification Revealed
Mervin Praison Beginner 2y ago
Bringing AI to the Masses with Adam D'Angelo, CEO of Quora
Computer Vision ⚡ AI Lesson
Bringing AI to the Masses with Adam D'Angelo, CEO of Quora
a16z Intermediate 2y ago
What are Good Features to Track? Shi-Tomasi Corner Detector Explained
Computer Vision
What are Good Features to Track? Shi-Tomasi Corner Detector Explained
Jia-Bin Huang Beginner 2y ago
R-CNN Explained
Computer Vision
R-CNN Explained
ExplainingAI Beginner 2y ago
How to perform object detection with KerasCV
Computer Vision ⚡ AI Lesson
How to perform object detection with KerasCV
TensorFlow Official Beginner 2y ago
Overview of KerasCV and KerasNLP
Computer Vision ⚡ AI Lesson
Overview of KerasCV and KerasNLP
TensorFlow Official Beginner 2y ago
Build an AI/ML Tennis Analysis system with YOLO, PyTorch, and Key Point Extraction
Computer Vision ⚡ AI Lesson
Build an AI/ML Tennis Analysis system with YOLO, PyTorch, and Key Point Extraction
Code In a Jiffy Beginner 2y ago
Multi-Modal NSFW Detection with AI
Computer Vision
Multi-Modal NSFW Detection with AI
James Briggs Intermediate 2y ago
This VLM can be your MultiModal AI with less than 6GB Memory!!!
Computer Vision
This VLM can be your MultiModal AI with less than 6GB Memory!!!
1littlecoder Intermediate 2y ago
New course with Hugging Face: Open Source Models with Hugging Face
Computer Vision ⚡ AI Lesson
New course with Hugging Face: Open Source Models with Hugging Face
DeepLearningAI Intermediate 2y ago
Multimodality: The Next Big Step (Demis Hassabis - Google DeepMind CEO)
Computer Vision
Multimodality: The Next Big Step (Demis Hassabis - Google DeepMind CEO)
Dwarkesh Patel Intermediate 2y ago
Pick What You FEED Into Your MIND! | Evan Carmichael
Computer Vision
Pick What You FEED Into Your MIND! | Evan Carmichael
Evan Carmichael Beginner 2y ago
How hard are computer vision datasets? Calibrating dataset difficulty to viewing time
Computer Vision
How hard are computer vision datasets? Calibrating dataset difficulty to viewing time
MIPAL-SNU Intermediate 2y ago
Vision Transformer (ViT)
Computer Vision
Vision Transformer (ViT)
Machine Learning Studio Intermediate 2y ago
Ben Shapiro vs Destiny Debate: Politics, Jan 6, Israel, Ukraine & Wokeism | Lex Fridman Podcast #410
Computer Vision ⚡ AI Lesson
Ben Shapiro vs Destiny Debate: Politics, Jan 6, Israel, Ukraine & Wokeism | Lex Fridman Podcast #410
Lex Fridman Beginner 2y ago
The Future Of Computer Vision
Computer Vision ⚡ AI Lesson
The Future Of Computer Vision
a16z Intermediate 2y ago
Matthew Cox: FBI Most Wanted Con Man - $55 Million in Bank Fraud | Lex Fridman Podcast #409
Computer Vision ⚡ AI Lesson
Matthew Cox: FBI Most Wanted Con Man - $55 Million in Bank Fraud | Lex Fridman Podcast #409
Lex Fridman Beginner 2y ago
Image Classification with Hugging Face
Computer Vision ⚡ AI Lesson
Image Classification with Hugging Face
DataCamp Beginner 2y ago
Create a Custom Document Extractor with Document AI
Computer Vision ⚡ AI Lesson
Create a Custom Document Extractor with Document AI
Google Cloud Tech Intermediate 2y ago
Tune in to know what are the most exciting opportunities to look out for in computer vision!
Computer Vision ⚡ AI Lesson
Tune in to know what are the most exciting opportunities to look out for in computer vision!
The TWIML AI Podcast with Sam Charrington Intermediate 2y ago
Vision community faces evaluation challenges and should lean on cost-effective automatic evaluation
Computer Vision ⚡ AI Lesson
Vision community faces evaluation challenges and should lean on cost-effective automatic evaluation
The TWIML AI Podcast with Sam Charrington Intermediate 2y ago
CS50x 2024 - Lecture 6 - Python
Computer Vision
CS50x 2024 - Lecture 6 - Python
CS50 Beginner 2y ago
Email Segmentation Full Guide (2026)
33:12
Computer Vision ⚡ AI Lesson
Email Segmentation Full Guide (2026)
Derek Stroh Beginner 2y ago
YOLOv9 Live Coding & Community Q&A (March 14)
Computer Vision ⚡ AI Lesson
YOLOv9 Live Coding & Community Q&A (March 14)
Roboflow Beginner 2y ago
YOLOv9 Tutorial: Train Model on Custom Dataset | How to Deploy YOLOv9
Computer Vision ⚡ AI Lesson
YOLOv9 Tutorial: Train Model on Custom Dataset | How to Deploy YOLOv9
Roboflow Intermediate 2y ago
YOLO-World Live Coding & Community Q&A (Feb 27)
Computer Vision
YOLO-World Live Coding & Community Q&A (Feb 27)
Roboflow Beginner 2y ago
YOLO-World: Real-Time, Zero-Shot Object Detection Explained
Computer Vision
YOLO-World: Real-Time, Zero-Shot Object Detection Explained
Roboflow Beginner 2y ago
Speed Estimation & Vehicle Tracking | Computer Vision | Open Source
Computer Vision
Speed Estimation & Vehicle Tracking | Computer Vision | Open Source
Roboflow Beginner 2y ago
AI Trends 2024: Computer Vision with Naila Murray - 665
Computer Vision ⚡ AI Lesson
AI Trends 2024: Computer Vision with Naila Murray - 665
The TWIML AI Podcast with Sam Charrington Beginner 2y ago
Big Ideas 2024: New Applications for Computer Vision and Video Intelligence with Kimberly Tan
Computer Vision ⚡ AI Lesson
Big Ideas 2024: New Applications for Computer Vision and Video Intelligence with Kimberly Tan
a16z Intermediate 2y ago
📚 Continue on Coursera External links · Free to audit
1 / 3 View all →
Introduction to Vertex AI Embeddings: Text and Multimodal
📚 External: Coursera ↗
Self-paced
Introduction to Vertex AI Embeddings: Text and Multimodal
Opens on Coursera ↗
Implement Hand Gesture Recognition with OpenCV
📚 External: Coursera ↗
Self-paced
Implement Hand Gesture Recognition with OpenCV
Opens on Coursera ↗
AutoML: Build ML Models without Code
📚 External: Coursera ↗
Self-paced
AutoML: Build ML Models without Code
Opens on Coursera ↗
Bases teóricas de la gestión de la salud y las lesiones
📚 External: Coursera ↗
Self-paced
Bases teóricas de la gestión de la salud y las lesiones
Opens on Coursera ↗
The Social Media Landscape
📚 External: Coursera ↗
Self-paced
The Social Media Landscape
Opens on Coursera ↗
Artificial Vision for Textile quality control
📚 External: Coursera ↗
Self-paced
Artificial Vision for Textile quality control
Opens on Coursera ↗