Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,332
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
The Longevity Expert: Is There A Link Between Milk & Cancer? + Ozempic Can Really Mess You Up!
Computer Vision
The Longevity Expert: Is There A Link Between Milk & Cancer? + Ozempic Can Really Mess You Up!
The Diary Of A CEO Beginner 2y ago
The lies that sell fast fashion
Computer Vision ⚡ AI Lesson
The lies that sell fast fashion
Vox Beginner 2y ago
Dwell Time Analysis with Computer Vision | Real-Time Stream Processing
Computer Vision
Dwell Time Analysis with Computer Vision | Real-Time Stream Processing
Roboflow Beginner 2y ago
Stanford Seminar - Silicon Valley & The U.S. Government: Vannevar Lab's Brett Granberg
Computer Vision
Stanford Seminar - Silicon Valley & The U.S. Government: Vannevar Lab's Brett Granberg
Stanford Online Intermediate 2y ago
Real-Time Car Speed Tracking & Object Classification Revealed
Computer Vision
Real-Time Car Speed Tracking & Object Classification Revealed
Mervin Praison Beginner 2y ago
Bringing AI to the Masses with Adam D'Angelo, CEO of Quora
Computer Vision ⚡ AI Lesson
Bringing AI to the Masses with Adam D'Angelo, CEO of Quora
a16z Intermediate 2y ago
YOLOv9 Live Coding & Community Q&A (March 14)
Computer Vision ⚡ AI Lesson
YOLOv9 Live Coding & Community Q&A (March 14)
Roboflow Beginner 2y ago
How to perform object detection with KerasCV
Computer Vision ⚡ AI Lesson
How to perform object detection with KerasCV
TensorFlow Official Beginner 2y ago
Build an AI/ML Tennis Analysis system with YOLO, PyTorch, and Key Point Extraction
Computer Vision ⚡ AI Lesson
Build an AI/ML Tennis Analysis system with YOLO, PyTorch, and Key Point Extraction
Code In a Jiffy Beginner 2y ago
Multi-Modal NSFW Detection with AI
Computer Vision
Multi-Modal NSFW Detection with AI
James Briggs Intermediate 2y ago
This VLM can be your MultiModal AI with less than 6GB Memory!!!
Computer Vision
This VLM can be your MultiModal AI with less than 6GB Memory!!!
1littlecoder Intermediate 2y ago
New course with Hugging Face: Open Source Models with Hugging Face
Computer Vision ⚡ AI Lesson
New course with Hugging Face: Open Source Models with Hugging Face
DeepLearningAI Intermediate 2y ago
Multimodality: The Next Big Step (Demis Hassabis - Google DeepMind CEO)
Computer Vision
Multimodality: The Next Big Step (Demis Hassabis - Google DeepMind CEO)
Dwarkesh Patel Intermediate 2y ago
Pick What You FEED Into Your MIND! | Evan Carmichael
Computer Vision
Pick What You FEED Into Your MIND! | Evan Carmichael
Evan Carmichael Beginner 2y ago
Vision Transformer (ViT)
Computer Vision
Vision Transformer (ViT)
Machine Learning Studio Intermediate 2y ago
From the Macintosh to the Vision Pro — and beyond | The Vergecast
Computer Vision
From the Macintosh to the Vision Pro — and beyond | The Vergecast
The Verge Beginner 2y ago
Ben Shapiro vs Destiny Debate: Politics, Jan 6, Israel, Ukraine & Wokeism | Lex Fridman Podcast #410
Computer Vision ⚡ AI Lesson
Ben Shapiro vs Destiny Debate: Politics, Jan 6, Israel, Ukraine & Wokeism | Lex Fridman Podcast #410
Lex Fridman Beginner 2y ago
The Future Of Computer Vision
Computer Vision ⚡ AI Lesson
The Future Of Computer Vision
a16z Intermediate 2y ago
Matthew Cox: FBI Most Wanted Con Man - $55 Million in Bank Fraud | Lex Fridman Podcast #409
Computer Vision ⚡ AI Lesson
Matthew Cox: FBI Most Wanted Con Man - $55 Million in Bank Fraud | Lex Fridman Podcast #409
Lex Fridman Beginner 2y ago
Controlnet Open Pose Stable Diffusion Tutorial In 7 Minutes (Automatic1111)
Computer Vision
Controlnet Open Pose Stable Diffusion Tutorial In 7 Minutes (Automatic1111)
Bitesized Genius Beginner 2y ago
Image Classification with Hugging Face
Computer Vision ⚡ AI Lesson
Image Classification with Hugging Face
DataCamp Beginner 2y ago
Create a Custom Document Extractor with Document AI
Computer Vision ⚡ AI Lesson
Create a Custom Document Extractor with Document AI
Google Cloud Tech Intermediate 2y ago
Tune in to know what are the most exciting opportunities to look out for in computer vision!
Computer Vision ⚡ AI Lesson
Tune in to know what are the most exciting opportunities to look out for in computer vision!
The TWIML AI Podcast with Sam Charrington Intermediate 2y ago
Vision community faces evaluation challenges and should lean on cost-effective automatic evaluation
Computer Vision ⚡ AI Lesson
Vision community faces evaluation challenges and should lean on cost-effective automatic evaluation
The TWIML AI Podcast with Sam Charrington Intermediate 2y ago
CS50x 2024 - Lecture 6 - Python
Computer Vision
CS50x 2024 - Lecture 6 - Python
CS50 Beginner 2y ago
Email Segmentation Full Guide (2026)
33:12
Computer Vision ⚡ AI Lesson
Email Segmentation Full Guide (2026)
Derek Stroh Beginner 2y ago
Instance Segmentation in Masked RCNN Explained #deeplearning #machinelearning
Computer Vision ⚡ AI Lesson
Instance Segmentation in Masked RCNN Explained #deeplearning #machinelearning
CodeEmporium Beginner 2y ago
Interactive segmentation - ML on Web with MediaPipe: Episode 6
Computer Vision
Interactive segmentation - ML on Web with MediaPipe: Episode 6
Google for Developers Beginner 2y ago
Image segmentation - ML on Android with MediaPipe Series
Computer Vision
Image segmentation - ML on Android with MediaPipe Series
Google for Developers Intermediate 2y ago
Semantic Segmentation explained #deeplearning #machinelearning
Computer Vision ⚡ AI Lesson
Semantic Segmentation explained #deeplearning #machinelearning
CodeEmporium Beginner 2y ago
Stanford Seminar - Foundations of Spatial Perception for Robotics
Computer Vision ⚡ AI Lesson
Stanford Seminar - Foundations of Spatial Perception for Robotics
Stanford Online Beginner 2y ago
Simple OCR in Python with easyocr
Computer Vision ⚡ AI Lesson
Simple OCR in Python with easyocr
NeuralNine Beginner 2y ago
¿La verdadera razón detrás de la transformación digital?
Computer Vision
¿La verdadera razón detrás de la transformación digital?
Google Cloud Intermediate 2y ago
Want to find the BEST segmentation for your business?
Computer Vision
Want to find the BEST segmentation for your business?
Adam Erhart Intermediate 2y ago
Face Detection using Python and OpenCV with webcam | Python Projects | GeeksforGeeks
Computer Vision
Face Detection using Python and OpenCV with webcam | Python Projects | GeeksforGeeks
GeeksforGeeks Beginner 2y ago
120k players in a week: Lessons from the first viral CLIP app: Joseph Nelson
Computer Vision
120k players in a week: Lessons from the first viral CLIP app: Joseph Nelson
AI Engineer Beginner 2y ago
YOLOv9 Tutorial: Train Model on Custom Dataset | How to Deploy YOLOv9
Computer Vision ⚡ AI Lesson
YOLOv9 Tutorial: Train Model on Custom Dataset | How to Deploy YOLOv9
Roboflow Intermediate 2y ago
YOLO-World Live Coding & Community Q&A (Feb 27)
Computer Vision
YOLO-World Live Coding & Community Q&A (Feb 27)
Roboflow Beginner 2y ago
YOLO-World: Real-Time, Zero-Shot Object Detection Explained
Computer Vision
YOLO-World: Real-Time, Zero-Shot Object Detection Explained
Roboflow Beginner 2y ago
Speed Estimation & Vehicle Tracking | Computer Vision | Open Source
Computer Vision
Speed Estimation & Vehicle Tracking | Computer Vision | Open Source
Roboflow Beginner 2y ago
AI Trends 2024: Computer Vision with Naila Murray - 665
Computer Vision ⚡ AI Lesson
AI Trends 2024: Computer Vision with Naila Murray - 665
The TWIML AI Podcast with Sam Charrington Beginner 2y ago
Big Ideas 2024: New Applications for Computer Vision and Video Intelligence with Kimberly Tan
Computer Vision ⚡ AI Lesson
Big Ideas 2024: New Applications for Computer Vision and Video Intelligence with Kimberly Tan
a16z Intermediate 2y ago
Teddy Atlas: Mike Tyson, Cus D'Amato, Boxing, Loyalty, Fear & Greatness | Lex Fridman Podcast #406
Computer Vision
Teddy Atlas: Mike Tyson, Cus D'Amato, Boxing, Loyalty, Fear & Greatness | Lex Fridman Podcast #406
Lex Fridman Beginner 2y ago
GPT-4V Alternative (Self-Hosted): Deploy CogVLM on AWS
Computer Vision
GPT-4V Alternative (Self-Hosted): Deploy CogVLM on AWS
Roboflow Beginner 2y ago
Image classification - ML on Raspberry Pi with MediaPipe Series
Computer Vision
Image classification - ML on Raspberry Pi with MediaPipe Series
Google for Developers Beginner 2y ago
COCO Dataset and its use in Computer Vision #machinelearning #deeplearning
Computer Vision ⚡ AI Lesson
COCO Dataset and its use in Computer Vision #machinelearning #deeplearning
CodeEmporium Beginner 2y ago
Data, Systems and ML for Visual Understanding with Cody Coleman - 660
Computer Vision ⚡ AI Lesson
Data, Systems and ML for Visual Understanding with Cody Coleman - 660
The TWIML AI Podcast with Sam Charrington Beginner 2y ago
Object detection - ML on Raspberry Pi with MediaPipe Series
Computer Vision
Object detection - ML on Raspberry Pi with MediaPipe Series
Google for Developers Beginner 2y ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Modern AI Models for Vision and Multimodal Understanding
📚 Coursera Course ↗
Self-paced
Modern AI Models for Vision and Multimodal Understanding
Opens on Coursera ↗
Videojuegos: ¿de qué hablamos?
📚 Coursera Course ↗
Self-paced
Videojuegos: ¿de qué hablamos?
Opens on Coursera ↗
Preparing Multimodal Data: Vision, Audio, and NLP Pipelines
📚 Coursera Course ↗
Self-paced
Preparing Multimodal Data: Vision, Audio, and NLP Pipelines
Opens on Coursera ↗
Create and Test a Document AI Processor
📚 Coursera Course ↗
Self-paced
Create and Test a Document AI Processor
Opens on Coursera ↗
Landing.AI for Beginners: Build Data Visualization AI Models
📚 Coursera Course ↗
Self-paced
Landing.AI for Beginners: Build Data Visualization AI Models
Opens on Coursera ↗
Machine Learning in Python: Analyze & Apply
📚 Coursera Course ↗
Self-paced
Machine Learning in Python: Analyze & Apply
Opens on Coursera ↗