Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,332
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
Why Legacy SIEM Models Are Struggling | Ali Ghodsi at RSAC 2026
Computer Vision
Why Legacy SIEM Models Are Struggling | Ali Ghodsi at RSAC 2026
Databricks Advanced 1mo ago
Yasser Benigmin - Domain Adaptation in the Era of Foundation Models
Computer Vision
Yasser Benigmin - Domain Adaptation in the Era of Foundation Models
Cohere Advanced 1mo ago
How Audi Uses AI to Transform Automotive Manufacturing at Scale | Amazon Web Services
Computer Vision ⚡ AI Lesson
How Audi Uses AI to Transform Automotive Manufacturing at Scale | Amazon Web Services
Amazon Web Services Advanced 2mo ago
TensorFlow: Advanced Techniques Specialization
Computer Vision ⚡ AI Lesson
TensorFlow: Advanced Techniques Specialization
DeepLearning.AI Advanced 2mo ago
AI Guidance for Physical Work
Computer Vision
AI Guidance for Physical Work
Y Combinator Advanced 3mo ago
YOLO26 Fine-Tuning | Detection and Instance Segmentation | Live Coding + Q&A (Jan 15th)
Computer Vision ⚡ AI Lesson
YOLO26 Fine-Tuning | Detection and Instance Segmentation | Live Coding + Q&A (Jan 15th)
Roboflow Advanced 3mo ago
Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models
Computer Vision
Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models
Microsoft Research Advanced 4mo ago
Anthony Fuller & Yousef Yassin - LookWhere? Efficient Visual Recognition by Learning Where to Look
Computer Vision ⚡ AI Lesson
Anthony Fuller & Yousef Yassin - LookWhere? Efficient Visual Recognition by Learning Where to Look
Cohere Advanced 4mo ago
Basketball AI: Player Tracking, Team Detection, and Number Recognition with Python
Computer Vision ⚡ AI Lesson
Basketball AI: Player Tracking, Team Detection, and Number Recognition with Python
Roboflow Advanced 5mo ago
Genomcore impulsa la investigación biomédica con AWS e IA | Amazon Web Services
Computer Vision
Genomcore impulsa la investigación biomédica con AWS e IA | Amazon Web Services
Amazon Web Services Advanced 6mo ago
Qwen3-Omni: The First Open All-in-One AI?
Computer Vision
Qwen3-Omni: The First Open All-in-One AI?
What's AI by Louis-François Bouchard Advanced 7mo ago
Distilling Transformers and Diffusion Models for Robust Edge Use Cases [Fatih Porikli] - 738
Computer Vision ⚡ AI Lesson
Distilling Transformers and Diffusion Models for Robust Edge Use Cases [Fatih Porikli] - 738
The TWIML AI Podcast with Sam Charrington Advanced 10mo ago
VGG From Scratch – Deep Learning Theory & PyTorch Implementation (Full Course)
Computer Vision ⚡ AI Lesson
VGG From Scratch – Deep Learning Theory & PyTorch Implementation (Full Course)
freeCodeCamp.org Advanced 10mo ago
Transforming Guest Experiences: GoTo Foods’ Data Journey with Amperity & Databricks
Computer Vision ⚡ AI Lesson
Transforming Guest Experiences: GoTo Foods’ Data Journey with Amperity & Databricks
Databricks Advanced 10mo ago
Train YOLO on Custom Dataset | Object Detection Step-by-Step Tutorial
Computer Vision
Train YOLO on Custom Dataset | Object Detection Step-by-Step Tutorial
Samin Learns AI Advanced 10mo ago
FastVLM brings advanced computer vision to your phone...
Computer Vision ⚡ AI Lesson
FastVLM brings advanced computer vision to your phone...
NeuralNine Advanced 11mo ago
Find out how Nevada DETR achieved 4x faster approvals with Vertex AI
Computer Vision
Find out how Nevada DETR achieved 4x faster approvals with Vertex AI
Google Cloud Advanced 1y ago
PaliGemma – Making Gemma 2 see by adding a vision encoder
Computer Vision
PaliGemma – Making Gemma 2 see by adding a vision encoder
Google for Developers Advanced 1y ago
George Hotz | mixture of experts (like deepseek) on tinygrad sovereign AMD stack | AMD YOLO
Computer Vision
George Hotz | mixture of experts (like deepseek) on tinygrad sovereign AMD stack | AMD YOLO
george hotz archive Advanced 1y ago
Microsoft’s Phi-4 SLM: Open-Source AI for Text, Vision & Audio!
Computer Vision
Microsoft’s Phi-4 SLM: Open-Source AI for Text, Vision & Audio!
Analytics Vidhya Advanced 1y ago
Deepseek is back with VISION
Computer Vision
Deepseek is back with VISION
1littlecoder Advanced 1y ago
Using Vertex AI for healthcare
Computer Vision
Using Vertex AI for healthcare
Google Cloud Tech Advanced 1y ago
Enhance Generative AI Model Accuracy Through High-Quality Multimodal Data Processing
Computer Vision
Enhance Generative AI Model Accuracy Through High-Quality Multimodal Data Processing
NVIDIA Developer Advanced 1y ago
YOLOv2 (YOLO9000) and YOLOv3 Explained
Computer Vision ⚡ AI Lesson
YOLOv2 (YOLO9000) and YOLOv3 Explained
ExplainingAI Advanced 1y ago
New Video AI by META & Stanford Univ: APOLLO (7B)
Computer Vision ⚡ AI Lesson
New Video AI by META & Stanford Univ: APOLLO (7B)
Discover AI Advanced 1y ago
MedAI: Vision Language Models & Fine-Tuning (KnowAda)
Computer Vision
MedAI: Vision Language Models & Fine-Tuning (KnowAda)
Discover AI Advanced 1y ago
open-animal-tracks
Computer Vision ⚡ AI Lesson
open-animal-tracks
Data Skeptic Advanced 1y ago
Bird Distribution Modeling with Satbird
Computer Vision ⚡ AI Lesson
Bird Distribution Modeling with Satbird
Data Skeptic Advanced 1y ago
Missy Franklin, Angela Ruggiero & Ashton Eaton | Olympic Panel | Talks at Google
Computer Vision
Missy Franklin, Angela Ruggiero & Ashton Eaton | Olympic Panel | Talks at Google
Talks at Google Advanced 1y ago
Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum
Computer Vision
Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum
Microsoft Research Advanced 1y ago
Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson
Computer Vision
Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson
Latent Space Advanced 1y ago
JETSON AI LAB | Research Group Meeting (8/6/2024)
Computer Vision
JETSON AI LAB | Research Group Meeting (8/6/2024)
NVIDIA Developer Advanced 1y ago
Audience Segmentation Tips: 3 Ways to Segment Your Email List
3:24
Computer Vision ⚡ AI Lesson
Audience Segmentation Tips: 3 Ways to Segment Your Email List
Klaviyo Advanced 1y ago
Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - 692
Computer Vision ⚡ AI Lesson
Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - 692
The TWIML AI Podcast with Sam Charrington Advanced 1y ago
New Microsoft Vision Model has AMAZING TRICKS!!!
Computer Vision ⚡ AI Lesson
New Microsoft Vision Model has AMAZING TRICKS!!!
1littlecoder Advanced 1y ago
Robotics AI for Industrial Applications
Computer Vision
Robotics AI for Industrial Applications
Weights & Biases Advanced 1y ago
Ashmal Vayani - Seeing the World as It Speaks  Multilingual, Culturally Aware Multimodal AI
Computer Vision ⚡ AI Lesson
Ashmal Vayani - Seeing the World as It Speaks Multilingual, Culturally Aware Multimodal AI
Cohere Advanced 6mo ago
Shashanka Venkataramana and Valentinos Pariza - Franca  Nested Matryoshka Clustering for Scalable Vi
Computer Vision ⚡ AI Lesson
Shashanka Venkataramana and Valentinos Pariza - Franca Nested Matryoshka Clustering for Scalable Vi
Cohere Advanced 7mo ago
David Fan & Peter Tong  - Scaling Language Free Visual Representation Learning
Computer Vision ⚡ AI Lesson
David Fan & Peter Tong - Scaling Language Free Visual Representation Learning
Cohere Advanced 9mo ago
RF-DETR Beat YOLOs on Real-time Object Detection | Fine-Tuning | Live Coding & Q&A (Mar 27th)
Computer Vision
RF-DETR Beat YOLOs on Real-time Object Detection | Fine-Tuning | Live Coding & Q&A (Mar 27th)
Roboflow Advanced 1y ago
YOLOE: Real-time Zero-shot Object Detection | Visual Prompting | Live Coding & Q&A (Mar 14th)
Computer Vision
YOLOE: Real-time Zero-shot Object Detection | Visual Prompting | Live Coding & Q&A (Mar 14th)
Roboflow Advanced 1y ago
Aya Vision - The Challenges & Breakthroughs
Computer Vision ⚡ AI Lesson
Aya Vision - The Challenges & Breakthroughs
Cohere Advanced 1y ago
Shreyash Arya- B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable
Computer Vision
Shreyash Arya- B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable
Cohere Advanced 1y ago
Peng Xia - RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
Computer Vision
Peng Xia - RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
Cohere Advanced 1y ago
Gwanghyun (Bradley) Kim - BeyondScene: Higher-Resolution Human-Scene Generation
Computer Vision
Gwanghyun (Bradley) Kim - BeyondScene: Higher-Resolution Human-Scene Generation
Cohere Advanced 1y ago
Football AI | Community Q&A (Aug 29)
Computer Vision ⚡ AI Lesson
Football AI | Community Q&A (Aug 29)
Roboflow Advanced 1y ago
Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)
Computer Vision
Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)
Roboflow Advanced 1y ago
Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum
Computer Vision ⚡ AI Lesson
Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum
Microsoft Research Advanced 1y ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Enhance Images: Quality Fixes Fast
📚 Coursera Course ↗
Self-paced
Enhance Images: Quality Fixes Fast
Opens on Coursera ↗
Deep Learning Applications for Computer Vision
📚 Coursera Course ↗
Self-paced
Deep Learning Applications for Computer Vision
Opens on Coursera ↗
Vision Models: Train and Evaluate
📚 Coursera Course ↗
Self-paced
Vision Models: Train and Evaluate
Opens on Coursera ↗
Prompt Engineering for Vision Models
📚 Coursera Course ↗
Self-paced
Prompt Engineering for Vision Models
Opens on Coursera ↗
Behavioral Marketing
📚 Coursera Course ↗
Self-paced
Behavioral Marketing
Opens on Coursera ↗
Start Remote Sensing
📚 Coursera Course ↗
Self-paced
Start Remote Sensing
Opens on Coursera ↗