1,132 lessons

👁️ Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

TensorFlow: Advanced Techniques Specialization
Computer Vision ⚡ AI Lesson
DeepLearning.AI Advanced 3mo ago
Music AI Sandbox | AI x Creativity: Wyclef Jean
Computer Vision ⚡ AI Lesson
Google DeepMind Beginner 3mo ago
PyTorch Day India 2026 Exploring Tile based Programming Abstractions for KLA’s Image Processing Work
Computer Vision ⚡ AI Lesson
PyTorch Intermediate 3mo ago
One Open AI Model Built My Website, Image & Video
Computer Vision
Analytics Vidhya Beginner 3mo ago
Ultimate Data Science API Testing Tool
Computer Vision ⚡ AI Lesson
Krish Naik Intermediate 3mo ago
Interactive Speaking Course for 120+ | Duolingo English Test
Computer Vision
Teacher Luke - Duolingo English Test Intermediate 3mo ago
Page Match lets you quickly sync your spot in a physical or ebook with an audiobook.
Computer Vision
The Verge Beginner 4mo ago
AI Guidance for Physical Work
Computer Vision
Y Combinator Advanced 4mo ago
An image is worth NxN words | Diffusion Transformers (ViT, DiT, MMDiT)
Computer Vision ⚡ AI Lesson
Julia Turc Beginner 4mo ago
Helping Sports Teams Improve Decision Making with AI: Interview with PlayVision's Marc Zoghby
Computer Vision ⚡ AI Lesson
Roboflow Beginner 4mo ago
The Hairy Ball Theorem
Computer Vision ⚡ AI Lesson
3Blue1Brown Intermediate 4mo ago
RF-DETR Segmentation. Benchmarks, Inference, Training | Live Coding + Q&A (Jan 29th)
Computer Vision
Roboflow Intermediate 4mo ago
Full Duolingo English Test with Answers: January 2026 Format
Computer Vision ⚡ AI Lesson
Teacher Luke - Duolingo English Test Beginner 4mo ago
Unlock data from your files with Agentic Document Extraction
Computer Vision
DeepLearningAI Intermediate 4mo ago
New course! Document AI: From OCR to Agentic Doc Extraction
Computer Vision
DeepLearningAI Intermediate 4mo ago
Artem Sevastopolsky and Dmitrii Pozdeev - DenseMarks Learning Canonical Embeddings for Human Heads
Computer Vision ⚡ AI Lesson
Cohere Beginner 4mo ago
On-Device AI Just Leveled Up: Liquid AI’s LFM-2.5 Explained
Computer Vision
Analytics Vidhya Beginner 5mo ago
Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models
Computer Vision
Microsoft Research Advanced 5mo ago
I Became "Radicalized" About AI
Computer Vision ⚡ AI Lesson
Ken Jee Intermediate 5mo ago
Mistral OCR 3 Deep Dive: Document AI Done Right
Computer Vision
DataCreator AI Intermediate 5mo ago
Anthony Fuller & Yousef Yassin - LookWhere? Efficient Visual Recognition by Learning Where to Look
Computer Vision ⚡ AI Lesson
Cohere Advanced 5mo ago
The Next Frontier of AI: Real-Time Multimodal Decision Making
Computer Vision
The Information Intermediate 5mo ago
What does AI mean for education?
Computer Vision ⚡ AI Lesson
Anthropic Beginner 5mo ago
Are Humanoid Robots Actually Coming to Your Home? | Nikolaus, Rerun
Computer Vision
Weights & Biases Intermediate 5mo ago
AI Paradox: Use Text for Logic, Avatars for Meaning
Computer Vision
Discover AI Intermediate 5mo ago
Why Vision Language Models Ignore What They See [Munawar Hayat] - 758
Computer Vision ⚡ AI Lesson
TWIML AI Podcast Beginner 6mo ago
PixelTable: Revolutionizing Multimodal AI Development Simplified #shorts #youtube
Computer Vision ⚡ AI Lesson
AI Anytime Intermediate 6mo ago
Grounding DINO: Open Vocabulary Object Detection on Videos
Computer Vision
PyImageSearch Intermediate 6mo ago
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Computer Vision ⚡ AI Lesson
Muhammad Moin Beginner 6mo ago
Insane Results with YOLOv8 & YOLO11 — Detection, Segmentation, Pose & More!
Computer Vision
Muhammad Moin Intermediate 6mo ago
Gemini 3 Demo: Building a Music Rhythm Game with Computer Vision
Computer Vision
Google for Developers Intermediate 6mo ago
Why are Transformers replacing CNNs?
Computer Vision
Julia Turc Beginner 6mo ago
Should AI be introduced to kids early? #podcast #interview
Computer Vision ⚡ AI Lesson
Abhishek Thakur Beginner 6mo ago
Stanford Robotics Seminar ENGR319 | Autumn 2025 | General Compliant Robot Interaction
Computer Vision ⚡ AI Lesson
Stanford Online Intermediate 6mo ago
AI Video Editing Hack
Computer Vision ⚡ AI Lesson
Matt Wolfe Intermediate 6mo ago
Multimodal and Multi-model AI in Action
Computer Vision
Microsoft 365 Developer Beginner 6mo ago
InferenceJS: Real-time computer vision in your browser
Computer Vision
Chrome for Developers Intermediate 6mo ago
YOLO26 Fine-Tuning | Detection and Instance Segmentation | Live Coding + Q&A (Jan 15th)
Computer Vision ⚡ AI Lesson
Roboflow Advanced 4mo ago
Google T5Gemma 2 Explained: The AI Built for Long Documents & Multimodal Reasoning
Computer Vision
Analytics Vidhya Beginner 5mo ago
AI for Occupancy Analytics | Building a Smart Parking System
Computer Vision ⚡ AI Lesson
Roboflow Beginner 5mo ago
Roboflow Rapid Livestream | Use text prompts to train vision models
Computer Vision ⚡ AI Lesson
Roboflow Intermediate 5mo ago
Basketball AI: Player Tracking, Team Detection, and Number Recognition with Python
Computer Vision ⚡ AI Lesson
Roboflow Advanced 6mo ago
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Computer Vision
Muhammad Moin Beginner 6mo ago
I Took the Duolingo English Test and Here’s What Happened
Computer Vision
Teacher Luke - Duolingo English Test Beginner 6mo ago
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Computer Vision ⚡ AI Lesson
Muhammad Moin Beginner 6mo ago
SAM 3: The AI That Lets You “Segment Anything” — Images, Videos & Concepts
Computer Vision
Analytics Vidhya Intermediate 6mo ago
Duolingo Test SPEAKING Practice! Interactive Speaking - 7 Questions & Answers
Computer Vision
Teacher Luke - Duolingo English Test Intermediate 6mo ago
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Computer Vision ⚡ AI Lesson
Roboflow Beginner 6mo ago
📚 Continue on Coursera External links · Free to audit
Supply Chain Sourcing
📚 External: Coursera ↗
Self-paced
Future of data and technology in football
📚 External: Coursera ↗
Self-paced
Marketing Management
📚 External: Coursera ↗
Self-paced
Behavioral Marketing
📚 External: Coursera ↗
Self-paced
Computer Vision: Face Recognition Quick Starter in Python
📚 External: Coursera ↗
Self-paced
Preparing Multimodal Data: Vision, Audio, and NLP Pipelines
📚 External: Coursera ↗
Self-paced