1,132 lessons

👁️ Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

Next AI Project is Image Classification in Python🔍🤖
Computer Vision ⚡ AI Lesson
Tech With Tim Intermediate 1y ago
YOLOv2 (YOLO9000) and YOLOv3 Explained
Computer Vision ⚡ AI Lesson
ExplainingAI Advanced 1y ago
Best of 2024 in Vision [LS Live @ NeurIPS]
Computer Vision ⚡ AI Lesson
Latent Space Intermediate 1y ago
How to Do Email Segmentation the Right Way
Computer Vision ⚡ AI Lesson
Spark Bridge Digital | Email Marketing Agency Intermediate 1y ago
OpenAI DevDay 2024 | Multimodal apps with the Realtime API
Computer Vision
OpenAI Intermediate 1y ago
New Video AI by META & Stanford Univ: APOLLO (7B)
Computer Vision ⚡ AI Lesson
Discover AI Advanced 1y ago
Ethan Norville EXPOSES Coronation Project Secrets
Computer Vision
Professor Charley T Intermediate 1y ago
Latent Space LIVE! - Best of 2024: Startups, Vision, Open Src, Reasoning, & The Great Scaling Debate
Computer Vision ⚡ AI Lesson
Latent Space Beginner 1y ago
Florence-2: Create and Deploy a Custom Vision Language Model
Computer Vision
Roboflow Intermediate 1y ago
Use Semantic Search to Create Computer Vision Datasets
Computer Vision
Roboflow Beginner 1y ago
Shreyash Arya- B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable
Computer Vision
Cohere Advanced 1y ago
Peng Xia - RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
Computer Vision
Cohere Advanced 1y ago
MediaPipe Web: Bringing cross-platform AI tech to the browser
Computer Vision ⚡ AI Lesson
Chrome for Developers Intermediate 1y ago
Multimodal Embeddings: Introduction & Use Cases (with Python)
Computer Vision
Shaw Talebi Beginner 1y ago
Demo Lecture-Image Processing-Computer Vision With Generative AI Bootcamp With Doubts Solving
Computer Vision
Krish Naik Beginner 1y ago
Insights from a Kaggle Grandmaster: Multimodal Models, Agents, Document AI & more
Computer Vision
Analytics Vidhya Beginner 1y ago
MedAI: Vision Language Models & Fine-Tuning (KnowAda)
Computer Vision
Discover AI Advanced 1y ago
Moondream: how does a tiny vision model slap so hard? — Vikhyat Korrapati
Computer Vision ⚡ AI Lesson
AI Engineer Intermediate 1y ago
Transformers.js: State-of-the-art Machine Learning for the web
Computer Vision ⚡ AI Lesson
Chrome for Developers Intermediate 1y ago
NLP Engineer & Computer Vision Engineer #codebasics #nlp #computervision #datajob #shorts
Computer Vision ⚡ AI Lesson
codebasics Beginner 1y ago
Stanford Seminar - Open-world Segmentation and Tracking in 3D
Computer Vision
Stanford Online Intermediate 1y ago
Revolutionizing sign language with AI
Computer Vision ⚡ AI Lesson
TensorFlow Official Beginner 1y ago
Neuralift AI builds trust using W&B Weave
Computer Vision
Weights & Biases Beginner 1y ago
The Next Decade in AI and Computer Vision
Computer Vision ⚡ AI Lesson
a16z Intermediate 1y ago
Single Shot Multibox Detector | SSD Object Detection Explained and Implemented
Computer Vision
ExplainingAI Beginner 1y ago
Data As a Corporate Asset—the GenAI-era Take (Part 2)
Computer Vision ⚡ AI Lesson
Microsoft Developer Beginner 1y ago
Computer Vision Explained in 30s
Computer Vision
365 Data Science Beginner 1y ago
Multimodal RAG YT Video
Computer Vision
Srikantan Sankaran Intermediate 1y ago
New Way Now: Plenitude streamlines customer onboarding and fraud prevention with Google Cloud AI
Computer Vision
Google Cloud Beginner 1y ago
Testing CA’s Computer Vision Robot Arm @LEGO @raspberrypi @Core-Electronics
Computer Vision
Creator Academy Australia Intermediate 1y ago
Blobs to Clips: Efficient End-to-End Video Data Loading - Andrew Ho & Ahmad Sharif, Meta
Computer Vision ⚡ AI Lesson
PyTorch Beginner 1y ago
Llama 3.2: Best Multimodal Model Yet? (Vision Test)
Computer Vision ⚡ AI Lesson
Mervin Praison Beginner 1y ago
CS50x 2025 - Lecture 4 - Memory
Computer Vision
CS50 Beginner 1y ago
Data As a Corporate Asset—the GenAI-era Take (Part 1)
Computer Vision
Microsoft Developer Beginner 1y ago
Free Live 3 Days Computer Vision and Object Detection Workshop
Computer Vision
Krish Naik Beginner 1y ago
Using PyTorch for Monocular Depth Estimation Webinar
Computer Vision ⚡ AI Lesson
PyTorch Beginner 1y ago
SAM-2.1: How to Fine-Tune for Image Segmentation
Computer Vision
Roboflow Beginner 1y ago
How to Build a Smart Parking System - License Plate Detection & OCR
Computer Vision ⚡ AI Lesson
Roboflow Beginner 1y ago
Web AI Summit 2024: State of client side machine learning
Computer Vision ⚡ AI Lesson
Chrome for Developers Beginner 1y ago
Gwanghyun (Bradley) Kim - BeyondScene: Higher-Resolution Human-Scene Generation
Computer Vision
Cohere Advanced 1y ago
[Paper Club] SWE-Bench [OpenAI Verified/Multimodal] + MLE-Bench with Jesse Hu
Computer Vision ⚡ AI Lesson
Latent Space Beginner 1y ago
YOLOv11: How to Train for Object Detection on a Custom Dataset | Step-by-step guide
Computer Vision
Roboflow Beginner 1y ago
YOLO11: Performance Benchmark and Real World Use Cases
Computer Vision
Roboflow Intermediate 1y ago
Video Analytics with AI | Live Coding & Q&A (Oct 9th)
Computer Vision
Roboflow Intermediate 1y ago
How to use OCR | Get Started with Optical Character Recognition
Computer Vision
Roboflow Beginner 1y ago
GPT-4o: Fine-tune OpenAI's Multimodal Model | Live Coding & Q&A (Oct 3rd)
Computer Vision
Roboflow Intermediate 1y ago
YOLO11: How to Train for Object Detection | Live Coding & Q&A (Sep 30)
Computer Vision
Roboflow Intermediate 1y ago
Using RTSP Streams for Computer Vision | Tracking & Counting Objects
Computer Vision
Roboflow Intermediate 1y ago
📚 Continue on Coursera External links · Free to audit
Market Research Case Study: Apply & Analyze
📚 External: Coursera ↗
Self-paced
Implement Real-Time Face Detection with OpenCV & Python
📚 External: Coursera ↗
Self-paced
Form Parsing Using Document AI
📚 External: Coursera ↗
Self-paced
AI and Disaster Management
📚 External: Coursera ↗
Self-paced
Business Economics and Game Theory for Decision Making
📚 External: Coursera ↗
Self-paced
Humanidades digitales
📚 External: Coursera ↗
Self-paced