Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,332
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
Meta Engineer on Industrial Computer Vision systems
Computer Vision
Meta Engineer on Industrial Computer Vision systems
MLOps.community Intermediate 6mo ago
Duolingo English Test - NEW Complete Practice Test with Answers
Computer Vision
Duolingo English Test - NEW Complete Practice Test with Answers
Teacher Luke - Duolingo English Test Intermediate 6mo ago
The SECRET to Hyper Segmentation (and Sales)
0:35
Computer Vision ⚡ AI Lesson
The SECRET to Hyper Segmentation (and Sales)
Optimum7 Intermediate 7mo ago
Industrial AI Machine Vision in Action with Databricks & Crosser
Computer Vision ⚡ AI Lesson
Industrial AI Machine Vision in Action with Databricks & Crosser
Databricks Intermediate 7mo ago
"Smartest" VISION AI in Cars Do Reasoning?
Computer Vision
"Smartest" VISION AI in Cars Do Reasoning?
Discover AI Intermediate 7mo ago
RF-DETR: How to Train SOTA for Object Detection on a Custom Dataset | Step-by-step guide
Computer Vision
RF-DETR: How to Train SOTA for Object Detection on a Custom Dataset | Step-by-step guide
Roboflow Intermediate 8mo ago
New Way Now: Simbe's AI robotic vision tech improves retail sales and margin with Google Cloud
Computer Vision
New Way Now: Simbe's AI robotic vision tech improves retail sales and margin with Google Cloud
Google Cloud Intermediate 8mo ago
EV Pickups Are a Bust for US Carmakers
Computer Vision
EV Pickups Are a Bust for US Carmakers
Bloomberg Technology Intermediate 8mo ago
Vision AI in 2025 — Peter Robicheaux, Roboflow
Computer Vision
Vision AI in 2025 — Peter Robicheaux, Roboflow
AI Engineer Intermediate 9mo ago
The Segmentation Tweak That Quietly BOOSTS Klaviyo Revenue #shorts #emailmarketing
1:26
Computer Vision ⚡ AI Lesson
The Segmentation Tweak That Quietly BOOSTS Klaviyo Revenue #shorts #emailmarketing
Emissary 2.0 Intermediate 9mo ago
I trained an AI Model to Detect Trading Candlesticks (from scratch using ViTs)
Computer Vision ⚡ AI Lesson
I trained an AI Model to Detect Trading Candlesticks (from scratch using ViTs)
Nicholas Renotte Intermediate 9mo ago
DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
Computer Vision ⚡ AI Lesson
DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
Microsoft Research Intermediate 9mo ago
Is Your Business Running on Empty? 🤖
Computer Vision
Is Your Business Running on Empty? 🤖
imFORZA Intermediate 10mo ago
How to Fine-Tune SmolVLM2 | Convert Documents into JSON
Computer Vision
How to Fine-Tune SmolVLM2 | Convert Documents into JSON
Roboflow Intermediate 10mo ago
Transforming Data Governance for Multimodal Data at Amgen With Databricks
Computer Vision ⚡ AI Lesson
Transforming Data Governance for Multimodal Data at Amgen With Databricks
Databricks Intermediate 10mo ago
Multimodal Open Source at Kyutai, From Online Demos to On-Device - Alexandre Défossez
Computer Vision
Multimodal Open Source at Kyutai, From Online Demos to On-Device - Alexandre Défossez
PyTorch Intermediate 11mo ago
MedGemma LLM: Doctors, Meet Your AI Assistant 🧠
Computer Vision ⚡ AI Lesson
MedGemma LLM: Doctors, Meet Your AI Assistant 🧠
AI Anytime Intermediate 11mo ago
China’s ByteDance Just Dropped BAGEL — Multimodal AI Beast!
Computer Vision
China’s ByteDance Just Dropped BAGEL — Multimodal AI Beast!
Analytics Vidhya Intermediate 11mo ago
Uber CEO Dara Khosrowshahi on the company's new Route Share feature. Presented by @AdobeExpress
Computer Vision
Uber CEO Dara Khosrowshahi on the company's new Route Share feature. Presented by @AdobeExpress
The Verge Intermediate 11mo ago
The Shape of Intelligence
Computer Vision ⚡ AI Lesson
The Shape of Intelligence
Latent Space Intermediate 11mo ago
How to Segment Your Audience in Mailchimp
9:16
Computer Vision ⚡ AI Lesson
How to Segment Your Audience in Mailchimp
Intuit Mailchimp Intermediate 1y ago
Intuit uses Google Cloud Document AI to further simplify tax prep for millions
Computer Vision
Intuit uses Google Cloud Document AI to further simplify tax prep for millions
Google Cloud Intermediate 1y ago
Expedition Aya Kick Off Event
Computer Vision
Expedition Aya Kick Off Event
Cohere Intermediate 1y ago
Building a travel buddy with Gemma
Computer Vision
Building a travel buddy with Gemma
Google for Developers Intermediate 1y ago
Peter Tong - MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
Computer Vision
Peter Tong - MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
Cohere Intermediate 1y ago
How to Quickly Leverage Computer Vision in Python
Computer Vision ⚡ AI Lesson
How to Quickly Leverage Computer Vision in Python
Data Professor Intermediate 1y ago
Next Multi trillion dollar industry?
Computer Vision
Next Multi trillion dollar industry?
Full Disclosure Intermediate 1y ago
DeepSeek’s Janus-Pro-7B Crushes DALL·E 3!  #deepseek #openai
Computer Vision
DeepSeek’s Janus-Pro-7B Crushes DALL·E 3! #deepseek #openai
Analytics Vidhya Intermediate 1y ago
This Python module is your go-to for speech and image recognition!
Computer Vision ⚡ AI Lesson
This Python module is your go-to for speech and image recognition!
Tech With Tim Intermediate 1y ago
Not ElevenLabs, This new #1 Text to Speech AI is FREE!!!!
Computer Vision
Not ElevenLabs, This new #1 Text to Speech AI is FREE!!!!
1littlecoder Intermediate 1y ago
Next AI Project is Image Classification in Python🔍🤖
Computer Vision ⚡ AI Lesson
Next AI Project is Image Classification in Python🔍🤖
Tech With Tim Intermediate 1y ago
Best of 2024 in Vision [LS Live @ NeurIPS]
Computer Vision ⚡ AI Lesson
Best of 2024 in Vision [LS Live @ NeurIPS]
Latent Space Intermediate 1y ago
How to Do Email Segmentation the Right Way
0:47
Computer Vision ⚡ AI Lesson
How to Do Email Segmentation the Right Way
Spark Bridge Digital | Email Marketing Agency Intermediate 1y ago
OpenAI DevDay 2024 | Multimodal apps with the Realtime API
Computer Vision
OpenAI DevDay 2024 | Multimodal apps with the Realtime API
OpenAI Intermediate 1y ago
Ethan Norville EXPOSES Coronation Project Secrets
Computer Vision
Ethan Norville EXPOSES Coronation Project Secrets
Professor Charley T Intermediate 1y ago
MediaPipe Web: Bringing cross-platform AI tech to the browser
Computer Vision ⚡ AI Lesson
MediaPipe Web: Bringing cross-platform AI tech to the browser
Chrome for Developers Intermediate 1y ago
Moondream: how does a tiny vision model slap so hard? — Vikhyat Korrapati
Computer Vision ⚡ AI Lesson
Moondream: how does a tiny vision model slap so hard? — Vikhyat Korrapati
AI Engineer Intermediate 1y ago
Transformers.js: State-of-the-art Machine Learning for the web
Computer Vision ⚡ AI Lesson
Transformers.js: State-of-the-art Machine Learning for the web
Chrome for Developers Intermediate 1y ago
Stanford Seminar - Open-world Segmentation and Tracking in 3D
Computer Vision
Stanford Seminar - Open-world Segmentation and Tracking in 3D
Stanford Online Intermediate 1y ago
The Next Decade in AI and Computer Vision
Computer Vision ⚡ AI Lesson
The Next Decade in AI and Computer Vision
a16z Intermediate 1y ago
Multimodal RAG YT Video
Computer Vision
Multimodal RAG YT Video
Srikantan Sankaran Intermediate 1y ago
Drowsiness Detection with Vision AI | Improve Safety with AI
Computer Vision
Drowsiness Detection with Vision AI | Improve Safety with AI
Roboflow Intermediate 11mo ago
Multimodal AI & Next Gen Databases | Data Brew | Episode 42
Computer Vision ⚡ AI Lesson
Multimodal AI & Next Gen Databases | Data Brew | Episode 42
Databricks Intermediate 1y ago
RF-DETR, Batch Processing, Instant Training, Serverless Inference, and More | What's New in Roboflow
Computer Vision
RF-DETR, Batch Processing, Instant Training, Serverless Inference, and More | What's New in Roboflow
Roboflow Intermediate 1y ago
Build an AI-Powered Self-Serve Checkout & Cost Calculator in 10 Minutes (Almost)
Computer Vision
Build an AI-Powered Self-Serve Checkout & Cost Calculator in 10 Minutes (Almost)
Roboflow Intermediate 1y ago
Measure Liquid Levels with AI | Build a Web App Powered by Computer Vision
Computer Vision
Measure Liquid Levels with AI | Build a Web App Powered by Computer Vision
Roboflow Intermediate 1y ago
Florence-2: Create and Deploy a Custom Vision Language Model
Computer Vision
Florence-2: Create and Deploy a Custom Vision Language Model
Roboflow Intermediate 1y ago
YOLO11: Performance Benchmark and Real World Use Cases
Computer Vision
YOLO11: Performance Benchmark and Real World Use Cases
Roboflow Intermediate 1y ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Intro to Operating Systems 2: Memory Management
📚 Coursera Course ↗
Self-paced
Intro to Operating Systems 2: Memory Management
Opens on Coursera ↗
IoT Networking
📚 Coursera Course ↗
Self-paced
IoT Networking
Opens on Coursera ↗
Advancing Your Career in Computer Vision Engineering
📚 Coursera Course ↗
Self-paced
Advancing Your Career in Computer Vision Engineering
Opens on Coursera ↗
Features and Boundaries
📚 Coursera Course ↗
Self-paced
Features and Boundaries
Opens on Coursera ↗
Future of data and technology in football
📚 Coursera Course ↗
Self-paced
Future of data and technology in football
Opens on Coursera ↗
Automating Image Processing
📚 Coursera Course ↗
Self-paced
Automating Image Processing
Opens on Coursera ↗