Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,332 lessons
Skills in this topic
CV Basics (beginner): Classify images with a pre-trained CNN
Modern CV Models (intermediate): Run YOLO for real-time object detection
Generative CV (advanced): Build a Stable Diffusion inference pipeline
Video Analytics with AI | Live Coding & Q&A (Oct 9th)
Computer Vision
Roboflow Intermediate 1y ago
Testing CA’s Computer Vision Robot Arm @LEGO @raspberrypi @Core-Electronics
Computer Vision
Creator Academy Australia Intermediate 1y ago
GPT-4o: Fine-tune OpenAI's Multimodal Model | Live Coding & Q&A (Oct 3rd)
Computer Vision
Roboflow Intermediate 1y ago
The era of unbounded products: Designing for Multimodal IO: Ben Hylak
Computer Vision
AI Engineer Intermediate 1y ago
Why Zero Trust is the Key to Cybersecurity in 2024 and Beyond
Computer Vision ⚡ AI Lesson
SANS Institute Intermediate 1y ago
I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.
Computer Vision
Neil Patel Intermediate 1y ago
Qwen2-VL: The Best Open Source Vision Model for OCR & VQA
Computer Vision ⚡ AI Lesson
AI Anytime Intermediate 1y ago
Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed
Computer Vision ⚡ AI Lesson
DataCamp Intermediate 1y ago
How to run SAM 2 (Segment Anything AI Model)?
Computer Vision ⚡ AI Lesson
AI Anytime Intermediate 1y ago
SAM 2 is going to transform COMPUTER VISION!!!
Computer Vision
1littlecoder Intermediate 1y ago
Excitement for the Generative AI era: Multi-Modal inputs
Computer Vision
Weights & Biases Intermediate 1y ago
Reimagine document processing and understanding with generative AI
Computer Vision
Google Cloud Intermediate 1y ago
Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...
Computer Vision ⚡ AI Lesson
Cohere Intermediate 1y ago
Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni
2:29
Computer Vision ⚡ AI Lesson
Burned Guitarist Intermediate 1y ago
New2Cyber en Espanol | The End of the Era of the Security Professional
Computer Vision ⚡ AI Lesson
SANS Institute Intermediate 2y ago
How To Fine-tune LLaVA Model (From Your Laptop!)
Computer Vision
Brev Intermediate 2y ago
Stanford Seminar - Silicon Valley & The U.S. Government: Vannevar Lab's Brett Granberg
Computer Vision
Stanford Online Intermediate 2y ago
Bringing AI to the Masses with Adam D'Angelo, CEO of Quora
Computer Vision ⚡ AI Lesson
a16z Intermediate 2y ago
Multi-Modal NSFW Detection with AI
Computer Vision
James Briggs Intermediate 2y ago
This VLM can be your MultiModal AI with less than 6GB Memory!!!
Computer Vision
1littlecoder Intermediate 2y ago
New course with Hugging Face: Open Source Models with Hugging Face
Computer Vision ⚡ AI Lesson
DeepLearningAI Intermediate 2y ago
Multimodality: The Next Big Step (Demis Hassabis - Google DeepMind CEO)
Computer Vision
Dwarkesh Patel Intermediate 2y ago
Vision Transformer (ViT)
Computer Vision
Machine Learning Studio Intermediate 2y ago
The Future Of Computer Vision
Computer Vision ⚡ AI Lesson
a16z Intermediate 2y ago
Create a Custom Document Extractor with Document AI
Computer Vision ⚡ AI Lesson
Google Cloud Tech Intermediate 2y ago
Tune in to know what are the most exciting opportunities to look out for in computer vision!
Computer Vision ⚡ AI Lesson
The TWIML AI Podcast with Sam Charrington Intermediate 2y ago
Vision community faces evaluation challenges and should lean on cost-effective automatic evaluation
Computer Vision ⚡ AI Lesson
The TWIML AI Podcast with Sam Charrington Intermediate 2y ago
Image segmentation - ML on Android with MediaPipe Series
Computer Vision
Google for Developers Intermediate 2y ago
The real reason behind digital transformation?
Computer Vision
Google Cloud Intermediate 2y ago
Want to find the BEST segmentation for your business?
Computer Vision
Adam Erhart Intermediate 2y ago
Accelerating Explorations in Vision and Multimodal AI Using Pytorch...- Nicolas, Philip, Evan & Peng
Computer Vision
PyTorch Intermediate 2y ago
TIME Best Invention of 2023: NVIDIA Neuralangelo
Computer Vision
NVIDIA Developer Intermediate 2y ago
Next big thing in Gen AI | Sandeep Singh, Head of Applied AI @ Beans.AI | Leading With Data 02
Computer Vision ⚡ AI Lesson
Analytics Vidhya Intermediate 2y ago
META releases new Translation AI: SeamlessM4T for 100 languages
Computer Vision ⚡ AI Lesson
Discover AI Intermediate 2y ago
Segmentation in Email Automation Hacks
0:14
Computer Vision ⚡ AI Lesson
Email Mastery Pro Intermediate 2y ago
AWS ML Heroes in 15: Amazon Rekognition for Wildlife Conservation-AWS Machine Learning in 15
Computer Vision
AWS Developers Intermediate 2y ago
No Priors Ep. 24 | With Devi Parikh from Meta
Computer Vision
No Priors: AI, Machine Learning, Tech, & Startups Intermediate 2y ago
YOLO11: How to Train for Object Detection | Live Coding & Q&A (Sep 30)
Computer Vision
Roboflow Intermediate 1y ago
Using RTSP Streams for Computer Vision | Tracking & Counting Objects
Computer Vision
Roboflow Intermediate 1y ago
Use Dedicated Deployments with Computer Vision Workflows
Computer Vision
Roboflow Intermediate 1y ago
Football AI Tutorial: From Basics to Advanced Stats with Python
Computer Vision
Roboflow Intermediate 1y ago
Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI
Computer Vision ⚡ AI Lesson
Roboflow Intermediate 1y ago
PaliGemma by Google: Train Model on Custom Detection Dataset
Computer Vision
Roboflow Intermediate 1y ago
YOLOv9 Tutorial: Train Model on Custom Dataset | How to Deploy YOLOv9
Computer Vision ⚡ AI Lesson
Roboflow Intermediate 2y ago
Big Ideas 2024: New Applications for Computer Vision and Video Intelligence with Kimberly Tan
Computer Vision ⚡ AI Lesson
a16z Intermediate 2y ago
C360 for BigQuery powered by Lytics fuels next gen AI, analytics, and predictions
Computer Vision
Google Cloud Intermediate 2y ago
AI.engineer 2023: Live Coding a Multimodal Game, paint.wtf
Computer Vision
Roboflow Intermediate 2y ago
Autodistill: Train YOLOv8 with ZERO Annotations
Computer Vision ⚡ AI Lesson
Roboflow Intermediate 2y ago
📚 Coursera Courses Opens on Coursera · Free to audit
Marketing Communications: Intro to Consumer Behavior
📚 Coursera Course ↗
Self-paced
Videojuegos: ¿de qué hablamos?
📚 Coursera Course ↗
Self-paced
Open Source Models with Hugging Face
📚 Coursera Course ↗
Self-paced
H2O Cloud AI Developer Services
📚 Coursera Course ↗
Self-paced
Introduction to Image Processing
📚 Coursera Course ↗
Self-paced
Deep Learning Applications for Computer Vision
📚 Coursera Course ↗
Self-paced