Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,332
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.
Computer Vision
I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.
Neil Patel Intermediate 1y ago
Missy Franklin, Angela Ruggiero & Ashton Eaton | Olympic Panel | Talks at Google
Computer Vision
Missy Franklin, Angela Ruggiero & Ashton Eaton | Olympic Panel | Talks at Google
Talks at Google Advanced 1y ago
C4AI Expedition Aya - Most Promising Prize: Maya: Multimodal Aya
Computer Vision
C4AI Expedition Aya - Most Promising Prize: Maya: Multimodal Aya
Cohere Beginner 1y ago
Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum
Computer Vision
Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum
Microsoft Research Advanced 1y ago
Qwen2-VL: The Best Open Source Vision Model for OCR & VQA
Computer Vision ⚡ AI Lesson
Qwen2-VL: The Best Open Source Vision Model for OCR & VQA
AI Anytime Intermediate 1y ago
Football AI | Community Q&A (Aug 29)
Computer Vision ⚡ AI Lesson
Football AI | Community Q&A (Aug 29)
Roboflow Advanced 1y ago
Exploring Robotics and Python Through Electronic Projects | Real Python Podcast #218
Computer Vision
Exploring Robotics and Python Through Electronic Projects | Real Python Podcast #218
Real Python Beginner 1y ago
Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed
Computer Vision ⚡ AI Lesson
Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed
DataCamp Intermediate 1y ago
Football AI Tutorial: From Basics to Advanced Stats with Python
Computer Vision
Football AI Tutorial: From Basics to Advanced Stats with Python
Roboflow Intermediate 1y ago
Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson
Computer Vision
Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson
Latent Space Advanced 1y ago
How to run SAM 2 (Segment Anything AI Model)?
Computer Vision ⚡ AI Lesson
How to run SAM 2 (Segment Anything AI Model)?
AI Anytime Intermediate 1y ago
JETSON AI LAB | Research Group Meeting (8/6/2024)
Computer Vision
JETSON AI LAB | Research Group Meeting (8/6/2024)
NVIDIA Developer Advanced 1y ago
Meta Unveils Segment Anything 2: Revolutionizing Image and 3D Segmentation! #meta #ai #genai
Computer Vision
Meta Unveils Segment Anything 2: Revolutionizing Image and 3D Segmentation! #meta #ai #genai
Deepak Bhaskaran Beginner 1y ago
Boost #WorkplaceSafety with Intenseye, an AI-powered employee health and safety (EHS) platform.
Computer Vision
Boost #WorkplaceSafety with Intenseye, an AI-powered employee health and safety (EHS) platform.
Google Cloud Beginner 1y ago
SAM 2 is going to transform COMPUTER VISION!!!
Computer Vision
SAM 2 is going to transform COMPUTER VISION!!!
1littlecoder Intermediate 1y ago
Audience Segmentation Tips: 3 Ways to Segment Your Email List
3:24
Computer Vision ⚡ AI Lesson
Audience Segmentation Tips: 3 Ways to Segment Your Email List
Klaviyo Advanced 1y ago
An Overview of Object Recognition Tasks
Computer Vision ⚡ AI Lesson
An Overview of Object Recognition Tasks
Machine Learning Studio Beginner 1y ago
Excitement for the Generative AI era: Multi-Modal inputs
Computer Vision
Excitement for the Generative AI era: Multi-Modal inputs
Weights & Biases Intermediate 1y ago
Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - 692
Computer Vision ⚡ AI Lesson
Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - 692
The TWIML AI Podcast with Sam Charrington Advanced 1y ago
Denoising Images with OpenCV in Python
Computer Vision ⚡ AI Lesson
Denoising Images with OpenCV in Python
NeuralNine Beginner 1y ago
Reimagine document processing and understanding with generative AI
Computer Vision
Reimagine document processing and understanding with generative AI
Google Cloud Intermediate 1y ago
Microsoft's Florence 2: Breaking Boundaries in AI Vision Language!
Computer Vision
Microsoft's Florence 2: Breaking Boundaries in AI Vision Language!
Mervin Praison Beginner 1y ago
Florence 2 - The Best Small VLM Out There?
Computer Vision ⚡ AI Lesson
Florence 2 - The Best Small VLM Out There?
Sam Witteveen Beginner 1y ago
New Microsoft Vision Model has AMAZING TRICKS!!!
Computer Vision ⚡ AI Lesson
New Microsoft Vision Model has AMAZING TRICKS!!!
1littlecoder Advanced 1y ago
From Robotics to Recommender Systems // Miguel Fierro // MLOps Podcast #240
Computer Vision ⚡ AI Lesson
From Robotics to Recommender Systems // Miguel Fierro // MLOps Podcast #240
MLOps.community Beginner 1y ago
Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum
Computer Vision ⚡ AI Lesson
Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum
Microsoft Research Advanced 1y ago
OpenAI CLIP model explained
Computer Vision
OpenAI CLIP model explained
Machine Learning Studio Beginner 1y ago
Using PAM EXEC to Log Passwords on Linux
Computer Vision ⚡ AI Lesson
Using PAM EXEC to Log Passwords on Linux
IppSec Beginner 1y ago
Robotics AI for Industrial Applications
Computer Vision
Robotics AI for Industrial Applications
Weights & Biases Advanced 1y ago
Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...
Computer Vision ⚡ AI Lesson
Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...
Cohere Intermediate 1y ago
Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni
2:29
Computer Vision ⚡ AI Lesson
Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni
Burned Guitarist Intermediate 1y ago
Getting started With Google's PaliGemma: Open Vision-Language Model
Computer Vision ⚡ AI Lesson
Getting started With Google's PaliGemma: Open Vision-Language Model
Krish Naik Beginner 1y ago
New2Cyber en Espanol | El final de la era del profesional de seguridad
Computer Vision ⚡ AI Lesson
New2Cyber en Espanol | El final de la era del profesional de seguridad
SANS Institute Intermediate 2y ago
How To Fine-tune LLaVA Model (From Your Laptop!)
Computer Vision
How To Fine-tune LLaVA Model (From Your Laptop!)
Brev Intermediate 2y ago
New course with Comet: Prompt Engineering for Vision Models
Computer Vision ⚡ AI Lesson
New course with Comet: Prompt Engineering for Vision Models
DeepLearningAI Beginner 2y ago
It's easy to get stuck in our ways
Computer Vision ⚡ AI Lesson
It's easy to get stuck in our ways
General Musings with Kevin Powell Beginner 2y ago
Analyze documents in BigQuery with Document AI
Computer Vision
Analyze documents in BigQuery with Document AI
Google Cloud Tech Beginner 2y ago
Pose landmark detection - ML on Web with MediaPipe: Episode 8
Computer Vision
Pose landmark detection - ML on Web with MediaPipe: Episode 8
Google for Developers Beginner 2y ago
Build an AI/ML Football Analysis system with YOLO, OpenCV, and Python
Computer Vision
Build an AI/ML Football Analysis system with YOLO, OpenCV, and Python
Code In a Jiffy Beginner 2y ago
Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI
Computer Vision ⚡ AI Lesson
Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI
Roboflow Intermediate 1y ago
AI-Assisted Data Labeling | Weekly Roboflow Product Session
Computer Vision
AI-Assisted Data Labeling | Weekly Roboflow Product Session
Roboflow Beginner 1y ago
Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)
Computer Vision
Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)
Roboflow Advanced 1y ago
Florence-2: Fine-tune Microsoft’s Multimodal Model
Computer Vision
Florence-2: Fine-tune Microsoft’s Multimodal Model
Roboflow Beginner 1y ago
How good is YOLOv10? | Hacking Google's new VLM, PaliGemma | Community Q&A (Jun 6)
Computer Vision
How good is YOLOv10? | Hacking Google's new VLM, PaliGemma | Community Q&A (Jun 6)
Roboflow Beginner 1y ago
PaliGemma by Google: Train Model on Custom Detection Dataset
Computer Vision
PaliGemma by Google: Train Model on Custom Detection Dataset
Roboflow Intermediate 1y ago
What is Document AI?
Computer Vision
What is Document AI?
Google Cloud Beginner 1y ago
Build computer vision applications easily with Roboflow and Google Cloud
Computer Vision
Build computer vision applications easily with Roboflow and Google Cloud
Google Cloud Advanced 2y ago
Dwell Time Analysis | Real-Time Stream Processing | Community Q&A (April 11)
Computer Vision
Dwell Time Analysis | Real-Time Stream Processing | Community Q&A (April 11)
Roboflow Beginner 2y ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Analyze Video Data Using OpenCV and Python
📚 Coursera Course ↗
Self-paced
Analyze Video Data Using OpenCV and Python
Opens on Coursera ↗
Preparing Multimodal Data: Vision, Audio, and NLP Pipelines
📚 Coursera Course ↗
Self-paced
Preparing Multimodal Data: Vision, Audio, and NLP Pipelines
Opens on Coursera ↗
AI for Video Production
📚 Coursera Course ↗
Self-paced
AI for Video Production
Opens on Coursera ↗
Marketing Communications: Intro to Consumer Behavior
📚 Coursera Course ↗
Self-paced
Marketing Communications: Intro to Consumer Behavior
Opens on Coursera ↗
Anatomy of the Abdomen and Pelvis; a journey from basis to clinic.
📚 Coursera Course ↗
Self-paced
Anatomy of the Abdomen and Pelvis; a journey from basis to clinic.
Opens on Coursera ↗
Marketing Fundamentals Mastery: Apply, Analyze & Evaluate
📚 Coursera Course ↗
Self-paced
Marketing Fundamentals Mastery: Apply, Analyze & Evaluate
Opens on Coursera ↗