Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539 lessons
Skills in this topic
CV Basics (beginner): Classify images with a pre-trained CNN
Modern CV Models (intermediate): Run YOLO for real-time object detection
Generative CV (advanced): Build a Stable Diffusion inference pipeline
open-animal-tracks
Computer Vision ⚡ AI Lesson
Data Skeptic Advanced 1y ago
Model Evaluation for Computer Vision
Computer Vision ⚡ AI Lesson
Roboflow Beginner 1y ago
Bird Distribution Modeling with Satbird
Computer Vision ⚡ AI Lesson
Data Skeptic Advanced 1y ago
Active Learning in Computer Vision
Computer Vision
Roboflow Beginner 1y ago
I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.
Computer Vision
Neil Patel Intermediate 1y ago
YOLO Object Detection | YoloV1 Explanation and Implementation Tutorial
Computer Vision
ExplainingAI Advanced 1y ago
Organize PDFs Efficiently: Build a Streamlit PDF Sorter Application using LangChain and Llama 3.1
Computer Vision
Muhammad Moin Intermediate 1y ago
C4AI Expedition Aya - Most Promising Prize: Maya: Multimodal Aya
Computer Vision
Cohere Beginner 1y ago
Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum
Computer Vision
Microsoft Research Advanced 1y ago
Qwen2-VL: The Best Open Source Vision Model for OCR & VQA
Computer Vision ⚡ AI Lesson
AI Anytime Intermediate 1y ago
Exploring Robotics and Python Through Electronic Projects | Real Python Podcast #218
Computer Vision
Real Python Beginner 1y ago
Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed
Computer Vision ⚡ AI Lesson
DataCamp Intermediate 1y ago
Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson
Computer Vision
Latent Space Advanced 1y ago
How to run SAM 2 (Segment Anything AI Model)?
Computer Vision ⚡ AI Lesson
AI Anytime Intermediate 1y ago
JETSON AI LAB | Research Group Meeting (8/6/2024)
Computer Vision
NVIDIA Developer Advanced 1y ago
Meta Unveils Segment Anything 2: Revolutionizing Image and 3D Segmentation! #meta #ai #genai
Computer Vision
Deepak Bhaskaran Beginner 1y ago
Boost #WorkplaceSafety with Intenseye, an AI-powered employee health and safety (EHS) platform.
Computer Vision
Google Cloud Beginner 1y ago
SAM 2 is going to transform COMPUTER VISION!!!
Computer Vision
1littlecoder Intermediate 1y ago
LlamaIndex Webinar: ColPali - Efficient Document Retrieval with Vision Language Models
Computer Vision
LlamaIndex Advanced 1y ago
Real-Time Object Tracking using YOLO10 and DeepSORT Algorithm
Computer Vision
Muhammad Moin Beginner 1y ago
Audience Segmentation Tips: 3 Ways to Segment Your Email List
Computer Vision ⚡ AI Lesson
Klaviyo Advanced 1y ago
Visual PDF Reader: ColPALI for RAG #ai
Computer Vision
Discover AI Advanced 1y ago
New Way Now: McLaren Racing is shifting performance into top gear with Google Cloud
Computer Vision
Google Cloud Intermediate 1y ago
An Overview of Object Recognition Tasks
Computer Vision ⚡ AI Lesson
Machine Learning Studio Beginner 1y ago
Excitement for the Generative AI era: Multi-Modal inputs
Computer Vision
Weights & Biases Intermediate 1y ago
Denoising Images with OpenCV in Python
Computer Vision ⚡ AI Lesson
NeuralNine Beginner 1y ago
Image Recognition with LLaVa in Python
Computer Vision ⚡ AI Lesson
NeuralNine Beginner 2y ago
Microsoft's Florence 2: Breaking Boundaries in AI Vision Language!
Computer Vision
Mervin Praison Beginner 2y ago
OCR Using Microsoft's Florence-2 Vision Model on Free Google Colab
Computer Vision
TheAILearner Beginner 2y ago
Florence 2 - The Best Small VLM Out There?
Computer Vision ⚡ AI Lesson
Sam Witteveen Beginner 2y ago
New Microsoft Vision Model has AMAZING TRICKS!!!
Computer Vision ⚡ AI Lesson
1littlecoder Advanced 2y ago
CVPR 2024 Paper: Small Steps and Level Sets: Fitting Neural Surface Models with Point Guidance
Computer Vision
anucvml Beginner 2y ago
[CVPR2024] Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation
Computer Vision
anucvml Intermediate 2y ago
Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum
Computer Vision ⚡ AI Lesson
Microsoft Research Advanced 2y ago
OpenAI CLIP model explained
Computer Vision
Machine Learning Studio Beginner 2y ago
Using PAM EXEC to Log Passwords on Linux
Computer Vision ⚡ AI Lesson
IppSec Beginner 2y ago
Robotics AI for Industrial Applications
Computer Vision
Weights & Biases Advanced 2y ago
Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...
Computer Vision ⚡ AI Lesson
Cohere Intermediate 2y ago
Use Dedicated Deployments with Computer Vision Workflows
Computer Vision
Roboflow Intermediate 1y ago
Football AI | Community Q&A (Aug 29)
Computer Vision ⚡ AI Lesson
Roboflow Advanced 1y ago
Football AI Tutorial: From Basics to Advanced Stats with Python
Computer Vision
Roboflow Intermediate 1y ago
Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI
Computer Vision ⚡ AI Lesson
Roboflow Intermediate 1y ago
AI-Assisted Data Labeling | Weekly Roboflow Product Session
Computer Vision
Roboflow Beginner 1y ago
Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)
Computer Vision
Roboflow Advanced 1y ago
License Plate Detection & Recognition with YOLOv10 and PaddleOCR | Save Data to SQL Database
Computer Vision
Muhammad Moin Beginner 1y ago
Florence-2: Fine-tune Microsoft’s Multimodal Model
Computer Vision
Roboflow Beginner 1y ago
Reimagine document processing and understanding with generative AI
Computer Vision
Google Cloud Intermediate 1y ago
PaliGemma by Google: Train Model on Custom Detection Dataset
Computer Vision
Roboflow Intermediate 2y ago
📚 Continue on Coursera External links · Free to audit
Hands-on Data Centric Visual AI
📚 External: Coursera ↗
Self-paced
Artificial Vision for Textile quality control
📚 External: Coursera ↗
Self-paced
Marketing in the Age of AI
📚 External: Coursera ↗
Self-paced
Marketing Communications: Intro to Consumer Behavior
📚 External: Coursera ↗
Self-paced
Jetson Nano Starter to Pro - A Computer Vision Course
📚 External: Coursera ↗
Self-paced
Advanced Algorithms and Complexity
📚 External: Coursera ↗
Self-paced