Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,332
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
RF-DETR Architecture & How it Works | Why is DETR Better Than YOLO?
Computer Vision
RF-DETR Architecture & How it Works | Why is DETR Better Than YOLO?
Roboflow Beginner 1y ago
Visual RAG Unleashed: Harnessing ColQwen2.5 & Qwen2.5-VL-3B-Instruct for Next-Level AI
Computer Vision
Visual RAG Unleashed: Harnessing ColQwen2.5 & Qwen2.5-VL-3B-Instruct for Next-Level AI
Bytes of AI Beginner 1y ago
Aya Vision Challenge, Ep. 3
Computer Vision
Aya Vision Challenge, Ep. 3
Cohere Beginner 1y ago
Aya Vision Challenge, Ep. 2
Computer Vision ⚡ AI Lesson
Aya Vision Challenge, Ep. 2
Cohere Beginner 1y ago
Measure Objects with AI | Identifying Common Pitfalls and Increasing Precision
Computer Vision
Measure Objects with AI | Identifying Common Pitfalls and Increasing Precision
Roboflow Beginner 1y ago
Vision Transformer from Scratch Tutorial
Computer Vision ⚡ AI Lesson
Vision Transformer from Scratch Tutorial
freeCodeCamp.org Beginner 1y ago
AI Race: Luck or Skill in 2025?
Computer Vision ⚡ AI Lesson
AI Race: Luck or Skill in 2025?
Lex Frid Clips Beginner 1y ago
22 Machine Learning Projects That Will Make You A God At Data Science
Computer Vision ⚡ AI Lesson
22 Machine Learning Projects That Will Make You A God At Data Science
Infinite Codes Beginner 1y ago
Outro Of Project: Cutomer segmentation
Computer Vision
Outro Of Project: Cutomer segmentation
GeeksforGeeks Beginner 1y ago
Model Pusher: Customer Segmentation
Computer Vision
Model Pusher: Customer Segmentation
GeeksforGeeks Beginner 1y ago
Discriminative AI explained in 60 seconds #ai #aiexplained #learning #artificialintelligence
Computer Vision
Discriminative AI explained in 60 seconds #ai #aiexplained #learning #artificialintelligence
AI Waves Beginner 1y ago
Multimodal AI Agents Are Revolutionising Image & Video Analysis!
Computer Vision
Multimodal AI Agents Are Revolutionising Image & Video Analysis!
Mervin Praison Beginner 1y ago
Latent Space LIVE! - Best of 2024: Startups, Vision, Open Src, Reasoning, & The Great Scaling Debate
Computer Vision ⚡ AI Lesson
Latent Space LIVE! - Best of 2024: Startups, Vision, Open Src, Reasoning, & The Great Scaling Debate
Latent Space Beginner 1y ago
Multimodal Embeddings: Introduction & Use Cases (with Python)
Computer Vision
Multimodal Embeddings: Introduction & Use Cases (with Python)
Shaw Talebi Beginner 1y ago
Demo Lecture-Image Processing-Computer Vision With Generative AI Bootcamp With Doubts Solving
Computer Vision
Demo Lecture-Image Processing-Computer Vision With Generative AI Bootcamp With Doubts Solving
Krish Naik Beginner 1y ago
Insights from a Kaggle Grandmaster: Multimodal Models, Agents, Document AI & more
Computer Vision
Insights from a Kaggle Grandmaster: Multimodal Models, Agents, Document AI & more
Analytics Vidhya Beginner 1y ago
Web AI Summit 2024: State of client side machine learning
Computer Vision ⚡ AI Lesson
Web AI Summit 2024: State of client side machine learning
Chrome for Developers Beginner 1y ago
NLP Engineer & Computer Vision Engineer #codebasics #nlp #computervision #datajob #shorts
Computer Vision ⚡ AI Lesson
NLP Engineer & Computer Vision Engineer #codebasics #nlp #computervision #datajob #shorts
codebasics Beginner 1y ago
Revolutionizing sign language with AI
Computer Vision ⚡ AI Lesson
Revolutionizing sign language with AI
TensorFlow Official Beginner 1y ago
Neuralift AI builds trust using W&B Weave
Computer Vision
Neuralift AI builds trust using W&B Weave
Weights & Biases Beginner 1y ago
[Paper Club] SWE-Bench [OpenAI Verified/Multimodal] + MLE-Bench with Jesse Hu
Computer Vision ⚡ AI Lesson
[Paper Club] SWE-Bench [OpenAI Verified/Multimodal] + MLE-Bench with Jesse Hu
Latent Space Beginner 1y ago
Single Shot Multibox Detector | SSD Object Detection Explained and Implemented
Computer Vision
Single Shot Multibox Detector | SSD Object Detection Explained and Implemented
ExplainingAI Beginner 1y ago
Data As a Corporate Asset—the GenAI-era Take (Part 2)
Computer Vision ⚡ AI Lesson
Data As a Corporate Asset—the GenAI-era Take (Part 2)
Microsoft Developer Beginner 1y ago
Computer Vision Explained in 30s
Computer Vision
Computer Vision Explained in 30s
365 Data Science Beginner 1y ago
New Way Now: Plenitude streamlines customer onboarding and fraud prevention with Google Cloud AI
Computer Vision
New Way Now: Plenitude streamlines customer onboarding and fraud prevention with Google Cloud AI
Google Cloud Beginner 1y ago
Blobs to Clips: Efficient End-to-End Video Data Loading - Andrew Ho & Ahmad Sharif, Meta
Computer Vision ⚡ AI Lesson
Blobs to Clips: Efficient End-to-End Video Data Loading - Andrew Ho & Ahmad Sharif, Meta
PyTorch Beginner 1y ago
Llama 3.2: Best Multimodal Model Yet? (Vision Test)
Computer Vision ⚡ AI Lesson
Llama 3.2: Best Multimodal Model Yet? (Vision Test)
Mervin Praison Beginner 1y ago
CS50x 2025 - Lecture 4 - Memory
Computer Vision
CS50x 2025 - Lecture 4 - Memory
CS50 Beginner 1y ago
Data As a Corporate Asset—the GenAI-era Take (Part 1)
Computer Vision
Data As a Corporate Asset—the GenAI-era Take (Part 1)
Microsoft Developer Beginner 1y ago
Free Live 3 Days Computer Vision and Object Detection Workshop
Computer Vision
Free Live 3 Days Computer Vision and Object Detection Workshop
Krish Naik Beginner 1y ago
Using PyTorch for Monocular Depth Estimation Webinar
Computer Vision ⚡ AI Lesson
Using PyTorch for Monocular Depth Estimation Webinar
PyTorch Beginner 1y ago
Handwriting Transcription with AI: Digitizing Documents Using Computer Vision
Computer Vision
Handwriting Transcription with AI: Digitizing Documents Using Computer Vision
Macgence Beginner 1y ago
Object Detection: Importance of High-Quality Data
Computer Vision ⚡ AI Lesson
Object Detection: Importance of High-Quality Data
Macgence Beginner 1y ago
“The Future of AI is Here” — Fei-Fei Li Unveils the Next Frontier of AI
Computer Vision ⚡ AI Lesson
“The Future of AI is Here” — Fei-Fei Li Unveils the Next Frontier of AI
a16z Beginner 1y ago
Aya Vision - The Research Behind the Model
Computer Vision
Aya Vision - The Research Behind the Model
Cohere Beginner 1y ago
Aya Vision Challenge, Ep. 1
Computer Vision ⚡ AI Lesson
Aya Vision Challenge, Ep. 1
Cohere Beginner 1y ago
YOLOv12 Object Detection Training Tutorial
Computer Vision ⚡ AI Lesson
YOLOv12 Object Detection Training Tutorial
Roboflow Beginner 1y ago
Model Evaluation: Customer Segmentation
Computer Vision
Model Evaluation: Customer Segmentation
GeeksforGeeks Beginner 1y ago
Model Evaluation: Customer Segmentation
Computer Vision
Model Evaluation: Customer Segmentation
GeeksforGeeks Beginner 1y ago
Intro Of Project - Customer Segmentation
Computer Vision
Intro Of Project - Customer Segmentation
GeeksforGeeks Beginner 1y ago
How to Manage Hundreds of Edge Vision AI Devices in One Place
Computer Vision ⚡ AI Lesson
How to Manage Hundreds of Edge Vision AI Devices in One Place
Roboflow Beginner 1y ago
From DETR to SAM2: Reviewing the TOP Vision AI Advances of 2024
Computer Vision
From DETR to SAM2: Reviewing the TOP Vision AI Advances of 2024
Roboflow Beginner 1y ago
Use Semantic Search to Create Computer Vision Datasets
Computer Vision
Use Semantic Search to Create Computer Vision Datasets
Roboflow Beginner 1y ago
SAM-2.1: How to Fine-Tune for Image Segmentation
Computer Vision
SAM-2.1: How to Fine-Tune for Image Segmentation
Roboflow Beginner 1y ago
How to Build a Smart Parking System - License Plate Detection & OCR
Computer Vision ⚡ AI Lesson
How to Build a Smart Parking System - License Plate Detection & OCR
Roboflow Beginner 1y ago
YOLOv11: How to Train for Object Detection on a Custom Dataset | Step-by-step guide
Computer Vision
YOLOv11: How to Train for Object Detection on a Custom Dataset | Step-by-step guide
Roboflow Beginner 1y ago
How to use OCR | Get Started with Optical Character Recognition
Computer Vision
How to use OCR | Get Started with Optical Character Recognition
Roboflow Beginner 1y ago
Xiang Yue - Measuring Multimodal Reasoning with the MMMU Benchmarks
Computer Vision
Xiang Yue - Measuring Multimodal Reasoning with the MMMU Benchmarks
Cohere Beginner 1y ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Image Segmentation, Filtering, and Region Analysis
📚 Coursera Course ↗
Self-paced
Image Segmentation, Filtering, and Region Analysis
Opens on Coursera ↗
Process Documents with Python Using the Document AI API
📚 Coursera Course ↗
Self-paced
Process Documents with Python Using the Document AI API
Opens on Coursera ↗
Image and Video Processing: From Mars to Hollywood with a Stop at the Hospital
📚 Coursera Course ↗
Self-paced
Image and Video Processing: From Mars to Hollywood with a Stop at the Hospital
Opens on Coursera ↗
Process SAR & Multispectral
📚 Coursera Course ↗
Self-paced
Process SAR & Multispectral
Opens on Coursera ↗
Create Image Captioning Models - Español
📚 Coursera Course ↗
Self-paced
Create Image Captioning Models - Español
Opens on Coursera ↗
Unity: Design & Deform Meshes for 3D Geometry Control
📚 Coursera Course ↗
Self-paced
Unity: Design & Deform Meshes for 3D Geometry Control
Opens on Coursera ↗