Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539 lessons

Skills in this topic

3 skills

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,145 Reads 394

Computer Vision

Drowsiness Detection with Vision AI | Improve Safety with AI

Roboflow Intermediate 1y ago

Computer Vision

Multimodal Open Source at Kyutai, From Online Demos to On-Device - Alexandre Défossez

PyTorch Intermediate 1y ago

Computer Vision

Instagram Profile Scraper using Apify and Google Sheets in n8n

Muhammad Moin Beginner 1y ago

Computer Vision ⚡ AI Lesson

MedGemma LLM: Doctors, Meet Your AI Assistant 🧠

AI Anytime Intermediate 1y ago

Computer Vision

Introduction to AI Agents: LLMs, Workflows, and AI Agents

Muhammad Moin Beginner 1y ago

Computer Vision

[CVPR 2025] Pos3R: 6D Pose Estimation for Unseen Objects Made Easy

anucvml Intermediate 1y ago

Computer Vision

Convolutional Neural Networks (CNN) - Face Recognition Case Study - Algorithm & Full Code Explained

Thinking Neuron Beginner 1y ago

Computer Vision ⚡ AI Lesson

FastVLM brings advanced computer vision to your phone...

NeuralNine Advanced 1y ago

Computer Vision ⚡ AI Lesson

Building a Vision Transformer Model from Scratch with PyTorch

freeCodeCamp.org Beginner 1y ago

Computer Vision

China’s ByteDance Just Dropped BAGEL — Multimodal AI Beast!

Analytics Vidhya Intermediate 1y ago

Computer Vision

Seulki Park - Visually Consistent Hierarchical Image Classification

Cohere Beginner 1y ago

Computer Vision

AI Personal Tutor for Everyone

Y Combinator Beginner 1y ago

Computer Vision

OpenAI Multimodal CLIP Architecture in 60 Seconds

HowCanAIHelp Beginner 1y ago

Computer Vision

Computer Vision in 100 Seconds

Infinite Codes Beginner 1y ago

Computer Vision ⚡ AI Lesson

How to Segment Your Audience in Mailchimp

Intuit Mailchimp Intermediate 1y ago

Computer Vision

Build an AI/ML NBA Basketball Analysis system with YOLO, OpenCV, and Python

Code In a Jiffy Beginner 1y ago

Computer Vision

How to Detect People in Danger Zones with AI

Roboflow Beginner 1y ago

Computer Vision

Multimodal AI with Logan Kilpatrick

Google Cloud Beginner 1y ago

Computer Vision

DETR Explained | End-to-End Object Detection with Transformers | DETR Tutorial Part 1

ExplainingAI Beginner 1y ago

Computer Vision

Find out how Nevada DETR achieved 4x faster approvals with Vertex AI

Google Cloud Advanced 1y ago

Computer Vision

Visual RAG Unleashed: Harnessing ColQwen2.5 & Qwen2.5-VL-3B-Instruct for Next-Level AI

Bytes of AI Beginner 1y ago

Computer Vision ⚡ AI Lesson

Multimodal AI & Next Gen Databases | Data Brew | Episode 42

Databricks Intermediate 1y ago

Computer Vision

PaliGemma – Making Gemma 2 see by adding a vision encoder

Google for Developers Advanced 1y ago

Computer Vision

Aya Vision Challenge, Ep. 3

Cohere Beginner 1y ago

Computer Vision

Nurturing Customer Relationships - Behind the Keynotes - Season 3 Episode 8

Nordic Business Forum Beginner 1y ago

Computer Vision

Seminar: Segment Anything - Meta AI (15-03-2025)

IEC Seminar Intermediate 1y ago

Computer Vision

Building a travel buddy with Gemma

Google for Developers Intermediate 1y ago

Computer Vision

George Hotz | mixture of experts (like deepseek) on tinygrad sovereign AMD stack | AMD YOLO

george hotz archive Advanced 1y ago

Computer Vision

Le meilleur OCR au monde : Mistral AI (The Best OCR in the World: Mistral AI)

LAW I.A. Avocat & intelligence artificielle Lexvox Advanced 1y ago

Computer Vision

Microsoft's Phi-4 Multimodal : NEW Opensource LLM is a TINY BEAST! (Full Test & Review)

Codewello Beginner 1y ago

Computer Vision ⚡ AI Lesson

Open-source AI models are surpassing closed source (fast) | AI/ML Monthly

Daniel Bourke Beginner 1y ago

Computer Vision

⚙️ How Does a Capacitive Proximity Sensor Work? #automation #sensor #proximitysensors #basics

Mr. SMART Engineering Beginner 1y ago

Computer Vision

Train Foundation Models Better with LightlyTrain – Achieve Better Accuracy with Less Effort

Muhammad Moin Beginner 1y ago

Computer Vision

Intuit uses Google Cloud Document AI to further simplify tax prep for millions

Google Cloud Intermediate 1y ago

Computer Vision

RF-DETR Architecture & How it Works | Why is DETR Better Than YOLO?

Roboflow Beginner 1y ago

Computer Vision

RF-DETR, Batch Processing, Instant Training, Serverless Inference, and More | What's New in Roboflow

Roboflow Intermediate 1y ago

Computer Vision

Expedition Aya Kick Off Event

Cohere Intermediate 1y ago

Computer Vision

RF-DETR Beat YOLOs on Real-time Object Detection | Fine-Tuning | Live Coding & Q&A (Mar 27th)

Roboflow Advanced 1y ago

Computer Vision

How to Train RF-DETR Object Detection Transformer on Custom Dataset for Potholes Detection

Muhammad Moin Beginner 1y ago

Computer Vision

RF-DETR: Real-Time Object Detection in Images and Videos | A Step-by-Step Guide

Muhammad Moin Beginner 1y ago

Computer Vision

Build a Football Analysis System Using YOLO11 and Supervision

Muhammad Moin Intermediate 1y ago

Computer Vision ⚡ AI Lesson

Aya Vision Challenge, Ep. 2

Cohere Beginner 1y ago

Computer Vision

YOLOE: Real-Time Zero-Shot Object Detection and Segmentation Explained | Visual Prompting

Muhammad Moin Advanced 1y ago

Computer Vision

Build a Tennis Analysis System with YOLOv12 and OpenCV

Muhammad Moin Beginner 1y ago

Computer Vision

New Course: YOLOv12 – Custom Object Detection, Tracking & Web Apps

Muhammad Moin Intermediate 1y ago

Computer Vision

How to Train YOLOv12 Models on Your Custom Dataset in Google Colab

Muhammad Moin Beginner 1y ago

Computer Vision

YOLOE: Real-time Zero-shot Object Detection | Visual Prompting | Live Coding & Q&A (Mar 14th)

Roboflow Advanced 1y ago

Computer Vision

Measure Objects with AI | Identifying Common Pitfalls and Increasing Precision

Roboflow Beginner 1y ago

📚 Continue on Coursera External links · Free to audit

📚 External: Coursera ↗

Create video, audio and infographics for online learning

Opens on Coursera ↗

📚 External: Coursera ↗

AI and Disaster Management

Opens on Coursera ↗

📚 External: Coursera ↗

Craft Sales Strategy

Opens on Coursera ↗

📚 External: Coursera ↗

Business Economics and Game Theory for Decision Making

Opens on Coursera ↗

📚 External: Coursera ↗

Artificial Vision for Textile quality control

Opens on Coursera ↗

📚 External: Coursera ↗

Process Images, Create Captioning AI Models

Opens on Coursera ↗

📚 External: Coursera ↗

Features and Boundaries

Opens on Coursera ↗

📚 External: Coursera ↗

Computer Vision: YOLO Custom Object Detection with Colab GPU

Opens on Coursera ↗

📚 External: Coursera ↗

Finanzas para directivos (Finance for Executives)

Opens on Coursera ↗

📚 External: Coursera ↗

Introduction to Computer Vision

Opens on Coursera ↗

📚 External: Coursera ↗

Computer Vision: Face Recognition Quick Starter in Python

Opens on Coursera ↗

📚 External: Coursera ↗

Unity: Design & Deform Meshes for 3D Geometry Control

Opens on Coursera ↗

📚 External: Coursera ↗

Advanced Algorithms and Complexity

Opens on Coursera ↗

📚 External: Coursera ↗

Marketing in the Age of AI

Opens on Coursera ↗

📚 External: Coursera ↗

Positioning: What you need for a successful Marketing Strategy

Opens on Coursera ↗

📚 External: Coursera ↗

Intro to Operating Systems 2: Memory Management

Opens on Coursera ↗

📚 External: Coursera ↗

Running Distributed TensorFlow using Vertex AI

Opens on Coursera ↗

📚 External: Coursera ↗

Global Marketing and International Business Strategy

Opens on Coursera ↗