Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

2,360

lessons

Skills in this topic

3 skills — Sign in to track your progress

View full skill map →

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,145 Reads 1,215

Level: All Beginner Intermediate Advanced

Any Length Short (<5m) Medium (5-20m) Long (>20m)

Newest Popular Oldest

Stanford CS231N Deep Learning for Computer Vision | Spring 2025 | Lecture 10: Video Understanding

Computer Vision

Stanford CS231N Deep Learning for Computer Vision | Spring 2025 | Lecture 10: Video Understanding

Stanford Online Beginner 10mo ago

Testing DeepSeek V3.1 – The BEST Open Source AI Model?

Computer Vision ⚡ AI Lesson

Testing DeepSeek V3.1 – The BEST Open Source AI Model?

Muhammad Moin Beginner 10mo ago

Computer Vision with Arduino Tutorial – 2 Projects

Computer Vision ⚡ AI Lesson

Computer Vision with Arduino Tutorial – 2 Projects

freeCodeCamp.org Beginner 10mo ago

Business Strategy Discussion Developing Business Solutions A Comprehensive Guide.

Computer Vision

Business Strategy Discussion Developing Business Solutions A Comprehensive Guide.

Strategic Marketing Beginner 10mo ago

I trained a Sign Language Detection Transformer (here's how you can do it too!)

Computer Vision ⚡ AI Lesson

I trained a Sign Language Detection Transformer (here's how you can do it too!)

Nicholas Renotte Beginner 10mo ago

Almost All Email Campaigns Are Doing This Wrong

Computer Vision

Almost All Email Campaigns Are Doing This Wrong

Neil Patel Beginner 10mo ago

Introducing CodeSpy.ai – Detect AI-Generated Code with Confidence

Computer Vision ⚡ AI Lesson

Introducing CodeSpy.ai – Detect AI-Generated Code with Confidence

Muhammad Moin Beginner 11mo ago

YOLOv5 Tutorial | Architecture, Assigning Targets & Loss Function Explained

Computer Vision

YOLOv5 Tutorial | Architecture, Assigning Targets & Loss Function Explained

ExplainingAI Beginner 11mo ago

Architecture, Engineering & Construction Industry - Trends

Computer Vision

Architecture, Engineering & Construction Industry - Trends

Primerli Beginner 11mo ago

Control PTZ Cameras with AI | ONVIF Integration with Object Tracking

Computer Vision

Control PTZ Cameras with AI | ONVIF Integration with Object Tracking

Roboflow Beginner 11mo ago

Auto Labeling Image Data | How to Annotate a Dataset and Train a Vision AI Model

Computer Vision

Auto Labeling Image Data | How to Annotate a Dataset and Train a Vision AI Model

Roboflow Beginner 11mo ago

Timothée Darcet - Scaling Self Supervised Learning for Vision An Introduction to DINOv2

Computer Vision ⚡ AI Lesson

Timothée Darcet - Scaling Self Supervised Learning for Vision An Introduction to DINOv2

Cohere Beginner 12mo ago

3 Insane Algorithms Netflix Uses to Scan BILLIONS of Frames

Computer Vision

3 Insane Algorithms Netflix Uses to Scan BILLIONS of Frames

Coding with Lewis Beginner 1y ago

What is Computer Vision

Computer Vision

What is Computer Vision

AI Simplified Beginner 1y ago

Multimodal Document Intelligence with NVIDIA Llama Nemotron Nano VL

Computer Vision ⚡ AI Lesson

Multimodal Document Intelligence with NVIDIA Llama Nemotron Nano VL

NVIDIA Developer Beginner 1y ago

Why More Researchers Should become Content Creators

Computer Vision

Why More Researchers Should become Content Creators

Jia-Bin Huang Beginner 1y ago

LLMs for Equities Feature Forecasting at Two Sigma [Ben Wellington] - 736

Computer Vision ⚡ AI Lesson

LLMs for Equities Feature Forecasting at Two Sigma [Ben Wellington] - 736

The TWIML AI Podcast with Sam Charrington Beginner 1y ago

Unsupervised Learning: Uncover Hidden Patterns & Data Secrets!

Computer Vision

Unsupervised Learning: Uncover Hidden Patterns & Data Secrets!

The AI Standard Beginner 1y ago

Convolutional Neural Networks (CNN) - Face Recognition Case Study - Algorithm & Full Code Explained

Computer Vision

Convolutional Neural Networks (CNN) - Face Recognition Case Study - Algorithm & Full Code Explained

Thinking Neuron Beginner 1y ago

Building a Vision Transformer Model from Scratch with PyTorch

Computer Vision ⚡ AI Lesson

Building a Vision Transformer Model from Scratch with PyTorch

freeCodeCamp.org Beginner 1y ago

Seulki Park - Visually Consistent Hierarchical Image Classification

Computer Vision

Seulki Park - Visually Consistent Hierarchical Image Classification

Cohere Beginner 1y ago

AI Personal Tutor for Everyone

Computer Vision

AI Personal Tutor for Everyone

Y Combinator Beginner 1y ago

OpenAI Multimodal CLIP Architecture in 60 Seconds

Computer Vision

OpenAI Multimodal CLIP Architecture in 60 Seconds

HowCanAIHelp Beginner 1y ago

Computer Vision in 100 Seconds

Computer Vision

Computer Vision in 100 Seconds

Infinite Codes Beginner 1y ago

Build an AI/ML NBA Basketball Analysis system with YOLO, OpenCV, and Python

Computer Vision

Build an AI/ML NBA Basketball Analysis system with YOLO, OpenCV, and Python

Code In a Jiffy Beginner 1y ago

Multimodal AI with Logan Kilpatrick

Computer Vision

Multimodal AI with Logan Kilpatrick

Google Cloud Beginner 1y ago

DETR Explained | End-to-End Object Detection with Transformers | DETR Tutorial Part 1

Computer Vision

DETR Explained | End-to-End Object Detection with Transformers | DETR Tutorial Part 1

ExplainingAI Beginner 1y ago

Visual RAG Unleashed: Harnessing ColQwen2.5 & Qwen2.5-VL-3B-Instruct for Next-Level AI

Computer Vision

Visual RAG Unleashed: Harnessing ColQwen2.5 & Qwen2.5-VL-3B-Instruct for Next-Level AI

Bytes of AI Beginner 1y ago

Nurturing Customer Relationships - Behind the Keynotes - Season 3 Episode 8

Computer Vision

Nurturing Customer Relationships - Behind the Keynotes - Season 3 Episode 8

Nordic Business Forum Beginner 1y ago

How to Build a Smart Football Analysis System Using YOLO11 #computervision #yolo11 #objectdetection

Computer Vision

How to Build a Smart Football Analysis System Using YOLO11 #computervision #yolo11 #objectdetection

Muhammad Moin Beginner 11mo ago

Getting Started with Google Gemini 2.5 Pro: Detect Objects, Generate Captions & OCR

Computer Vision

Getting Started with Google Gemini 2.5 Pro: Detect Objects, Generate Captions & OCR

Muhammad Moin Beginner 11mo ago

YOLO11 + SAHI = Better Detection for Small Objects! (Step-by-Step Guide)

Computer Vision

YOLO11 + SAHI = Better Detection for Small Objects! (Step-by-Step Guide)

Muhammad Moin Beginner 11mo ago

Kimi K2 Coder: NEW Best Free AI Coding Tool? (Open-Source Review)

Computer Vision

Kimi K2 Coder: NEW Best Free AI Coding Tool? (Open-Source Review)

Muhammad Moin Beginner 11mo ago

Build a Car & License Plate Recognition System with YOLO11 + PaddleOCR

Computer Vision

Build a Car & License Plate Recognition System with YOLO11 + PaddleOCR

Muhammad Moin Beginner 11mo ago

Gemini Code Assist - AI Coding Agents: A Step-by-Step Tutorial

Computer Vision

Gemini Code Assist - AI Coding Agents: A Step-by-Step Tutorial

Muhammad Moin Beginner 12mo ago

Build a PDF Text Extractor App with Streamlit, n8n & Mistral OCR API – Step-by-Step Tutorial

Computer Vision

Build a PDF Text Extractor App with Streamlit, n8n & Mistral OCR API – Step-by-Step Tutorial

Muhammad Moin Beginner 12mo ago

Build an AI Agent in n8n to Analyze YouTube Comments & Report Insights

Computer Vision

Build an AI Agent in n8n to Analyze YouTube Comments & Report Insights

Muhammad Moin Beginner 1y ago

Computer Vision

Instagram Profile Scraper using Apify and Google Sheets in n8n

Muhammad Moin Beginner 1y ago

Introduction to AI Agents: LLMs, Workflows, and AI Agents

Computer Vision

Introduction to AI Agents: LLMs, Workflows, and AI Agents

Muhammad Moin Beginner 1y ago

How to Detect People in Danger Zones with AI

Computer Vision

How to Detect People in Danger Zones with AI

Roboflow Beginner 1y ago

Train Foundation Models Better with LightlyTrain – Achieve Better Accuracy with Less Effort

Computer Vision

Train Foundation Models Better with LightlyTrain – Achieve Better Accuracy with Less Effort

Muhammad Moin Beginner 1y ago

RF-DETR Architecture & How it Works | Why is DETR Better Than YOLO?

Computer Vision

RF-DETR Architecture & How it Works | Why is DETR Better Than YOLO?

Roboflow Beginner 1y ago

Aya Vision Challenge, Ep. 3

Computer Vision

Aya Vision Challenge, Ep. 3

Cohere Beginner 1y ago

How to Train RF-DETR Object Detection Transformer on Custom Dataset for Potholes Detection

Computer Vision

How to Train RF-DETR Object Detection Transformer on Custom Dataset for Potholes Detection

Muhammad Moin Beginner 1y ago

RF-DETR: Real-Time Object Detection in Images and Videos | A Step-by-Step Guide

Computer Vision

RF-DETR: Real-Time Object Detection in Images and Videos | A Step-by-Step Guide

Muhammad Moin Beginner 1y ago

Aya Vision Challenge, Ep. 2

Computer Vision ⚡ AI Lesson

Aya Vision Challenge, Ep. 2

Cohere Beginner 1y ago

Build a Tennis Analysis System with YOLOv12 and OpenCV

Computer Vision

Build a Tennis Analysis System with YOLOv12 and OpenCV

Muhammad Moin Beginner 1y ago

How to Train YOLOv12 Models on Your Custom Dataset in Google Colab

Computer Vision

How to Train YOLOv12 Models on Your Custom Dataset in Google Colab

Muhammad Moin Beginner 1y ago

📚 Continue on Coursera External links · Free to audit

View all →

Fine-Tuning and Evaluating Vision AI Models

📚 External: Coursera ↗

Fine-Tuning and Evaluating Vision AI Models

Opens on Coursera ↗

AutoML: Build ML Models without Code

📚 External: Coursera ↗

AutoML: Build ML Models without Code

Opens on Coursera ↗

The Social Media Landscape

📚 External: Coursera ↗

The Social Media Landscape

Opens on Coursera ↗

Self-Driving Car Specialization Course

📚 External: Coursera ↗

Self-Driving Car Specialization Course

Opens on Coursera ↗

Marketing Fundamentals Mastery: Apply, Analyze & Evaluate

📚 External: Coursera ↗

Marketing Fundamentals Mastery: Apply, Analyze & Evaluate

Opens on Coursera ↗

Supply Chain Sourcing

📚 External: Coursera ↗

Supply Chain Sourcing

Opens on Coursera ↗

Market Research and Competitive Analysis

📚 External: Coursera ↗

Market Research and Competitive Analysis

Opens on Coursera ↗

Materiales para envase y embalaje

📚 External: Coursera ↗

Materiales para envase y embalaje

Opens on Coursera ↗

Foundations of Sports Marketing

📚 External: Coursera ↗

Foundations of Sports Marketing

Opens on Coursera ↗

H2O Cloud AI Developer Services

📚 External: Coursera ↗

H2O Cloud AI Developer Services

Opens on Coursera ↗

Interdisciplinarity in Thought and Practice

📚 External: Coursera ↗

Interdisciplinarity in Thought and Practice

Opens on Coursera ↗

International Marketing Strategies and Global Trade

📚 External: Coursera ↗

International Marketing Strategies and Global Trade

Opens on Coursera ↗

📚 External: Coursera ↗

Build an End-to-End Data Capture Pipeline using Document AI

Opens on Coursera ↗

Marketing in the Age of AI

📚 External: Coursera ↗

Marketing in the Age of AI

Opens on Coursera ↗

Deep Learning for Object Detection

📚 External: Coursera ↗

Deep Learning for Object Detection

Opens on Coursera ↗

📚 External: Coursera ↗

Create Image Captioning Models - Português Brasileiro

Opens on Coursera ↗

Create video, audio and infographics for online learning

📚 External: Coursera ↗

Create video, audio and infographics for online learning

Opens on Coursera ↗

Behavioral Marketing

📚 External: Coursera ↗

Behavioral Marketing

Opens on Coursera ↗