Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539

lessons

Skills in this topic

3 skills — Sign in to track your progress

View full skill map →

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,145 Reads 394

Level: All Beginner Intermediate Advanced

Any Length Short (<5m) Medium (5-20m) Long (>20m)

Newest Popular Oldest

🔬 What is a Capacitive Proximity Sensor? #automation #sensor #proximitysensors #basics

Computer Vision

🔬 What is a Capacitive Proximity Sensor? #automation #sensor #proximitysensors #basics

Mr. SMART Engineering Beginner 1y ago

Aya Vision - The Research Behind the Model

Computer Vision

Aya Vision - The Research Behind the Model

Cohere Beginner 1y ago

Aya Vision Challenge, Ep. 1

Computer Vision ⚡ AI Lesson

Aya Vision Challenge, Ep. 1

Cohere Beginner 1y ago

Microsoft’s Phi-4 SLM: Open-Source AI for Text, Vision & Audio!

Computer Vision

Microsoft’s Phi-4 SLM: Open-Source AI for Text, Vision & Audio!

Analytics Vidhya Advanced 1y ago

New Way Now: Safe Rate helps homebuyers and owners save thousands with AI-powered mortgage assistant

Computer Vision

New Way Now: Safe Rate helps homebuyers and owners save thousands with AI-powered mortgage assistant

Google Cloud Intermediate 1y ago

YOLOv12 Object Detection Training Tutorial

Computer Vision ⚡ AI Lesson

YOLOv12 Object Detection Training Tutorial

Roboflow Beginner 1y ago

Vision Transformer from Scratch Tutorial

Computer Vision ⚡ AI Lesson

Vision Transformer from Scratch Tutorial

freeCodeCamp.org Beginner 1y ago

Marketing Environment Analysis | Complete Breakdown

Computer Vision

Marketing Environment Analysis | Complete Breakdown

Leaders Talk - ThinkEduca Beginner 1y ago

How Machines Find Patterns [Template Matching]

Computer Vision

How Machines Find Patterns [Template Matching]

Jia-Bin Huang Intermediate 1y ago

AI Race: Luck or Skill in 2025?

Computer Vision ⚡ AI Lesson

AI Race: Luck or Skill in 2025?

Lex Frid Clips Beginner 1y ago

Deepseek is back with VISION

Computer Vision

Deepseek is back with VISION

1littlecoder Advanced 1y ago

22 Machine Learning Projects That Will Make You A God At Data Science

Computer Vision ⚡ AI Lesson

22 Machine Learning Projects That Will Make You A God At Data Science

Infinite Codes Beginner 1y ago

Build an AI-Powered Self-Serve Checkout & Cost Calculator in 10 Minutes (Almost)

Computer Vision

Build an AI-Powered Self-Serve Checkout & Cost Calculator in 10 Minutes (Almost)

Roboflow Intermediate 1y ago

Using Vertex AI for healthcare

Computer Vision

Using Vertex AI for healthcare

Google Cloud Tech Advanced 1y ago

Next Multi trillion dollar industry?

Computer Vision

Next Multi trillion dollar industry?

Full Disclosure Intermediate 1y ago

DeepSeek’s Janus-Pro-7B Crushes DALL·E 3! #deepseek #openai

Computer Vision

DeepSeek’s Janus-Pro-7B Crushes DALL·E 3! #deepseek #openai

Analytics Vidhya Intermediate 1y ago

Outro Of Project: Cutomer segmentation

Computer Vision

Outro Of Project: Cutomer segmentation

GeeksforGeeks Beginner 1y ago

Model Pusher: Customer Segmentation

Computer Vision

Model Pusher: Customer Segmentation

GeeksforGeeks Beginner 1y ago

This Python module is your go-to for speech and image recognition!

Computer Vision ⚡ AI Lesson

This Python module is your go-to for speech and image recognition!

Tech With Tim Intermediate 1y ago

Selling the Cause: Leveraging Marketing Strategies & Storytelling in Nonprofits

Computer Vision

Selling the Cause: Leveraging Marketing Strategies & Storytelling in Nonprofits

The Nonprofit Prof Intermediate 1y ago

Selling the Cause: Leveraging Marketing Strategies & Storytelling in Nonprofits

Computer Vision

Selling the Cause: Leveraging Marketing Strategies & Storytelling in Nonprofits

The Nonprofit Prof Beginner 1y ago

Enhance Generative AI Model Accuracy Through High-Quality Multimodal Data Processing

Computer Vision

Enhance Generative AI Model Accuracy Through High-Quality Multimodal Data Processing

NVIDIA Developer Advanced 1y ago

Not ElevenLabs, This new #1 Text to Speech AI is FREE!!!!

Computer Vision

Not ElevenLabs, This new #1 Text to Speech AI is FREE!!!!

1littlecoder Intermediate 1y ago

Pool Shot Predictor with OpenCV: Will the Ball Go Into the Pocket?

Computer Vision

Pool Shot Predictor with OpenCV: Will the Ball Go Into the Pocket?

Muhammad Moin Intermediate 1y ago

How to Train YOLO11 Instance Segmentation Models on Your Custom Dataset in Google Colab

Computer Vision

How to Train YOLO11 Instance Segmentation Models on Your Custom Dataset in Google Colab

Muhammad Moin Intermediate 1y ago

LLaVA | LLaVA Model Architecture | Understanding LLaVA Model | Multimodal

Computer Vision

LLaVA | LLaVA Model Architecture | Understanding LLaVA Model | Multimodal

AILinkDeepTech Beginner 1y ago

Multimodal AI Agents Are Revolutionising Image & Video Analysis!

Computer Vision

Multimodal AI Agents Are Revolutionising Image & Video Analysis!

Mervin Praison Beginner 1y ago

Next AI Project is Image Classification in Python🔍🤖

Computer Vision ⚡ AI Lesson

Next AI Project is Image Classification in Python🔍🤖

Tech With Tim Intermediate 1y ago

YOLOv2 (YOLO9000) and YOLOv3 Explained

Computer Vision ⚡ AI Lesson

YOLOv2 (YOLO9000) and YOLOv3 Explained

ExplainingAI Advanced 1y ago

Best of 2024 in Vision [LS Live @ NeurIPS]

Computer Vision ⚡ AI Lesson

Best of 2024 in Vision [LS Live @ NeurIPS]

Latent Space Intermediate 1y ago

How to Do Email Segmentation the Right Way

Computer Vision ⚡ AI Lesson

How to Do Email Segmentation the Right Way

Spark Bridge Digital | Email Marketing Agency Intermediate 1y ago

OpenAI DevDay 2024 | Multimodal apps with the Realtime API

Computer Vision

OpenAI DevDay 2024 | Multimodal apps with the Realtime API

OpenAI Intermediate 1y ago

New Video AI by META & Stanford Univ: APOLLO (7B)

Computer Vision ⚡ AI Lesson

New Video AI by META & Stanford Univ: APOLLO (7B)

Discover AI Advanced 1y ago

Ethan Norville EXPOSES Coronation Project Secrets

Computer Vision

Ethan Norville EXPOSES Coronation Project Secrets

Professor Charley T Intermediate 1y ago

Latent Space LIVE! - Best of 2024: Startups, Vision, Open Src, Reasoning, & The Great Scaling Debate

Computer Vision ⚡ AI Lesson

Latent Space LIVE! - Best of 2024: Startups, Vision, Open Src, Reasoning, & The Great Scaling Debate

Latent Space Beginner 1y ago

Aya Vision - The Challenges & Breakthroughs

Computer Vision ⚡ AI Lesson

Aya Vision - The Challenges & Breakthroughs

Cohere Advanced 1y ago

Peter Tong - MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Computer Vision

Peter Tong - MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Cohere Intermediate 1y ago

Model Evaluation: Customer Segmentation

Computer Vision

Model Evaluation: Customer Segmentation

GeeksforGeeks Beginner 1y ago

Model Evaluation: Customer Segmentation

Computer Vision

Model Evaluation: Customer Segmentation

GeeksforGeeks Beginner 1y ago

Intro Of Project - Customer Segmentation

Computer Vision

Intro Of Project - Customer Segmentation

GeeksforGeeks Beginner 1y ago

Measure Liquid Levels with AI | Build a Web App Powered by Computer Vision

Computer Vision

Measure Liquid Levels with AI | Build a Web App Powered by Computer Vision

Roboflow Intermediate 1y ago

How to Manage Hundreds of Edge Vision AI Devices in One Place

Computer Vision ⚡ AI Lesson

How to Manage Hundreds of Edge Vision AI Devices in One Place

Roboflow Beginner 1y ago

Estimate Real Distance to Objects with Depth Pro and YOLO11

Computer Vision

Estimate Real Distance to Objects with Depth Pro and YOLO11

Muhammad Moin Intermediate 1y ago

From DETR to SAM2: Reviewing the TOP Vision AI Advances of 2024

Computer Vision

From DETR to SAM2: Reviewing the TOP Vision AI Advances of 2024

Roboflow Beginner 1y ago

YOLO11 + Streamlit Computer Vision Dashboard Complete Tutorial

Computer Vision

YOLO11 + Streamlit Computer Vision Dashboard Complete Tutorial

Muhammad Moin Beginner 1y ago

Florence-2: Create and Deploy a Custom Vision Language Model

Computer Vision

Florence-2: Create and Deploy a Custom Vision Language Model

Roboflow Intermediate 1y ago

Use Semantic Search to Create Computer Vision Datasets

Computer Vision

Use Semantic Search to Create Computer Vision Datasets

Roboflow Beginner 1y ago

SAM-2.1: How to Fine-Tune for Image Segmentation

Computer Vision

SAM-2.1: How to Fine-Tune for Image Segmentation

Roboflow Beginner 1y ago

📚 Continue on Coursera External links · Free to audit

View all →

Market Research and Competitive Analysis

📚 External: Coursera ↗

Market Research and Competitive Analysis

Opens on Coursera ↗

Analyze Video Data Using OpenCV and Python

📚 External: Coursera ↗

Analyze Video Data Using OpenCV and Python

Opens on Coursera ↗

📚 External: Coursera ↗

Brand Positioning and Marketing Strategy

Opens on Coursera ↗

Introduction to Deep Learning for Computer Vision

📚 External: Coursera ↗

Introduction to Deep Learning for Computer Vision

Opens on Coursera ↗

Future of data and technology in football

📚 External: Coursera ↗

Future of data and technology in football

Opens on Coursera ↗

Master OpenCV Fundamentals for Real-Time Computer Vision

📚 External: Coursera ↗

Master OpenCV Fundamentals for Real-Time Computer Vision

Opens on Coursera ↗

Consumer Psychology and Marketing Decisions

📚 External: Coursera ↗

Consumer Psychology and Marketing Decisions

Opens on Coursera ↗

Implement Real-Time Face Detection with OpenCV & Python

📚 External: Coursera ↗

Implement Real-Time Face Detection with OpenCV & Python

Opens on Coursera ↗

Pluralidades em Português Brasileiro

📚 External: Coursera ↗

Pluralidades em Português Brasileiro

Opens on Coursera ↗

Market Analysis

📚 External: Coursera ↗

Market Analysis

Opens on Coursera ↗

Object Tracking and Motion Detection with Computer Vision

📚 External: Coursera ↗

Object Tracking and Motion Detection with Computer Vision

Opens on Coursera ↗

Customer Relationship Management

📚 External: Coursera ↗

Customer Relationship Management

Opens on Coursera ↗

Aspectos conceptuales y operativos de la Telesalud

📚 External: Coursera ↗

Aspectos conceptuales y operativos de la Telesalud

Opens on Coursera ↗

📚 External: Coursera ↗

Classify Images of Clouds in the Cloud with AutoML Vision

Opens on Coursera ↗

AI Technologies in Healthcare

📚 External: Coursera ↗

AI Technologies in Healthcare

Opens on Coursera ↗

Camera and Imaging

📚 External: Coursera ↗

Camera and Imaging

Opens on Coursera ↗

📚 External: Coursera ↗

Create Image Captioning Models - Português Brasileiro

Opens on Coursera ↗

📚 External: Coursera ↗

Intro to Operating Systems 2: Memory Management

Opens on Coursera ↗