Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539

lessons

Skills in this topic

3 skills — Sign in to track your progress

View full skill map →

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,145 Reads 394

Level: All Beginner Intermediate Advanced

Any Length Short (<5m) Medium (5-20m) Long (>20m)

Newest Popular Oldest

open-animal-tracks

Computer Vision ⚡ AI Lesson

open-animal-tracks

Data Skeptic Advanced 1y ago

Model Evaluation for Computer Vision

Computer Vision ⚡ AI Lesson

Model Evaluation for Computer Vision

Roboflow Beginner 1y ago

Bird Distribution Modeling with Satbird

Computer Vision ⚡ AI Lesson

Bird Distribution Modeling with Satbird

Data Skeptic Advanced 1y ago

Active Learning in Computer Vision

Computer Vision

Active Learning in Computer Vision

Roboflow Beginner 1y ago

I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.

Computer Vision

I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.

Neil Patel Intermediate 1y ago

YOLO Object Detection | YoloV1 Explanation and Implementation Tutorial

Computer Vision

YOLO Object Detection | YoloV1 Explanation and Implementation Tutorial

ExplainingAI Advanced 1y ago

Organize PDFs Efficiently: Build a Streamlit PDF Sorter Application using LangChain and Llama 3.1

Computer Vision

Organize PDFs Efficiently: Build a Streamlit PDF Sorter Application using LangChain and Llama 3.1

Muhammad Moin Intermediate 1y ago

C4AI Expedition Aya - Most Promising Prize: Maya: Multimodal Aya

Computer Vision

C4AI Expedition Aya - Most Promising Prize: Maya: Multimodal Aya

Cohere Beginner 1y ago

Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum

Computer Vision

Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum

Microsoft Research Advanced 1y ago

Qwen2-VL: The Best Open Source Vision Model for OCR & VQA

Computer Vision ⚡ AI Lesson

Qwen2-VL: The Best Open Source Vision Model for OCR & VQA

AI Anytime Intermediate 1y ago

Exploring Robotics and Python Through Electronic Projects | Real Python Podcast #218

Computer Vision

Exploring Robotics and Python Through Electronic Projects | Real Python Podcast #218

Real Python Beginner 1y ago

Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed

Computer Vision ⚡ AI Lesson

Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed

DataCamp Intermediate 1y ago

Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson

Computer Vision

Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson

Latent Space Advanced 1y ago

How to run SAM 2 (Segment Anything AI Model)?

Computer Vision ⚡ AI Lesson

How to run SAM 2 (Segment Anything AI Model)?

AI Anytime Intermediate 1y ago

JETSON AI LAB | Research Group Meeting (8/6/2024)

Computer Vision

JETSON AI LAB | Research Group Meeting (8/6/2024)

NVIDIA Developer Advanced 1y ago

Meta Unveils Segment Anything 2: Revolutionizing Image and 3D Segmentation! #meta #ai #genai

Computer Vision

Meta Unveils Segment Anything 2: Revolutionizing Image and 3D Segmentation! #meta #ai #genai

Deepak Bhaskaran Beginner 1y ago

Boost #WorkplaceSafety with Intenseye, an AI-powered employee health and safety (EHS) platform.

Computer Vision

Boost #WorkplaceSafety with Intenseye, an AI-powered employee health and safety (EHS) platform.

Google Cloud Beginner 1y ago

SAM 2 is going to transform COMPUTER VISION!!!

Computer Vision

SAM 2 is going to transform COMPUTER VISION!!!

1littlecoder Intermediate 1y ago

LlamaIndex Webinar: ColPali - Efficient Document Retrieval with Vision Language Models

Computer Vision

LlamaIndex Webinar: ColPali - Efficient Document Retrieval with Vision Language Models

LlamaIndex Advanced 1y ago

Real-Time Object Tracking using YOLO10 and DeepSORT Algorithm

Computer Vision

Real-Time Object Tracking using YOLO10 and DeepSORT Algorithm

Muhammad Moin Beginner 1y ago

Audience Segmentation Tips: 3 Ways to Segment Your Email List

Computer Vision ⚡ AI Lesson

Audience Segmentation Tips: 3 Ways to Segment Your Email List

Klaviyo Advanced 1y ago

Visual PDF Reader: ColPALI for RAG #ai

Computer Vision

Visual PDF Reader: ColPALI for RAG #ai

Discover AI Advanced 1y ago

New Way Now: McLaren Racing is shifting performance into top gear with Google Cloud

Computer Vision

New Way Now: McLaren Racing is shifting performance into top gear with Google Cloud

Google Cloud Intermediate 1y ago

An Overview of Object Recognition Tasks

Computer Vision ⚡ AI Lesson

An Overview of Object Recognition Tasks

Machine Learning Studio Beginner 1y ago

Excitement for the Generative AI era: Multi-Modal inputs

Computer Vision

Excitement for the Generative AI era: Multi-Modal inputs

Weights & Biases Intermediate 1y ago

Denoising Images with OpenCV in Python

Computer Vision ⚡ AI Lesson

Denoising Images with OpenCV in Python

NeuralNine Beginner 1y ago

Image Recognition with LLaVa in Python

Computer Vision ⚡ AI Lesson

Image Recognition with LLaVa in Python

NeuralNine Beginner 2y ago

Microsoft's Florence 2: Breaking Boundaries in AI Vision Language!

Computer Vision

Microsoft's Florence 2: Breaking Boundaries in AI Vision Language!

Mervin Praison Beginner 2y ago

OCR Using Microsoft's Florence-2 Vision Model on Free Google Colab

Computer Vision

OCR Using Microsoft's Florence-2 Vision Model on Free Google Colab

TheAILearner Beginner 2y ago

Florence 2 - The Best Small VLM Out There?

Computer Vision ⚡ AI Lesson

Florence 2 - The Best Small VLM Out There?

Sam Witteveen Beginner 2y ago

New Microsoft Vision Model has AMAZING TRICKS!!!

Computer Vision ⚡ AI Lesson

New Microsoft Vision Model has AMAZING TRICKS!!!

1littlecoder Advanced 2y ago

CVPR 2024 Paper: Small Steps and Level Sets: Fitting Neural Surface Models with Point Guidance

Computer Vision

CVPR 2024 Paper: Small Steps and Level Sets: Fitting Neural Surface Models with Point Guidance

anucvml Beginner 2y ago

[CVPR2024] Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation

Computer Vision

[CVPR2024] Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation

anucvml Intermediate 2y ago

Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum

Computer Vision ⚡ AI Lesson

Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum

Microsoft Research Advanced 2y ago

OpenAI CLIP model explained

Computer Vision

OpenAI CLIP model explained

Machine Learning Studio Beginner 2y ago

Using PAM EXEC to Log Passwords on Linux

Computer Vision ⚡ AI Lesson

Using PAM EXEC to Log Passwords on Linux

IppSec Beginner 2y ago

Robotics AI for Industrial Applications

Computer Vision

Robotics AI for Industrial Applications

Weights & Biases Advanced 2y ago

Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...

Computer Vision ⚡ AI Lesson

Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...

Cohere Intermediate 2y ago

Use Dedicated Deployments with Computer Vision Workflows

Computer Vision

Use Dedicated Deployments with Computer Vision Workflows

Roboflow Intermediate 1y ago

Football AI | Community Q&A (Aug 29)

Computer Vision ⚡ AI Lesson

Football AI | Community Q&A (Aug 29)

Roboflow Advanced 1y ago

Football AI Tutorial: From Basics to Advanced Stats with Python

Computer Vision

Football AI Tutorial: From Basics to Advanced Stats with Python

Roboflow Intermediate 1y ago

Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI

Computer Vision ⚡ AI Lesson

Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI

Roboflow Intermediate 1y ago

AI-Assisted Data Labeling | Weekly Roboflow Product Session

Computer Vision

AI-Assisted Data Labeling | Weekly Roboflow Product Session

Roboflow Beginner 1y ago

Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)

Computer Vision

Segment Anything 2 (SAM 2): Meta AI's Newest Model | Community Q&A (Jul 30)

Roboflow Advanced 1y ago

License Plate Detection & Recognition with YOLOv10 and PaddleOCR | Save Data to SQL Database

Computer Vision

License Plate Detection & Recognition with YOLOv10 and PaddleOCR | Save Data to SQL Database

Muhammad Moin Beginner 1y ago

Florence-2: Fine-tune Microsoft’s Multimodal Model

Computer Vision

Florence-2: Fine-tune Microsoft’s Multimodal Model

Roboflow Beginner 1y ago

Reimagine document processing and understanding with generative AI

Computer Vision

Reimagine document processing and understanding with generative AI

Google Cloud Intermediate 1y ago

PaliGemma by Google: Train Model on Custom Detection Dataset

Computer Vision

PaliGemma by Google: Train Model on Custom Detection Dataset

Roboflow Intermediate 2y ago

📚 Continue on Coursera External links · Free to audit

View all →

📚 External: Coursera ↗

Hands-on Data Centric Visual AI

Opens on Coursera ↗

Artificial Vision for Textile quality control

📚 External: Coursera ↗

Artificial Vision for Textile quality control

Opens on Coursera ↗

Marketing in the Age of AI

📚 External: Coursera ↗

Marketing in the Age of AI

Opens on Coursera ↗

📚 External: Coursera ↗

Marketing Communications: Intro to Consumer Behavior

Opens on Coursera ↗

Jetson Nano Starter to Pro - A Computer Vision Course

📚 External: Coursera ↗

Jetson Nano Starter to Pro - A Computer Vision Course

Opens on Coursera ↗

Advanced Algorithms and Complexity

📚 External: Coursera ↗

Advanced Algorithms and Complexity

Opens on Coursera ↗

📚 External: Coursera ↗

Form Parsing with Document AI (Python)

Opens on Coursera ↗

UiPath Automation Developer Professional

📚 External: Coursera ↗

UiPath Automation Developer Professional

Opens on Coursera ↗

📚 External: Coursera ↗

Image Segmentation, Filtering, and Region Analysis

Opens on Coursera ↗

H2O Cloud AI Developer Services

📚 External: Coursera ↗

H2O Cloud AI Developer Services

Opens on Coursera ↗

Sales Transformation Fundamentals

📚 External: Coursera ↗

Sales Transformation Fundamentals

Opens on Coursera ↗

Networking and Security Architecture with VMware NSX

📚 External: Coursera ↗

Networking and Security Architecture with VMware NSX

Opens on Coursera ↗

📚 External: Coursera ↗

Process Documents with Python Using the Document AI API

Opens on Coursera ↗

Tendencias e innovaciones en los medios deportivos

📚 External: Coursera ↗

Tendencias e innovaciones en los medios deportivos

Opens on Coursera ↗

The Social Media Landscape

📚 External: Coursera ↗

The Social Media Landscape

Opens on Coursera ↗

YOLO-NAS + v8 Full-Stack Computer Vision Course

📚 External: Coursera ↗

YOLO-NAS + v8 Full-Stack Computer Vision Course

Opens on Coursera ↗

Features and Boundaries

📚 External: Coursera ↗

Features and Boundaries

Opens on Coursera ↗

Digital Marketing Foundations: Analyze & Apply Strategies

📚 External: Coursera ↗

Digital Marketing Foundations: Analyze & Apply Strategies

Opens on Coursera ↗