Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,542 lessons

Skills in this topic

3 skills

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,145 Reads 397

Audience Segmentation Tips: 3 Ways to Segment Your Email List

Computer Vision ⚡ AI Lesson

Klaviyo Advanced 1y ago

Visual PDF Reader: ColPALI for RAG #ai

Computer Vision

Discover AI Advanced 1y ago

New Microsoft Vision Model has AMAZING TRICKS!!!

Computer Vision ⚡ AI Lesson

1littlecoder Advanced 2y ago

Case study on CLIP: Large Multi-Modal Models for Blind & Low Vision Users | Microsoft Research Forum

Computer Vision ⚡ AI Lesson

Microsoft Research Advanced 2y ago

Robotics AI for Industrial Applications

Computer Vision

Weights & Biases Advanced 2y ago

Build computer vision applications easily with Roboflow and Google Cloud

Computer Vision

Google Cloud Advanced 2y ago

Search Engines Struggle to Process Text Efficiently: The Hidden Cost #seo

Computer Vision

Koray Tuğberk GÜBÜR Advanced 2y ago

Image Classification Using Vision Transformer | An Image is Worth 16x16 Words

Computer Vision

ExplainingAI Advanced 2y ago

Computer Vision Study Group Session on SAM

Computer Vision

HuggingFace Advanced 2y ago

Navigating Beyond Pixels in Computer Vision | Satya Mallick, CEO of OpenCV | Expert Talk 03

Computer Vision

Analytics Vidhya Advanced 2y ago

China's Qwen VL wins Big Time!!!

Computer Vision

1littlecoder Advanced 2y ago

Sanjay Subramanian - Visual Reasoning with Limited Human Labels

Computer Vision

Cohere Advanced 2y ago

Data Augmentation and Optimized Architectures for Computer Vision - Fatih Porikli - 635

Computer Vision ⚡ AI Lesson

The TWIML AI Podcast with Sam Charrington Advanced 3y ago

Digital Renaissance: Neuralangelo by NVIDIA Research Reconstructs 3D Scenes from 2D Video Clips

Computer Vision

NVIDIA Developer Advanced 3y ago

Reversible Transformer: ReFORMER for GPU Memory Optimization! Reversible Residual Layers?

Computer Vision

Discover AI Advanced 3y ago

Michael Tschannen - Image-and-Language Understanding from Pixels Only

Computer Vision

Cohere Advanced 3y ago

HPU vs GPU - The Frontier of AI Hardware

Computer Vision

Roboflow Advanced 3y ago

Roboflow 100: A New Object Detection Benchmark

Computer Vision ⚡ AI Lesson

Roboflow Advanced 3y ago

Fast Zero Shot Object Detection with OpenAI CLIP

Computer Vision ⚡ AI Lesson

James Briggs Advanced 3y ago

Performance Capture Possible With Any Camera with NVIDIA AI

Computer Vision ⚡ AI Lesson

NVIDIA Developer Advanced 3y ago

Optical Flow Estimation, Panoptic Segmentation, and Vision Transformers with Fatih Porikli - #579

Computer Vision ⚡ AI Lesson

The TWIML AI Podcast with Sam Charrington Advanced 4y ago

Lecture 13: Object Detection, Recognition and Pose Determination, PatQuick (US 7,016,539)

Computer Vision ⚡ AI Lesson

MIT OpenCourseWare Advanced 4y ago

L6: Anomaly Detection in Computer Vision

Computer Vision

ogsconnect Advanced 4y ago

L5 - Anomaly Detection in Computer Vision

Computer Vision

ogsconnect Advanced 4y ago

Customer Clustering

Computer Vision ⚡ AI Lesson

Data Skeptic Advanced 4y ago

ConvNeXt: A ConvNet for the 2020s | Paper Explained

Computer Vision

Aleksa Gordić - The AI Epiphany Advanced 4y ago

Panel: Large-scale neural platform models: Opportunities, concerns, and directions

Computer Vision ⚡ AI Lesson

Microsoft Research Advanced 4y ago

TORCHVISION 2021 | FRANCISCO MASSA

Computer Vision

PyTorch Advanced 4y ago

W&B Paper Reading Group: DETR

Computer Vision ⚡ AI Lesson

Weights & Biases Advanced 4y ago

OpenCV Python Tutorial #7 - Template Matching (Object Detection)

Computer Vision

Tech With Tim Advanced 5y ago

CLIP: Connecting Text and Images

Computer Vision ⚡ AI Lesson

Connor Shorten Advanced 5y ago

This Facebook AI model is the CHATGPT of Computer Vision (with Python Code)

Computer Vision

1littlecoder Advanced 3y ago

How To Train SegFormer on a Custom Dataset for Computer Vision

Computer Vision ⚡ AI Lesson

Roboflow Advanced 3y ago

How to Train and Deploy YOLOS on a Custom Dataset

Computer Vision

Roboflow Advanced 4y ago

L4 - Anomaly Detection in Computer Vision

Computer Vision

ogsconnect Advanced 4y ago

L3 - Anomaly Detection in Computer Vision

Computer Vision

ogsconnect Advanced 4y ago

L2 - Anomaly Detection in Computer Vision

Computer Vision

ogsconnect Advanced 4y ago

L1 - Anomaly Detection in Computer Vision

Computer Vision

ogsconnect Advanced 4y ago

[LIVE CODING] Visualizing Football Plays with Computer Vision (Part 1)

Computer Vision

Roboflow Advanced 4y ago

Visual Recognition beyond Appearances, and its Robotic Applications

Computer Vision ⚡ AI Lesson

Microsoft Research Advanced 4y ago

MDETR: Modulated Detection for End-to-End Multi-Modal Understanding

Computer Vision

Microsoft Research Advanced 4y ago

How To Do Object Tracking

Computer Vision

Roboflow Advanced 4y ago

What's New in YOLOX?

Computer Vision ⚡ AI Lesson

Roboflow Advanced 4y ago

Which Image Augmentation Steps Should You Use with Aerial Data?

Computer Vision

Roboflow Advanced 5y ago

AI advances in image captioning: Describing images as well as people do

Computer Vision ⚡ AI Lesson

Microsoft Research Advanced 5y ago

Computer Vision Predictions for 2021

Computer Vision ⚡ AI Lesson

Roboflow Advanced 5y ago

Zero-Shot Image Classification with Open AI's CLIP Model - GPT-3 for Images

Computer Vision

1littlecoder Advanced 5y ago

How to Train Scaled-YOLOv4 to Detect Custom Objects

Computer Vision

Roboflow Advanced 5y ago

📚 Continue on Coursera External links · Free to audit

Cisco Software-Defined Wan for Enterprise & Cloud: Unit 1

📚 External: Coursera ↗

Business Economics and Game Theory for Decision Making

📚 External: Coursera ↗

Network Visualization and Intervention

📚 External: Coursera ↗

Analyze Video Data Using OpenCV and Python

📚 External: Coursera ↗

Market Analysis

📚 External: Coursera ↗

6G Evolution: Blockchain, Semantic Communications & Radar

📚 External: Coursera ↗

Uptraining with Document AI Workbench

📚 External: Coursera ↗

Running Distributed TensorFlow using Vertex AI

📚 External: Coursera ↗

Traitement d'images : segmentation et caractérisation

📚 External: Coursera ↗

Build Real-Time Face Recognition with OpenCV

📚 External: Coursera ↗

Digital Marketing Foundations: Analyze & Apply Strategies

📚 External: Coursera ↗

Refine Segmentation: Boost Your AI Vision

📚 External: Coursera ↗

Advanced Algorithms and Complexity

📚 External: Coursera ↗

Finanzas para directivos

📚 External: Coursera ↗

Image and Video Processing: From Mars to Hollywood with a Stop at the Hospital

📚 External: Coursera ↗

Segment Your Audience

📚 External: Coursera ↗

Features and Boundaries

📚 External: Coursera ↗

Unity: Design & Deform Meshes for 3D Geometry Control

📚 External: Coursera ↗
