Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,332

lessons

Skills in this topic

3 skills — Sign in to track your progress

View full skill map →

Classify images with a pre-trained CNN

Modern CV Models

Run YOLO for real-time object detection

Build a Stable Diffusion inference pipeline

Videos 1,120 Reads 212

Level: All Beginner Intermediate Advanced

Any Length Short (<5m) Medium (5-20m) Long (>20m)

Newest Popular Oldest

Build computer vision applications easily with Roboflow and Google Cloud

Computer Vision

Build computer vision applications easily with Roboflow and Google Cloud

Google Cloud Advanced 2y ago

Search Engines Struggle to Process Text Efficiently: The Hidden Cost #seo

Computer Vision

Search Engines Struggle to Process Text Efficiently: The Hidden Cost #seo

Koray Tuğberk GÜBÜR Advanced 2y ago

Computer Vision Study Group Session on SAM

Computer Vision

Computer Vision Study Group Session on SAM

HuggingFace Advanced 2y ago

Navigating Beyond Pixels in Computer Vision | Satya Mallick, CEO of OpenCV | Expert Talk 03

Computer Vision

Navigating Beyond Pixels in Computer Vision | Satya Mallick, CEO of OpenCV | Expert Talk 03

Analytics Vidhya Advanced 2y ago

China's Qwen VL wins Big Time!!!

Computer Vision

China's Qwen VL wins Big Time!!!

1littlecoder Advanced 2y ago

Sanjay Subramanian - Visual Reasoning with Limited Human Labels

Computer Vision

Sanjay Subramanian - Visual Reasoning with Limited Human Labels

Cohere Advanced 2y ago

Data Augmentation and Optimized Architectures for Computer Vision - Fatih Porikli - 635

Computer Vision ⚡ AI Lesson

Data Augmentation and Optimized Architectures for Computer Vision - Fatih Porikli - 635

The TWIML AI Podcast with Sam Charrington Advanced 2y ago

Digital Renaissance: Neuralangelo by NVIDIA Research Reconstructs 3D Scenes from 2D Video Clips

Computer Vision

Digital Renaissance: Neuralangelo by NVIDIA Research Reconstructs 3D Scenes from 2D Video Clips

NVIDIA Developer Advanced 2y ago

This Facebook AI model is the CHATGPT of Computer Vision (with Python Code)

Computer Vision

This Facebook AI model is the CHATGPT of Computer Vision (with Python Code)

1littlecoder Advanced 3y ago

Reversible Transformer: ReFORMER for GPU Memory Optimization! Reversible Residual Layers?

Computer Vision

Reversible Transformer: ReFORMER for GPU Memory Optimization! Reversible Residual Layers?

Discover AI Advanced 3y ago

Michael Tschannen - Image-and-Language Understanding from Pixels Only

Computer Vision

Michael Tschannen - Image-and-Language Understanding from Pixels Only

Cohere Advanced 3y ago

New TECH: Vision Transformer 2023 on Image Classification | AI

Computer Vision ⚡ AI Lesson

New TECH: Vision Transformer 2023 on Image Classification | AI

Discover AI Advanced 3y ago

HPU vs GPU - The Frontier of AI Hardware

Computer Vision

HPU vs GPU - The Frontier of AI Hardware

Roboflow Advanced 3y ago

Experiment NVIDIA TAO Toolkit and pretrained models on Google Colab

Computer Vision ⚡ AI Lesson

Experiment NVIDIA TAO Toolkit and pretrained models on Google Colab

NVIDIA Developer Advanced 3y ago

Roboflow 100 Benchmarking Tutorial with Google Colab and Docker

Computer Vision

Roboflow 100 Benchmarking Tutorial with Google Colab and Docker

Roboflow Advanced 3y ago

Fast Zero Shot Object Detection with OpenAI CLIP

Computer Vision ⚡ AI Lesson

Fast Zero Shot Object Detection with OpenAI CLIP

James Briggs Advanced 3y ago

Optical Flow Estimation, Panoptic Segmentation, and Vision Transformers with Fatih Porikli - #579

Computer Vision ⚡ AI Lesson

Optical Flow Estimation, Panoptic Segmentation, and Vision Transformers with Fatih Porikli - #579

The TWIML AI Podcast with Sam Charrington Advanced 3y ago

Lecture 13: Object Detection, Recognition and Pose Determination, PatQuick (US 7,016,539)

Computer Vision ⚡ AI Lesson

Lecture 13: Object Detection, Recognition and Pose Determination, PatQuick (US 7,016,539)

MIT OpenCourseWare Advanced 3y ago

Customer Clustering

Computer Vision ⚡ AI Lesson

Customer Clustering

Data Skeptic Advanced 4y ago

ConvNeXt: A ConvNet for the 2020s | Paper Explained

Computer Vision

ConvNeXt: A ConvNet for the 2020s | Paper Explained

Aleksa Gordić - The AI Epiphany Advanced 4y ago

Panel: Large-scale neural platform models: Opportunities, concerns, and directions

Computer Vision ⚡ AI Lesson

Panel: Large-scale neural platform models: Opportunities, concerns, and directions

Microsoft Research Advanced 4y ago

TORCHVISION 2021 | FRANCISCO MASSA

Computer Vision

TORCHVISION 2021 | FRANCISCO MASSA

PyTorch Advanced 4y ago

MDETR: Modulated Detection for End-to-End Multi-Modal Understanding

Computer Vision

MDETR: Modulated Detection for End-to-End Multi-Modal Understanding

Microsoft Research Advanced 4y ago

W&B Paper Reading Group: DETR

Computer Vision ⚡ AI Lesson

W&B Paper Reading Group: DETR

Weights & Biases Advanced 4y ago

When Vision Transformers Outperform ResNets without Pretraining | Paper Explained

Computer Vision ⚡ AI Lesson

When Vision Transformers Outperform ResNets without Pretraining | Paper Explained

Aleksa Gordić - The AI Epiphany Advanced 4y ago

Vision Transformer - Keras Code Examples!!

Computer Vision

Vision Transformer - Keras Code Examples!!

Connor Shorten Advanced 5y ago

OpenCV Python Tutorial #7 - Template Matching (Object Detection)

Computer Vision

OpenCV Python Tutorial #7 - Template Matching (Object Detection)

Tech With Tim Advanced 5y ago

CLIP: Connecting Text and Images

Computer Vision ⚡ AI Lesson

CLIP: Connecting Text and Images

Connor Shorten Advanced 5y ago

TorchVision | PyTorch Developer Day 2020

Computer Vision ⚡ AI Lesson

TorchVision | PyTorch Developer Day 2020

PyTorch Advanced 5y ago

Real-time semantic segmentation in the browser - Made with TensorFlow.js

Computer Vision ⚡ AI Lesson

Real-time semantic segmentation in the browser - Made with TensorFlow.js

TensorFlow Advanced 5y ago

New AI Text-to-Video Model powered by Stable Diffusion + ControlNet

Computer Vision ⚡ AI Lesson

New AI Text-to-Video Model powered by Stable Diffusion + ControlNet

1littlecoder Advanced 3y ago

Roboflow 100: A New Object Detection Benchmark

Computer Vision ⚡ AI Lesson

Roboflow 100: A New Object Detection Benchmark

Roboflow Advanced 3y ago

Performance Capture Possible With Any Camera with NVIDIA AI

Computer Vision ⚡ AI Lesson

Performance Capture Possible With Any Camera with NVIDIA AI

NVIDIA Developer Advanced 3y ago

How To Train SegFormer on a Custom Dataset for Computer Vision

Computer Vision ⚡ AI Lesson

How To Train SegFormer on a Custom Dataset for Computer Vision

Roboflow Advanced 3y ago

How to Train and Deploy YOLOS on a Custom Dataset

Computer Vision

How to Train and Deploy YOLOS on a Custom Dataset

Roboflow Advanced 3y ago

[LIVE CODING] Visualizing Football Plays with Computer Vision (Part 1)

Computer Vision

[LIVE CODING] Visualizing Football Plays with Computer Vision (Part 1)

Roboflow Advanced 4y ago

How To Do Object Tracking

Computer Vision

How To Do Object Tracking

Roboflow Advanced 4y ago

What's New in YOLOX?

Computer Vision ⚡ AI Lesson

What's New in YOLOX?

Roboflow Advanced 4y ago

Test-time Adaptable Neural Networks for Robust Medical Image Segmentation | JRC Workshop 2021

Computer Vision

Test-time Adaptable Neural Networks for Robust Medical Image Segmentation | JRC Workshop 2021

Microsoft Research Advanced 4y ago

Which Image Augmentation Steps Should You Use with Aerial Data?

Computer Vision

Which Image Augmentation Steps Should You Use with Aerial Data?

Roboflow Advanced 5y ago

AI advances in image captioning: Describing images as well as people do

Computer Vision ⚡ AI Lesson

AI advances in image captioning: Describing images as well as people do

Microsoft Research Advanced 5y ago

Computer Vision Predictions for 2021

Computer Vision ⚡ AI Lesson

Computer Vision Predictions for 2021

Roboflow Advanced 5y ago

Zero-Shot Image Classification with Open AI's CLIP Model - GPT-3 for Images

Computer Vision

Zero-Shot Image Classification with Open AI's CLIP Model - GPT-3 for Images

1littlecoder Advanced 5y ago

How to Train Scaled-YOLOv4 to Detect Custom Objects

Computer Vision

How to Train Scaled-YOLOv4 to Detect Custom Objects

Roboflow Advanced 5y ago

YOLOv4 - Advanced Tactics

Computer Vision

YOLOv4 - Advanced Tactics

Roboflow Advanced 5y ago

#TWIMLfest: Computer Vision Office Hours

Computer Vision ⚡ AI Lesson

#TWIMLfest: Computer Vision Office Hours

The TWIML AI Podcast with Sam Charrington Advanced 5y ago

Spatial Analysis for Real-Time Video Processing with Adina Trufinescu - #417

Computer Vision ⚡ AI Lesson

Spatial Analysis for Real-Time Video Processing with Adina Trufinescu - #417

The TWIML AI Podcast with Sam Charrington Advanced 5y ago

Elisha Odemakinde Hosts Roboflow ML Engineer, Jacob Solawetz

Computer Vision ⚡ AI Lesson

Elisha Odemakinde Hosts Roboflow ML Engineer, Jacob Solawetz

Roboflow Advanced 5y ago

📚 Coursera Courses Opens on Coursera · Free to audit

View all →

📚 Coursera Course ↗

Process Images & Extract Motion Features

Opens on Coursera ↗

Open Source Models with Hugging Face

📚 Coursera Course ↗

Open Source Models with Hugging Face

Opens on Coursera ↗

Analyze Video Data Using OpenCV and Python

📚 Coursera Course ↗

Analyze Video Data Using OpenCV and Python

Opens on Coursera ↗

Breastfeeding and Adequate Substitutes

📚 Coursera Course ↗

Breastfeeding and Adequate Substitutes

Opens on Coursera ↗

Machine Learning in Python: Analyze & Apply

📚 Coursera Course ↗

Machine Learning in Python: Analyze & Apply

Opens on Coursera ↗

📚 Coursera Course ↗

Introduction to Computer Vision

Opens on Coursera ↗

Deploy & Evaluate Vision Models Effectively

📚 Coursera Course ↗

Deploy & Evaluate Vision Models Effectively

Opens on Coursera ↗

CompTIA Cloud CV0-003: Unit 3

📚 Coursera Course ↗

CompTIA Cloud CV0-003: Unit 3

Opens on Coursera ↗

📚 Coursera Course ↗

Hands-on Data Centric Visual AI

Opens on Coursera ↗

📚 Coursera Course ↗

Optical Character Recognition (OCR) with Document AI (Python)

Opens on Coursera ↗

AI Applications: Computer Vision and Speech Recognition

📚 Coursera Course ↗

AI Applications: Computer Vision and Speech Recognition

Opens on Coursera ↗

Introduction to Deep Learning for Computer Vision

📚 Coursera Course ↗

Introduction to Deep Learning for Computer Vision

Opens on Coursera ↗

📚 Coursera Course ↗

Build an End-to-End Data Capture Pipeline using Document AI

Opens on Coursera ↗

Anatomy of the Abdomen and Pelvis; a journey from basis to clinic.

📚 Coursera Course ↗

Anatomy of the Abdomen and Pelvis; a journey from basis to clinic.

Opens on Coursera ↗

Deep Learning for Object Detection

📚 Coursera Course ↗

Deep Learning for Object Detection

Opens on Coursera ↗

Document AI: Project & API Writing

📚 Coursera Course ↗

Document AI: Project & API Writing

Opens on Coursera ↗

Explore LiDAR in 3D

📚 Coursera Course ↗

Explore LiDAR in 3D

Opens on Coursera ↗

Start Remote Sensing

📚 Coursera Course ↗

Start Remote Sensing

Opens on Coursera ↗