Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,332
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
Build computer vision applications easily with Roboflow and Google Cloud
Computer Vision
Build computer vision applications easily with Roboflow and Google Cloud
Google Cloud Advanced 2y ago
Search Engines Struggle to Process Text Efficiently: The Hidden Cost #seo
Computer Vision
Search Engines Struggle to Process Text Efficiently: The Hidden Cost #seo
Koray Tuğberk GÜBÜR Advanced 2y ago
Computer Vision Study Group Session on SAM
Computer Vision
Computer Vision Study Group Session on SAM
HuggingFace Advanced 2y ago
Navigating Beyond Pixels in Computer Vision | Satya Mallick, CEO of OpenCV | Expert Talk 03
Computer Vision
Navigating Beyond Pixels in Computer Vision | Satya Mallick, CEO of OpenCV | Expert Talk 03
Analytics Vidhya Advanced 2y ago
China's Qwen VL wins Big Time!!!
Computer Vision
China's Qwen VL wins Big Time!!!
1littlecoder Advanced 2y ago
Sanjay Subramanian - Visual Reasoning with Limited Human Labels
Computer Vision
Sanjay Subramanian - Visual Reasoning with Limited Human Labels
Cohere Advanced 2y ago
Data Augmentation and Optimized Architectures for Computer Vision - Fatih Porikli - 635
Computer Vision ⚡ AI Lesson
Data Augmentation and Optimized Architectures for Computer Vision - Fatih Porikli - 635
The TWIML AI Podcast with Sam Charrington Advanced 2y ago
Digital Renaissance: Neuralangelo by NVIDIA Research Reconstructs 3D Scenes from 2D Video Clips
Computer Vision
Digital Renaissance: Neuralangelo by NVIDIA Research Reconstructs 3D Scenes from 2D Video Clips
NVIDIA Developer Advanced 2y ago
This Facebook AI model is the CHATGPT of Computer Vision (with Python Code)
Computer Vision
This Facebook AI model is the CHATGPT of Computer Vision (with Python Code)
1littlecoder Advanced 3y ago
Reversible Transformer: ReFORMER for GPU Memory Optimization! Reversible Residual Layers?
Computer Vision
Reversible Transformer: ReFORMER for GPU Memory Optimization! Reversible Residual Layers?
Discover AI Advanced 3y ago
Michael Tschannen -  Image-and-Language Understanding from Pixels Only
Computer Vision
Michael Tschannen - Image-and-Language Understanding from Pixels Only
Cohere Advanced 3y ago
New TECH: Vision Transformer 2023 on Image Classification | AI
Computer Vision ⚡ AI Lesson
New TECH: Vision Transformer 2023 on Image Classification | AI
Discover AI Advanced 3y ago
HPU vs GPU - The Frontier of AI Hardware
Computer Vision
HPU vs GPU - The Frontier of AI Hardware
Roboflow Advanced 3y ago
Experiment NVIDIA TAO Toolkit and pretrained models on Google Colab
Computer Vision ⚡ AI Lesson
Experiment NVIDIA TAO Toolkit and pretrained models on Google Colab
NVIDIA Developer Advanced 3y ago
Roboflow 100 Benchmarking Tutorial with Google Colab and Docker
Computer Vision
Roboflow 100 Benchmarking Tutorial with Google Colab and Docker
Roboflow Advanced 3y ago
Fast Zero Shot Object Detection with OpenAI CLIP
Computer Vision ⚡ AI Lesson
Fast Zero Shot Object Detection with OpenAI CLIP
James Briggs Advanced 3y ago
Optical Flow Estimation, Panoptic Segmentation, and Vision Transformers with Fatih Porikli - #579
Computer Vision ⚡ AI Lesson
Optical Flow Estimation, Panoptic Segmentation, and Vision Transformers with Fatih Porikli - #579
The TWIML AI Podcast with Sam Charrington Advanced 3y ago
Lecture 13: Object Detection, Recognition and Pose Determination, PatQuick (US 7,016,539)
Computer Vision ⚡ AI Lesson
Lecture 13: Object Detection, Recognition and Pose Determination, PatQuick (US 7,016,539)
MIT OpenCourseWare Advanced 3y ago
Customer Clustering
Computer Vision ⚡ AI Lesson
Customer Clustering
Data Skeptic Advanced 4y ago
ConvNeXt: A ConvNet for the 2020s | Paper Explained
Computer Vision
ConvNeXt: A ConvNet for the 2020s | Paper Explained
Aleksa Gordić - The AI Epiphany Advanced 4y ago
Panel: Large-scale neural platform models: Opportunities, concerns, and directions
Computer Vision ⚡ AI Lesson
Panel: Large-scale neural platform models: Opportunities, concerns, and directions
Microsoft Research Advanced 4y ago
TORCHVISION 2021 | FRANCISCO MASSA
Computer Vision
TORCHVISION 2021 | FRANCISCO MASSA
PyTorch Advanced 4y ago
MDETR: Modulated Detection for End-to-End Multi-Modal Understanding
Computer Vision
MDETR: Modulated Detection for End-to-End Multi-Modal Understanding
Microsoft Research Advanced 4y ago
W&B Paper Reading Group: DETR
Computer Vision ⚡ AI Lesson
W&B Paper Reading Group: DETR
Weights & Biases Advanced 4y ago
When Vision Transformers Outperform ResNets without Pretraining | Paper Explained
Computer Vision ⚡ AI Lesson
When Vision Transformers Outperform ResNets without Pretraining | Paper Explained
Aleksa Gordić - The AI Epiphany Advanced 4y ago
Vision Transformer - Keras Code Examples!!
Computer Vision
Vision Transformer - Keras Code Examples!!
Connor Shorten Advanced 5y ago
OpenCV Python Tutorial #7 - Template Matching (Object Detection)
Computer Vision
OpenCV Python Tutorial #7 - Template Matching (Object Detection)
Tech With Tim Advanced 5y ago
CLIP: Connecting Text and Images
Computer Vision ⚡ AI Lesson
CLIP: Connecting Text and Images
Connor Shorten Advanced 5y ago
TorchVision | PyTorch Developer Day 2020
Computer Vision ⚡ AI Lesson
TorchVision | PyTorch Developer Day 2020
PyTorch Advanced 5y ago
Real-time semantic segmentation in the browser  - Made with TensorFlow.js
Computer Vision ⚡ AI Lesson
Real-time semantic segmentation in the browser - Made with TensorFlow.js
TensorFlow Advanced 5y ago
New AI Text-to-Video Model powered by Stable Diffusion + ControlNet
Computer Vision ⚡ AI Lesson
New AI Text-to-Video Model powered by Stable Diffusion + ControlNet
1littlecoder Advanced 3y ago
Roboflow 100: A New Object Detection Benchmark
Computer Vision ⚡ AI Lesson
Roboflow 100: A New Object Detection Benchmark
Roboflow Advanced 3y ago
Performance Capture Possible With Any Camera with NVIDIA AI
Computer Vision ⚡ AI Lesson
Performance Capture Possible With Any Camera with NVIDIA AI
NVIDIA Developer Advanced 3y ago
How To Train SegFormer on a Custom Dataset for Computer Vision
Computer Vision ⚡ AI Lesson
How To Train SegFormer on a Custom Dataset for Computer Vision
Roboflow Advanced 3y ago
How to Train and Deploy YOLOS on a Custom Dataset
Computer Vision
How to Train and Deploy YOLOS on a Custom Dataset
Roboflow Advanced 3y ago
[LIVE CODING] Visualizing Football Plays with Computer Vision (Part 1)
Computer Vision
[LIVE CODING] Visualizing Football Plays with Computer Vision (Part 1)
Roboflow Advanced 4y ago
How To Do Object Tracking
Computer Vision
How To Do Object Tracking
Roboflow Advanced 4y ago
What's New in YOLOX?
Computer Vision ⚡ AI Lesson
What's New in YOLOX?
Roboflow Advanced 4y ago
Test-time Adaptable Neural Networks for Robust Medical Image Segmentation | JRC Workshop 2021
Computer Vision
Test-time Adaptable Neural Networks for Robust Medical Image Segmentation | JRC Workshop 2021
Microsoft Research Advanced 4y ago
Which Image Augmentation Steps Should You Use with Aerial Data?
Computer Vision
Which Image Augmentation Steps Should You Use with Aerial Data?
Roboflow Advanced 5y ago
AI advances in image captioning: Describing images as well as people do
Computer Vision ⚡ AI Lesson
AI advances in image captioning: Describing images as well as people do
Microsoft Research Advanced 5y ago
Computer Vision Predictions for 2021
Computer Vision ⚡ AI Lesson
Computer Vision Predictions for 2021
Roboflow Advanced 5y ago
Zero-Shot Image Classification with Open AI's CLIP Model - GPT-3 for Images
Computer Vision
Zero-Shot Image Classification with Open AI's CLIP Model - GPT-3 for Images
1littlecoder Advanced 5y ago
How to Train Scaled-YOLOv4 to Detect Custom Objects
Computer Vision
How to Train Scaled-YOLOv4 to Detect Custom Objects
Roboflow Advanced 5y ago
YOLOv4 - Advanced Tactics
Computer Vision
YOLOv4 - Advanced Tactics
Roboflow Advanced 5y ago
#TWIMLfest: Computer Vision Office Hours
Computer Vision ⚡ AI Lesson
#TWIMLfest: Computer Vision Office Hours
The TWIML AI Podcast with Sam Charrington Advanced 5y ago
Spatial Analysis for Real-Time Video Processing with Adina Trufinescu - #417
Computer Vision ⚡ AI Lesson
Spatial Analysis for Real-Time Video Processing with Adina Trufinescu - #417
The TWIML AI Podcast with Sam Charrington Advanced 5y ago
Elisha Odemakinde Hosts Roboflow ML Engineer, Jacob Solawetz
Computer Vision ⚡ AI Lesson
Elisha Odemakinde Hosts Roboflow ML Engineer, Jacob Solawetz
Roboflow Advanced 5y ago
📚 Coursera Courses Opens on Coursera · Free to audit
1 / 3 View all →
Process Images & Extract Motion Features
📚 Coursera Course ↗
Self-paced
Process Images & Extract Motion Features
Opens on Coursera ↗
Open Source Models with Hugging Face
📚 Coursera Course ↗
Self-paced
Open Source Models with Hugging Face
Opens on Coursera ↗
Analyze Video Data Using OpenCV and Python
📚 Coursera Course ↗
Self-paced
Analyze Video Data Using OpenCV and Python
Opens on Coursera ↗
Breastfeeding and Adequate Substitutes
📚 Coursera Course ↗
Self-paced
Breastfeeding and Adequate Substitutes
Opens on Coursera ↗
Machine Learning in Python: Analyze & Apply
📚 Coursera Course ↗
Self-paced
Machine Learning in Python: Analyze & Apply
Opens on Coursera ↗
Introduction to Computer Vision
📚 Coursera Course ↗
Self-paced
Introduction to Computer Vision
Opens on Coursera ↗