Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
Using RTSP Streams for Computer Vision | Tracking & Counting Objects
Computer Vision
Using RTSP Streams for Computer Vision | Tracking & Counting Objects
Roboflow Intermediate 1y ago
The era of unbounded products: Designing for Multimodal IO: Ben Hylak
Computer Vision
The era of unbounded products: Designing for Multimodal IO: Ben Hylak
AI Engineer Intermediate 1y ago
Use Dedicated Deployments with Computer Vision Workflows
Computer Vision
Use Dedicated Deployments with Computer Vision Workflows
Roboflow Intermediate 1y ago
I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.
Computer Vision
I’ve been doing marketing for 20 years now, and here’s my biggest source of inspiration.
Neil Patel Intermediate 1y ago
Organize PDFs Efficiently: Build a Streamlit PDF Sorter Application using LangChain and  Llama 3.1
Computer Vision
Organize PDFs Efficiently: Build a Streamlit PDF Sorter Application using LangChain and Llama 3.1
Muhammad Moin Intermediate 1y ago
Qwen2-VL: The Best Open Source Vision Model for OCR & VQA
Computer Vision ⚡ AI Lesson
Qwen2-VL: The Best Open Source Vision Model for OCR & VQA
AI Anytime Intermediate 1y ago
Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed
Computer Vision ⚡ AI Lesson
Joy Buolamwini—trail-blazing AI ethicist outlines the dark side of image recognition on #DataFramed
DataCamp Intermediate 1y ago
How to run SAM 2 (Segment Anything AI Model)?
Computer Vision ⚡ AI Lesson
How to run SAM 2 (Segment Anything AI Model)?
AI Anytime Intermediate 1y ago
SAM 2 is going to transform COMPUTER VISION!!!
Computer Vision
SAM 2 is going to transform COMPUTER VISION!!!
1littlecoder Intermediate 1y ago
New Way Now: McLaren Racing is shifting performance into top gear with Google Cloud
Computer Vision
New Way Now: McLaren Racing is shifting performance into top gear with Google Cloud
Google Cloud Intermediate 1y ago
Excitement for the Generative AI era: Multi-Modal inputs
Computer Vision
Excitement for the Generative AI era: Multi-Modal inputs
Weights & Biases Intermediate 1y ago
Reimagine document processing and understanding with generative AI
Computer Vision
Reimagine document processing and understanding with generative AI
Google Cloud Intermediate 1y ago
[CVPR2024] Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation
Computer Vision
[CVPR2024] Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation
anucvml Intermediate 2y ago
Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...
Computer Vision ⚡ AI Lesson
Walid Bousselham - LeGrad: An Explainability for Vision Transformers via...
Cohere Intermediate 2y ago
Multimodal AI Business Companions
Computer Vision
Multimodal AI Business Companions
Daniel Finkenstadt Intermediate 2y ago
Enrich Scenario Planning with Multimodal Wargames
Computer Vision
Enrich Scenario Planning with Multimodal Wargames
Daniel Finkenstadt Intermediate 2y ago
Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni
2:29
Computer Vision ⚡ AI Lesson
Can New GPT Model Read Music Notation? Multimodal GPT-4o Omni
Burned Guitarist Intermediate 2y ago
New2Cyber en Espanol | El final de la era del profesional de seguridad
Computer Vision ⚡ AI Lesson
New2Cyber en Espanol | El final de la era del profesional de seguridad
SANS Institute Intermediate 2y ago
How To Fine-tune LLaVA Model (From Your Laptop!)
Computer Vision
How To Fine-tune LLaVA Model (From Your Laptop!)
Brev Intermediate 2y ago
Mean Average Precision (mAP) | Explanation and Implementation for Object Detection
Computer Vision
Mean Average Precision (mAP) | Explanation and Implementation for Object Detection
ExplainingAI Intermediate 2y ago
Is synthetic data from generative models ready for image recognition? (ICLR 2023, spotlight)
Computer Vision
Is synthetic data from generative models ready for image recognition? (ICLR 2023, spotlight)
MIPAL-SNU Intermediate 2y ago
Bringing AI to the Masses with Adam D'Angelo, CEO of Quora
Computer Vision ⚡ AI Lesson
Bringing AI to the Masses with Adam D'Angelo, CEO of Quora
a16z Intermediate 2y ago
Multi-Modal NSFW Detection with AI
Computer Vision
Multi-Modal NSFW Detection with AI
James Briggs Intermediate 2y ago
This VLM can be your MultiModal AI with less than 6GB Memory!!!
Computer Vision
This VLM can be your MultiModal AI with less than 6GB Memory!!!
1littlecoder Intermediate 2y ago
New course with Hugging Face: Open Source Models with Hugging Face
Computer Vision ⚡ AI Lesson
New course with Hugging Face: Open Source Models with Hugging Face
DeepLearningAI Intermediate 2y ago
Multimodality: The Next Big Step (Demis Hassabis - Google DeepMind CEO)
Computer Vision
Multimodality: The Next Big Step (Demis Hassabis - Google DeepMind CEO)
Dwarkesh Patel Intermediate 2y ago
How hard are computer vision datasets? Calibrating dataset difficulty to viewing time
Computer Vision
How hard are computer vision datasets? Calibrating dataset difficulty to viewing time
MIPAL-SNU Intermediate 2y ago
Vision Transformer (ViT)
Computer Vision
Vision Transformer (ViT)
Machine Learning Studio Intermediate 2y ago
The Future Of Computer Vision
Computer Vision ⚡ AI Lesson
The Future Of Computer Vision
a16z Intermediate 2y ago
Create a Custom Document Extractor with Document AI
Computer Vision ⚡ AI Lesson
Create a Custom Document Extractor with Document AI
Google Cloud Tech Intermediate 2y ago
Tune in to know what are the most exciting opportunities to look out for in computer vision!
Computer Vision ⚡ AI Lesson
Tune in to know what are the most exciting opportunities to look out for in computer vision!
The TWIML AI Podcast with Sam Charrington Intermediate 2y ago
Vision community faces evaluation challenges and should lean on cost-effective automatic evaluation
Computer Vision ⚡ AI Lesson
Vision community faces evaluation challenges and should lean on cost-effective automatic evaluation
The TWIML AI Podcast with Sam Charrington Intermediate 2y ago
Image segmentation - ML on Android with MediaPipe Series
Computer Vision
Image segmentation - ML on Android with MediaPipe Series
Google for Developers Intermediate 2y ago
AI Revolutionizing Immigration: Streamlining Visa Processing 🌐✈️ #AIInImmigration #VisaProcessing
Computer Vision
AI Revolutionizing Immigration: Streamlining Visa Processing 🌐✈️ #AIInImmigration #VisaProcessing
LawSikho Technology & AI Law Intermediate 2y ago
Want to find the BEST segmentation for your business?
Computer Vision
Want to find the BEST segmentation for your business?
Adam Erhart Intermediate 2y ago
How does AI aid in immigration document verification and processing for visas and asylum cases?
Computer Vision
How does AI aid in immigration document verification and processing for visas and asylum cases?
LawSikho Technology & AI Law Intermediate 2y ago
Accelerating Explorations in Vision and Multimodal AI Using Pytorch...- Nicolas, Philip, Evan & Peng
Computer Vision
Accelerating Explorations in Vision and Multimodal AI Using Pytorch...- Nicolas, Philip, Evan & Peng
PyTorch Intermediate 2y ago
TIME Best Invention of 2023: NVIDIA Neuralangelo
Computer Vision
TIME Best Invention of 2023: NVIDIA Neuralangelo
NVIDIA Developer Intermediate 2y ago
Object detection using Yolo V8
Computer Vision
Object detection using Yolo V8
Developers Hutt Intermediate 2y ago
Football AI Tutorial: From Basics to Advanced Stats with Python
Computer Vision
Football AI Tutorial: From Basics to Advanced Stats with Python
Roboflow Intermediate 1y ago
Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI
Computer Vision ⚡ AI Lesson
Computer Vision Hardware Configuration | Cameras, lenses, and GPUs for vision AI
Roboflow Intermediate 1y ago
PaliGemma by Google: Train Model on Custom Detection Dataset
Computer Vision
PaliGemma by Google: Train Model on Custom Detection Dataset
Roboflow Intermediate 2y ago
YOLOv9 Tutorial: Train Model on Custom Dataset | How to Deploy YOLOv9
Computer Vision ⚡ AI Lesson
YOLOv9 Tutorial: Train Model on Custom Dataset | How to Deploy YOLOv9
Roboflow Intermediate 2y ago
Big Ideas 2024: New Applications for Computer Vision and Video Intelligence with Kimberly Tan
Computer Vision ⚡ AI Lesson
Big Ideas 2024: New Applications for Computer Vision and Video Intelligence with Kimberly Tan
a16z Intermediate 2y ago
¿La verdadera razón detrás de la transformación digital?
Computer Vision
¿La verdadera razón detrás de la transformación digital?
Google Cloud Intermediate 2y ago
Can AI-Inventions Be Patented in India? Exploring Patent Law Dynamics! 🤖💡 #AIPatents #PatentLaw
Computer Vision
Can AI-Inventions Be Patented in India? Exploring Patent Law Dynamics! 🤖💡 #AIPatents #PatentLaw
LawSikho Technology & AI Law Intermediate 2y ago
C360 for BigQuery powered by Lytics fuels next gen AI, analytics, and predictions
Computer Vision
C360 for BigQuery powered by Lytics fuels next gen AI, analytics, and predictions
Google Cloud Intermediate 2y ago
AI.engineer 2023: Live Coding a Multimodal Game, paint.wtf
Computer Vision
AI.engineer 2023: Live Coding a Multimodal Game, paint.wtf
Roboflow Intermediate 2y ago
📚 Continue on Coursera External links · Free to audit
1 / 3 View all →
Materiales para envase y embalaje
📚 External: Coursera ↗
Self-paced
Materiales para envase y embalaje
Opens on Coursera ↗
Advancing Your Career in Computer Vision Engineering
📚 External: Coursera ↗
Self-paced
Advancing Your Career in Computer Vision Engineering
Opens on Coursera ↗
Jetson Nano Starter to Pro - A Computer Vision Course
📚 External: Coursera ↗
Self-paced
Jetson Nano Starter to Pro - A Computer Vision Course
Opens on Coursera ↗
International Marketing Strategies and Global Trade
📚 External: Coursera ↗
Self-paced
International Marketing Strategies and Global Trade
Opens on Coursera ↗
Image and Video Processing: From Mars to Hollywood with a Stop at the Hospital
📚 External: Coursera ↗
Self-paced
Image and Video Processing: From Mars to Hollywood with a Stop at the Hospital
Opens on Coursera ↗
Vision Models: Train and Evaluate
📚 External: Coursera ↗
Self-paced
Vision Models: Train and Evaluate
Opens on Coursera ↗