1,115 lessons

👁️ Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

From DETR to SAM2: Reviewing the TOP Vision AI Advances of 2024
Roboflow Beginner 1y ago
Multimodal AI Agents Are Revolutionising Image & Video Analysis!
Mervin Praison Beginner 1y ago
Next AI Project is Image Classification in Python🔍🤖
Tech With Tim Intermediate 1y ago
YOLOv2 (YOLO9000) and YOLOv3 Explained
ExplainingAI Advanced 1y ago
Does anyone even understand what quantum computing is for? Presented by @amazonwebservices
The Verge Intermediate 1y ago
Best of 2024 in Vision [LS Live @ NeurIPS]
Latent Space Intermediate 1y ago
How to Do Email Segmentation the Right Way
Spark Bridge Digital | Email Marketing Agency Intermediate 1y ago
OpenAI DevDay 2024 | Multimodal apps with the Realtime API
OpenAI Intermediate 1y ago
New Video AI by META & Stanford Univ: APOLLO (7B)
Discover AI Advanced 1y ago
Latent Space LIVE! - Best of 2024: Startups, Vision, Open Src, Reasoning, & The Great Scaling Debate
Latent Space Beginner 1y ago
Florence-2: Create and Deploy a Custom Vision Language Model
Roboflow Intermediate 1y ago
Use Semantic Search to Create Computer Vision Datasets
Roboflow Beginner 1y ago
SAM-2.1: How to Fine-Tune for Image Segmentation
Roboflow Beginner 1y ago
Shreyash Arya- B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable
Cohere Advanced 1y ago
Peng Xia - RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
Cohere Advanced 1y ago
MediaPipe Web: Bringing cross-platform AI tech to the browser
Chrome for Developers Intermediate 1y ago
Multimodal Embeddings: Introduction & Use Cases (with Python)
Shaw Talebi Beginner 1y ago
How to Build a Smart Parking System - License Plate Detection & OCR
Roboflow Beginner 1y ago
Insights from a Kaggle Grandmaster: Multimodal Models, Agents, Document AI & more
Analytics Vidhya Beginner 1y ago
MedAI: Vision Language Models & Fine-Tuning (KnowAda)
Discover AI Advanced 1y ago
Moondream: how does a tiny vision model slap so hard? — Vikhyat Korrapati
AI Engineer Intermediate 1y ago
Transformers.js: State-of-the-art Machine Learning for the web
Chrome for Developers Intermediate 1y ago
Web AI Summit 2024: State of client side machine learning
Chrome for Developers Beginner 1y ago
NLP Engineer & Computer Vision Engineer #codebasics #nlp #computervision #datajob #shorts
codebasics Beginner 1y ago