Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

548
videos
Why Vision Language Models Ignore What They See [Munawar Hayat] - 758
👁️ Computer Vision
Why Vision Language Models Ignore What They See [Munawar Hayat] - 758
TWIML AI Podcast Beginner 3mo ago
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
👁️ Computer Vision
DeepSeek V3.2 Speciale Testing – Can It Handle Complex Tasks Without Tools?
Muhammad Moin Beginner 3mo ago
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
👁️ Computer Vision
Build a Hybrid CSV Intelligence Agent with RAG, Pandas, and LLM Judge
Muhammad Moin Beginner 3mo ago
I Took the Duolingo English Test and Here’s What Happened
👁️ Computer Vision
I Took the Duolingo English Test and Here’s What Happened
Teacher Luke - Duolingo English Test Beginner 3mo ago
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
👁️ Computer Vision
Multimodal RAG: Chat with Complex PDFs (Text, Tables & Images)
Muhammad Moin Beginner 3mo ago
Why are Transformers replacing CNNs?
👁️ Computer Vision
Why are Transformers replacing CNNs?
Julia Turc Beginner 3mo ago
What is reciprocal rank fusion in hybrid search?
👁️ Computer Vision
What is reciprocal rank fusion in hybrid search?
Abhishek Thakur Beginner 4mo ago
Should AI be introduced to kids early?  #podcast #interview
👁️ Computer Vision
Should AI be introduced to kids early? #podcast #interview
Abhishek Thakur Beginner 4mo ago
Multimodal and Multi-model AI in Action
👁️ Computer Vision
Multimodal and Multi-model AI in Action
Microsoft 365 Developer Beginner 4mo ago
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
👁️ Computer Vision
What is Segment Anything 3 (SAM3)? Live Q&A with Meta's Engineers Behind the Model
Roboflow Beginner 4mo ago
A no nonsense intro to BM25
👁️ Computer Vision
A no nonsense intro to BM25
Abhishek Thakur Beginner 4mo ago
Getting Started With Transformers for Computer Vision - Divya Swaminathan & Tony Reina
👁️ Computer Vision
Getting Started With Transformers for Computer Vision - Divya Swaminathan & Tony Reina
PyData Beginner 4mo ago
Vibe + VSCode + Codex = Search UI
👁️ Computer Vision
Vibe + VSCode + Codex = Search UI
Abhishek Thakur Beginner 4mo ago
Validate Actions with Vision AI | Building a Web App for Real-Time Drinking Detection
👁️ Computer Vision
Validate Actions with Vision AI | Building a Web App for Real-Time Drinking Detection
Roboflow Beginner 4mo ago
Vision AI in a Web Browser: Creating a Scavenger Hunt App with Inference.js
👁️ Computer Vision
Vision AI in a Web Browser: Creating a Scavenger Hunt App with Inference.js
Roboflow Beginner 4mo ago
Choosing Your Path: AI Professional Program Course Selection Guide
👁️ Computer Vision
Choosing Your Path: AI Professional Program Course Selection Guide
Stanford Online Beginner 4mo ago
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
👁️ Computer Vision
Vibe Coding with AI in 2025 – Build Anything with Google AI Studio
Muhammad Moin Beginner 4mo ago
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
👁️ Computer Vision
From Parking Lots to Airports: How Metropolis Uses AI for Seamless Payments
The Information Beginner 4mo ago
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
👁️ Computer Vision
ExecuTorch 1.0: General Availability Status for Mobile and Embedded...- Mergen Nachin & Cemal Bilgin
PyTorch Beginner 4mo ago
Building, learning and teaching with AI (w/ Parul Pandey)
👁️ Computer Vision
Building, learning and teaching with AI (w/ Parul Pandey)
Abhishek Thakur Beginner 4mo ago
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
👁️ Computer Vision
Ask the Engineers: RF-DETR Segmentation and Creating Best-in-Class Vision Models for the Edge
Roboflow Beginner 5mo ago
How The Field Museum Unlocks New Research Possibilities with Vision AI
👁️ Computer Vision
How The Field Museum Unlocks New Research Possibilities with Vision AI
Roboflow Beginner 5mo ago
OneDrive’s AI is scanning your PHOTOS
👁️ Computer Vision
OneDrive’s AI is scanning your PHOTOS
David Bombal Beginner 5mo ago
How Jesai Scored a Perfect 160 on the Duolingo English Test (DET)!
👁️ Computer Vision
How Jesai Scored a Perfect 160 on the Duolingo English Test (DET)!
Teacher Luke - Duolingo English Test Beginner 5mo ago