Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

112
videos
David Fan & Peter Tong  - Scaling Language Free Visual Representation Learning
👁️ Computer Vision
David Fan & Peter Tong - Scaling Language Free Visual Representation Learning
Cohere Advanced 7mo ago
Distilling Transformers and Diffusion Models for Robust Edge Use Cases [Fatih Porikli] - 738
👁️ Computer Vision
Distilling Transformers and Diffusion Models for Robust Edge Use Cases [Fatih Porikli] - 738
The TWIML AI Podcast with Sam Charrington Advanced 8mo ago
VGG From Scratch – Deep Learning Theory & PyTorch Implementation (Full Course)
👁️ Computer Vision
VGG From Scratch – Deep Learning Theory & PyTorch Implementation (Full Course)
freeCodeCamp.org Advanced 8mo ago
Transforming Guest Experiences: GoTo Foods’ Data Journey with Amperity & Databricks
👁️ Computer Vision
Transforming Guest Experiences: GoTo Foods’ Data Journey with Amperity & Databricks
Databricks Advanced 8mo ago
Train YOLO on Custom Dataset | Object Detection Step-by-Step Tutorial
👁️ Computer Vision
Train YOLO on Custom Dataset | Object Detection Step-by-Step Tutorial
Samin Learns AI Advanced 9mo ago
FastVLM brings advanced computer vision to your phone...
👁️ Computer Vision
FastVLM brings advanced computer vision to your phone...
NeuralNine Advanced 10mo ago
RF-DETR Beat YOLOs on Real-time Object Detection | Fine-Tuning | Live Coding & Q&A (Mar 27th)
👁️ Computer Vision
RF-DETR Beat YOLOs on Real-time Object Detection | Fine-Tuning | Live Coding & Q&A (Mar 27th)
Roboflow Advanced 1y ago
YOLOE: Real-time Zero-shot Object Detection | Visual Prompting | Live Coding & Q&A (Mar 14th)
👁️ Computer Vision
YOLOE: Real-time Zero-shot Object Detection | Visual Prompting | Live Coding & Q&A (Mar 14th)
Roboflow Advanced 1y ago
George Hotz | mixture of experts (like deepseek) on tinygrad sovereign AMD stack | AMD YOLO
👁️ Computer Vision
George Hotz | mixture of experts (like deepseek) on tinygrad sovereign AMD stack | AMD YOLO
george hotz archive Advanced 1y ago
Aya Vision - The Challenges & Breakthroughs
👁️ Computer Vision
Aya Vision - The Challenges & Breakthroughs
Cohere Advanced 1y ago
Microsoft’s Phi-4 SLM: Open-Source AI for Text, Vision & Audio!
👁️ Computer Vision
Microsoft’s Phi-4 SLM: Open-Source AI for Text, Vision & Audio!
Analytics Vidhya Advanced 1y ago
Deepseek is back with VISION
👁️ Computer Vision
Deepseek is back with VISION
1littlecoder Advanced 1y ago
Using Vertex AI for healthcare
👁️ Computer Vision
Using Vertex AI for healthcare
Google Cloud Tech Advanced 1y ago
Enhance Generative AI Model Accuracy Through High-Quality Multimodal Data Processing
👁️ Computer Vision
Enhance Generative AI Model Accuracy Through High-Quality Multimodal Data Processing
NVIDIA Developer Advanced 1y ago
YOLOv2 (YOLO9000) and YOLOv3 Explained
👁️ Computer Vision
YOLOv2 (YOLO9000) and YOLOv3 Explained
ExplainingAI Advanced 1y ago
New Video AI by META & Stanford Univ: APOLLO (7B)
👁️ Computer Vision
New Video AI by META & Stanford Univ: APOLLO (7B)
Discover AI Advanced 1y ago
Shreyash Arya- B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable
👁️ Computer Vision
Shreyash Arya- B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable
Cohere Advanced 1y ago
Peng Xia - RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
👁️ Computer Vision
Peng Xia - RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
Cohere Advanced 1y ago
MedAI: Vision Language Models & Fine-Tuning (KnowAda)
👁️ Computer Vision
MedAI: Vision Language Models & Fine-Tuning (KnowAda)
Discover AI Advanced 1y ago
Gwanghyun (Bradley) Kim - BeyondScene: Higher-Resolution Human-Scene Generation
👁️ Computer Vision
Gwanghyun (Bradley) Kim - BeyondScene: Higher-Resolution Human-Scene Generation
Cohere Advanced 1y ago
open-animal-tracks
👁️ Computer Vision
open-animal-tracks
Data Skeptic Advanced 1y ago
Bird Distribution Modeling with Satbird
👁️ Computer Vision
Bird Distribution Modeling with Satbird
Data Skeptic Advanced 1y ago
Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum
👁️ Computer Vision
Beyond Language: The future of multimodal models in health, gaming, & AI | Microsoft Research Forum
Microsoft Research Advanced 1y ago
Football AI | Community Q&A (Aug 29)
👁️ Computer Vision
Football AI | Community Q&A (Aug 29)
Roboflow Advanced 1y ago