Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539 lessons
Skills in this topic
CV Basics (beginner): Classify images with a pre-trained CNN
Modern CV Models (intermediate): Run YOLO for real-time object detection
Generative CV (advanced): Build a Stable Diffusion inference pipeline
All Reads (394) Articles (216) Blog Posts (117) Tutorials (47) Research Papers (13) News (1)
Distill.pub 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 5y ago
Weight Banding
Weights in the final layer of common visual models appear as horizontal bands. We investigate how and why.
Distill.pub 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 5y ago
High-Low Frequency Detectors
A family of early-vision neurons reacting to directional transitions from high to low spatial frequency.
Distill.pub 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 6y ago
An Overview of Early Vision in InceptionV1
An overview of all the neurons in the first five layers of InceptionV1, organized into a taxonomy of 'neuron groups.'
Distill.pub 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 6y ago
A Discussion of 'Adversarial Examples Are Not Bugs, They Are Features': Two Examples of Useful, Non-Robust Features
Lilian Weng's Blog 👁️ Computer Vision ⚡ AI Lesson 7y ago
Object Detection Part 4: Fast Detection Models
In Part 3, we have reviewed models in the R-CNN family. All of them are region-based object detection algorithms. They can achieve hig
Lilian Weng's Blog 👁️ Computer Vision ⚡ AI Lesson 8y ago
Object Detection for Dummies Part 3: R-CNN Family
[Updated on 2018-12-20: Remove YOLO here. Part 4 will cover multiple fast object detection algorithms, including YOLO.] [Updated on 2018-12-27: Add bbox regress
Lilian Weng's Blog 👁️ Computer Vision ⚡ AI Lesson 8y ago
Object Detection for Dummies Part 2: CNN, DPM and Overfeat
Part 1 of the “Object Detection for Dummies” series introduced: (1) the concept of image gradient vector and how HOG algorithm summarizes the inform
Distill.pub 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 8y ago
Feature Visualization
How neural networks build up their understanding of images
Lilian Weng's Blog 👁️ Computer Vision ⚡ AI Lesson 8y ago
Object Detection for Dummies Part 1: Gradient Vector, HOG, and SS
I’ve never worked in the field of
OpenAI News 👁️ Computer Vision ⚡ AI Lesson 8y ago
Robust adversarial inputs
We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that