Foundations

Computer Vision

Object detection, segmentation, YOLO, CLIP, and vision-language models

1,539
lessons
Skills in this topic
View full skill map →
CV Basics
beginner
Classify images with a pre-trained CNN
Modern CV Models
intermediate
Run YOLO for real-time object detection
Generative CV
advanced
Build a Stable Diffusion inference pipeline
All Reads (394) Articles (216)Blog Posts (117)Tutorials (47)Research Papers (13)News (1)
When the Camera Becomes an Exam Proctor: Building an AI-Powered Exam Monitoring System with…
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 20h ago
When the Camera Becomes an Exam Proctor: Building an AI-Powered Exam Monitoring System with…
How our team built a real-time Computer Vision system using YOLO, OpenCV, and DeepFace to assist professional certification exams and why… Continue reading on M
When the Camera Becomes an Exam Proctor: Building an AI-Powered Exam Monitoring System with…
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 20h ago
When the Camera Becomes an Exam Proctor: Building an AI-Powered Exam Monitoring System with…
How our team built a real-time Computer Vision system using YOLO, OpenCV, and DeepFace to assist professional certification exams and why… Continue reading on M
When the Camera Becomes an Exam Proctor: Building an AI-Powered Exam Monitoring System with…
Medium · Cybersecurity 👁️ Computer Vision ⚡ AI Lesson 20h ago
When the Camera Becomes an Exam Proctor: Building an AI-Powered Exam Monitoring System with…
How our team built a real-time Computer Vision system using YOLO, OpenCV, and DeepFace to assist professional certification exams and why… Continue reading on M
The Story of U-Net: Why It Still Dominates Image Segmentation 10 Years Later
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 4d ago
The Story of U-Net: Why It Still Dominates Image Segmentation 10 Years Later
A journey from a small university lab to solving real-world problems in medicine, satellite imagery, and beyond Continue reading on Medium »
The Story of U-Net: Why It Still Dominates Image Segmentation 10 Years Later
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 4d ago
The Story of U-Net: Why It Still Dominates Image Segmentation 10 Years Later
A journey from a small university lab to solving real-world problems in medicine, satellite imagery, and beyond Continue reading on Medium »
What is Remote Sensing?
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 1w ago
What is Remote Sensing?
This blog is inspired from iirs -distance learning programme by isro and iirs ( Institute of remote sensing Dehraun) which i had attended. Continue reading on M
Clifford Vortex Filaments: Rendering Chaotic Attractors in 3D
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1w ago
Clifford Vortex Filaments: Rendering Chaotic Attractors in 3D
“Clifford Vortex Filaments” is a generative media art piece that dives deep into chaos theory and non-linear dynamics, visualizing a… Continue reading on Medium
Application of Linear Algebra: Image Filters and Sharpening of images
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1w ago
Application of Linear Algebra: Image Filters and Sharpening of images
In the late autumn of 1889, Vincent van Gogh stood before his easel in Saint-Rémy-de-Provence, applying thick, deliberate strokes of… Continue reading on Medium
Visual Search System: Complete ML System Design
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 1w ago
Visual Search System: Complete ML System Design
A visual search system enables users to discover images that are visually similar to a selected image. Platforms such as Pinterest use… Continue reading on Medi
The 1980s Code That Rules the Geometry of Modern Video Games
Medium · Data Science 👁️ Computer Vision ⚡ AI Lesson 2w ago
The 1980s Code That Rules the Geometry of Modern Video Games
If you’ve ever climbed a mountain in Minecraft, sailed across a realistic ocean in a video game, or flown over an endless landscape that… Continue reading on Me
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 2w ago
I Built an AI Bot That Counts Calories From a Photo of Your Plate
And it spots patterns your nutritionist would catch — for $5/month Continue reading on Medium »
Algo(31/40)Real-World Perception & Action: Pixels, Boxes & Trust (2015)
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 2w ago
Algo(31/40)Real-World Perception & Action: Pixels, Boxes & Trust (2015)
By 2015, Neural Networks were excellent at saying “This is a cat.” But in the real world, that isn’t enough. A self-driving car needs to… Continue reading on Me
Teaching a Logo Detector to Say “I Don’t Know”
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 2w ago
Teaching a Logo Detector to Say “I Don’t Know”
Building BrandSpotter: a three-stage brand recognition pipeline on LogoDet-3K, and why the hardest part wasn’t detection or classification. Continue reading on
Computer Vision 101: A Data Scientist’s Guide to Image Representation, Deep Feature Extraction, and…
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 3w ago
Computer Vision 101: A Data Scientist’s Guide to Image Representation, Deep Feature Extraction, and…
“A computer does not see a landscape, a face, or a self-driving lane. It sees an infinite grid of integers. Computer Vision is the… Continue reading on Medium »
Computer Vision 101: A Data Scientist’s Guide to Image Representation, Deep Feature Extraction, and…
Medium · Data Science 👁️ Computer Vision ⚡ AI Lesson 3w ago
Computer Vision 101: A Data Scientist’s Guide to Image Representation, Deep Feature Extraction, and…
“A computer does not see a landscape, a face, or a self-driving lane. It sees an infinite grid of integers. Computer Vision is the… Continue reading on Medium »
From Pixels to Predictions: How Image Preprocessing Helps Machines See the World
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 3w ago
From Pixels to Predictions: How Image Preprocessing Helps Machines See the World
Before a Machine Can Recognize a Cat, a Car, or a Face, It Must First Learn to Understand Pixels Continue reading on Medium »
From Pixels to Predictions: How Image Preprocessing Helps Machines See the World
Medium · Data Science 👁️ Computer Vision ⚡ AI Lesson 3w ago
From Pixels to Predictions: How Image Preprocessing Helps Machines See the World
Before a Machine Can Recognize a Cat, a Car, or a Face, It Must First Learn to Understand Pixels Continue reading on Medium »
Deep Learning Essentials — (5) Adapting Pretrained Vision Models
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 3w ago
Deep Learning Essentials — (5) Adapting Pretrained Vision Models
Deep Learning Foundations, Models for Images and Sequences, and Generative AI Continue reading on Deep Learning Essentials »
Building an Ingredient-Based Visual Question Answering System for Food Images
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 4w ago
Building an Ingredient-Based Visual Question Answering System for Food Images
Food image understanding is usually treated as a classification problem. Given an image, the model predicts one label such as pizza… Continue reading on Medium
Building an Ingredient-Based Visual Question Answering System for Food Images
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 4w ago
Building an Ingredient-Based Visual Question Answering System for Food Images
Food image understanding is usually treated as a classification problem. Given an image, the model predicts one label such as pizza… Continue reading on Medium
Building an Ingredient-Based Visual Question Answering System for Food Images
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 4w ago
Building an Ingredient-Based Visual Question Answering System for Food Images
Food image understanding is usually treated as a classification problem. Given an image, the model predicts one label such as pizza… Continue reading on Medium
Building a Real-Time Fire Detection and People Counting System with InceptionV3 and OpenCV
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 4w ago
Building a Real-Time Fire Detection and People Counting System with InceptionV3 and OpenCV
How transfer learning and classical computer vision can work together on edge hardware to save lives Continue reading on Medium »
Household Item Annotation Services for AI & Computer Vision
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 4w ago
Household Item Annotation Services for AI & Computer Vision
Artificial Intelligence systems that understand indoor environments are becoming increasingly important across industries such as real… Continue reading on Medi
The Robotics Interview Series: Part 2A
Medium · Data Science 👁️ Computer Vision ⚡ AI Lesson 1mo ago
The Robotics Interview Series: Part 2A
The Perception Concepts You Need Cold (Beyond CV Fundamentals) Continue reading on Medium »
Reddit r/deeplearning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Pls suggest best resources to learn semantic segmentation
​ I want to learn it for road extraction....so please suggest the best resources submitted by /u/NoAnybody8034 [link]
Akıllı Ulaşım Sistemlerinde Görüntü İşleme Teknolojisi Kullanılarak Araç Hız Tespiti Nasıl…
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Akıllı Ulaşım Sistemlerinde Görüntü İşleme Teknolojisi Kullanılarak Araç Hız Tespiti Nasıl…
Bir trafik kamerası size bir aracın kaç km/h hızla geçtiğini söyleyebilir mi? Yazılım katmanı olmadan hayır. Bu yazı, bu yazılım katmanını… Continue reading on
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Shot detection is the cheap feature everyone underestimates
A friend of mine spent two months trying to add a “smart preview” feature to a video product, the kind of thing you see on every modern… Continue reading on Med
cv3 — make OpenCV pythonic again
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
cv3 — make OpenCV pythonic again
TL;DR cv3 is a Pythonic wrapper for OpenCV that simplifies computer vision tasks by providing more intuitive interfaces and eliminating… Continue reading on Med
cv3 — make OpenCV pythonic again
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
cv3 — make OpenCV pythonic again
TL;DR cv3 is a Pythonic wrapper for OpenCV that simplifies computer vision tasks by providing more intuitive interfaces and eliminating… Continue reading on Med
SentinelML
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
SentinelML
A modular, open-source framework for real-time firearm detection and alerting using YOLOv8 and cloud-native infrastructure. Continue reading on Medium »
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
sen2p: Download Sentinel-2 Imagery Without API Keys or Extra Setup
A lightweight Python library that makes Sentinel-2 imagery easier to search and download. Continue reading on GeoAI »
R-CNN : The Foundation of Deep Learning-Based Object Detection
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
R-CNN : The Foundation of Deep Learning-Based Object Detection
Object detection is one of the most important tasks in computer vision. Unlike image classification, where the goal is only to identify… Continue reading on Med
I Built a 7-Stage OCR Pipeline to Make Gemini Vision Actually Reliable
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
I Built a 7-Stage OCR Pipeline to Make Gemini Vision Actually Reliable
We all know LLMs are powerful. But they’re also probabilistic — and that’s the problem. The real job of an AI engineer isn’t just to call… Continue reading on M
Traffic Light Recognition (TLR) Architecture: 2D Bounding Box Detection
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Traffic Light Recognition (TLR) Architecture: 2D Bounding Box Detection
The TLR model is a Fully Convolutional Network (FCN) + FPN + Header model, utilizing an “anchor-free” approach. Instead of guessing… Continue reading on Medium
Inside SAM 3D: how Meta turns a single image into 3D
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Inside SAM 3D: how Meta turns a single image into 3D
For about forty years, “3D” in the practical sense meant one thing: triangle meshes. Every game shipped, every animated film rendered… Continue reading on Mediu
Inside SAM 3D: how Meta turns a single image into 3D
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Inside SAM 3D: how Meta turns a single image into 3D
For about forty years, “3D” in the practical sense meant one thing: triangle meshes. Every game shipped, every animated film rendered… Continue reading on Mediu
Demystifying CNNs: How Convolutional Filters and Max-Pooling Actually Work
Medium · Data Science 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Demystifying CNNs: How Convolutional Filters and Max-Pooling Actually Work
If you’ve ever wondered how a computer can look at a photo of a car and instantly know it’s a car, you’re looking at the magic of… Continue reading on Medium »
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
This project speaks for itself. It covers three crucial steps in one go: motion tracking, skeleton reconstruction, and 3D animation. What… Continue reading on M
Panduan Praktis Optimasi Pencahayaan Citra Digital dengan Python
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Panduan Praktis Optimasi Pencahayaan Citra Digital dengan Python
Mengapa Pencahayaan Itu Krusial? ​Pernahkah Anda mengambil foto di kondisi minim cahaya dan mendapati hasilnya sangat gelap hingga… Continue reading on Medium »
Efficiency vs. Precision: A Python Deep Dive into Faster R-CNN and SSD PyTorch
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Efficiency vs. Precision: A Python Deep Dive into Faster R-CNN and SSD PyTorch
In the rapidly evolving landscape of artificial intelligence, selecting the optimal architecture for computer vision is rarely a simple… Continue reading on Obj
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Dari Pixel ke AI: Bagaimana Komputer Memahami Sebuah Gambar
“Sebuah eksplorasi sederhana tentang bagaimana gambar digital diubah menjadi informasi yang dapat dipahami oleh Artificial Intelligence.”… Continue reading on M
Teaching a Random Forest to Tell Walking from Running: A Computer Vision Pipeline with Hand-Built...
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Teaching a Random Forest to Tell Walking from Running: A Computer Vision Pipeline with Hand-Built...
How a 56-feature baseline became a 240-feature classifier at 86% accuracy, with per-class SHAP guiding every feature engineering decision. Continue reading on M
Mengenal Lebih Dekat Deteksi Tepi Canny Pada Pengolahan Citra Digital dengan python dan opencv
Medium · Python 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Mengenal Lebih Dekat Deteksi Tepi Canny Pada Pengolahan Citra Digital dengan python dan opencv
Dalam dunia pengolahan citra digital, mendeteksi batas suatu objek merupakan hal yang sangat penting. Continue reading on Medium »
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 4
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 4
Add multi-camera support to a Python face recognition system using threaded OpenCV capture for faster, non-blocking real-time video Continue reading on Medium »
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 4
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Building Samaritan: A Multi-Camera Real-Time Face Recognition System in Python — Part 4
Add multi-camera support to a Python face recognition system using threaded OpenCV capture for faster, non-blocking real-time video Continue reading on Medium »
Building an AI-Based Exam Monitoring System Using Computer Vision, YOLO, and OpenCV
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
Building an AI-Based Exam Monitoring System Using Computer Vision, YOLO, and OpenCV
Introduction Continue reading on Medium »
IMPLEMENTING FASTER RCNN FROM SCRATCH IN PYTORCH FOR OBJECT DETECTION — PART ONE
Medium · Deep Learning 👁️ Computer Vision ⚡ AI Lesson 1mo ago
IMPLEMENTING FASTER RCNN FROM SCRATCH IN PYTORCH FOR OBJECT DETECTION — PART ONE
Learning computer vision has been an exciting journey over the past few weeks. From data preprocessing to model evaluation, every new… Continue reading on Mediu
What is Camera Calibration? How It Helps in Computer Vision Tasks
Medium · Machine Learning 👁️ Computer Vision ⚡ AI Lesson 2mo ago
What is Camera Calibration? How It Helps in Computer Vision Tasks
A ground truth guide to how cameras distort reality and why calibration is critical for accurate computer vision systems. Continue reading on Medium »