Foundations
Computer Vision
Object detection, segmentation, YOLO, CLIP, and vision-language models
Skills in this topic
3 skills — Sign in to track your progress

Medium · Cybersecurity
👁️ Computer Vision
⚡ AI Lesson
1mo ago
OSI Modeli: Ezberlenen 7 Katmandan Daha Fazlası
Siber güvenlik veya network dünyasına yeni giren herkesin karşısına bir noktada OSI modeli çıkar. Continue reading on Medium »

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1mo ago
PySIFT: GPU Accelerated SIFT for Modern Era
In computer vision, the Scale-Invariant Feature Transform (SIFT) algorithm remains a classic foundational standard for keypoint detection… Continue reading on M

Medium · Machine Learning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Python for Data Science & AI · Blog 18 of 20 — CNNs for Image Classification
From filters to feature maps: building networks that actually see. Continue reading on Medium »

Medium · Data Science
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Python for Data Science & AI · Blog 18 of 20 — CNNs for Image Classification
From filters to feature maps: building networks that actually see. Continue reading on Medium »

Medium · Programming
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Python for Data Science & AI · Blog 18 of 20 — CNNs for Image Classification
From filters to feature maps: building networks that actually see. Continue reading on Medium »

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Python for Data Science & AI · Blog 18 of 20 — CNNs for Image Classification
From filters to feature maps: building networks that actually see. Continue reading on Medium »

Medium · Programming
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Programación gráfica desde cero: una introducción a shaders, vértices y fragmentos
La programación gráfica permite crear imágenes mediante instrucciones ejecutadas por la tarjeta gráfica. Este campo no solo sirve para… Continue reading on Medi
Reddit r/learnprogramming
👁️ Computer Vision
⚡ AI Lesson
1mo ago
[Question] Need arrow dataset images for shape detection project
Hi everyone, I’m working on a shape detection project where the user draws on a whiteboard/canvas, and the system converts the drawing into a detected shape. Th

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Edge Detection From Scratch
If you would like to follow along you will need Google Colab or Jupyter Notebook and these libraries: Continue reading on Medium »

Dev.to · Eli
👁️ Computer Vision
⚡ AI Lesson
1mo ago
New Framework Adds 3D Awareness to Video Object Tracking
Researchers tackle fundamental gaps in motion detection by grounding segmentation in spatiotemporal coordinates rather than relying on pre-computed 2D approxima
Reddit r/MachineLearning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Query about non-archival workshop at CVPR-2026 [R]
My paper was recently accepted to a workshop at CVPR-2026 as non-archival acceptance. Is it mandatory for me to register to the conference as I won't be able to

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1mo ago
How Computer Vision Is Transforming Industries Around the World
Artificial Intelligence has made remarkable progress over the past decade, but one of the most impactful areas is computer vision. Continue reading on Medium »

Medium · Deep Learning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
How Computer Vision Is Transforming Industries Around the World
Artificial Intelligence has made remarkable progress over the past decade, but one of the most impactful areas is computer vision. Continue reading on Medium »

Medium · Deep Learning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
7 Things I Started Doing That Changed My Life as a Computer Engineer
A few years ago, I thought being a successful computer engineering student was all about writing code and getting good grades. Continue reading on Medium »
Medium · Programming
👁️ Computer Vision
⚡ AI Lesson
1mo ago
NVIDIA LocateAnything-3B : GoodBye YOLO Object Detection
How to use NVIDIA LocateAnything-3B ? Continue reading on Data Science in Your Pocket »

Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Free Restore Old Photos Using AI (Full Guide)
Preserving historical family memories or working with damaged digital archives often feels like a losing battle against time. Learning how… Continue reading on

Dev.to · HARSHA GOPALKRISHNA PURANIK
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Smart Face Recognition Attendance System — No More Proxy Attendance
What I Built I built Attendance Pro — a Smart Face Recognition Attendance System for...
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
Planning with the Views via Scene Self-Exploration
arXiv:2605.29563v1 Announce Type: new Abstract: Can VLMs predict how each camera move changes the view, and plan many such moves ahead? We call this capability
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
Learning Context-Conditioned Predicate Semantics via Prototype Feedback
arXiv:2605.29610v1 Announce Type: cross Abstract: In scene graph generation, a central challenge is modeling polysemous predicates whose meanings shift across c
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
Mitigating Hallucination in Vision-Language Models through Barrier-Regulated Adaptive Closed-form Steering
arXiv:2605.29881v1 Announce Type: cross Abstract: Large vision-language models (LVLMs) often hallucinate objects that are not present in the input image, largel
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
PhyGenHOI: Physically-Aware 4D Generation of Dynamic Human-Object Interactions
arXiv:2605.30268v1 Announce Type: cross Abstract: We address the task of generating physically accurate and visually faithful 4D Human-Object Interaction (HOI).
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
City-Mesh3R: Simulation-Ready City-Scale 3D Mesh Reconstruction from Multi-View Images
arXiv:2605.30310v1 Announce Type: cross Abstract: City-scale 3D surface reconstruction from multiview images for downstream 3D simulation, poses highly challeng
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
Before the Shutter: Aesthetic and Actionable Portrait Photography Planning in 3D Scenes
arXiv:2605.30318v1 Announce Type: cross Abstract: Portrait photography is largely decided before the shutter opens: the subject's pose, the camera configuration
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
arXiv:2505.21876v2 Announce Type: replace-cross Abstract: Recent approaches for video generation with camera control often create anchor videos (i.e., rendered
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
MOO: A Multi-view Oriented Observations Dataset for Viewpoint Analysis in Cattle Re-Identification
arXiv:2603.04314v2 Announce Type: replace-cross Abstract: Animal re-identification (ReID) faces critical challenges due to viewpoint variations, particularly in

Dev.to · Freddy Carrillo
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Centralized vs. Decentralized: Why Modern Collaborative Tools choose CRDTs
Real-time collaboration works like magic until two users edit the same line simultaneously. Under the...

Dev.to · CaraComp
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Biometrics' New Scoreboard: Seconds Saved, Not Match Scores
The shift toward friction-less biometric deployment For developers working in computer vision and...

Medium · Machine Learning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
The Building a Production AI Vision Pipeline: Lessons from InsightSnap
How I turned exhibition photos into professional intelligence reports — and what I learned about working with Vision Language Models in… Continue reading on Med
Reddit r/MachineLearning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
A new dataset with more that 100M hi-quality, curated images, with captions and meta data! [P]
Hello everyone. The new dataset is named MONET, is Apache 2.0 and available on HF: https://huggingface.co/datasets/jasperai/monet MONET is open, Apache 2.0-lice

Medium · Programming
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Binary to Decimal Conversion Explained — With the Fastest Free Converter Online
Everything a student, programmer, or network engineer needs to know about binary and decimal — plus a tool that shows every step of the… Continue reading on Med

Hackernoon
👁️ Computer Vision
⚡ AI Lesson
1mo ago
How We Built a Price Tag Recognition System in 2017 — Before It Was Cool
A story of cfans duct-taped to GPUs, neural network hallucinations, and what it actually takes to ship computer vision in production.
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
BlazeEdit: Generalist Image Editing on Mobile Devices with Image-to-Image Diffusion Models
arXiv:2605.28067v1 Announce Type: new Abstract: The remarkable generation quality of modern diffusion models often comes at the cost of massive parameter counts
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
Can Segmentation Models Understand the World? Towards Proactive Affordance Reasoning via Visual Chain-of-Thought
arXiv:2605.27764v1 Announce Type: cross Abstract: Recent segmentation models couple large language models (LLMs) with mask decoders to ground complex language e
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
Revisiting Change Detection Methods for their Application to Serac Fall Time-Lapse Monitoring
arXiv:2605.28100v1 Announce Type: cross Abstract: In an era where climate change aggravates environmental uncertainties, the identification and detection of eve
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
Mining Multi-Modality Spatio-Temporal Cues for Video Important Person Identification
arXiv:2605.28604v1 Announce Type: cross Abstract: Identifying key individuals in video scenes is essential for applications such as automated video editing and
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
RelaxFlow: Text-Driven Amodal 3D Generation
arXiv:2603.05425v2 Announce Type: replace-cross Abstract: Image-to-3D generation faces inherent semantic ambiguity under occlusion, where partial observation al

Medium · Data Science
👁️ Computer Vision
⚡ AI Lesson
1mo ago
The Robotics Interview Series: Part 2A
The Perception Concepts You Need Cold (Beyond CV Fundamentals) Continue reading on Medium »
Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Computer Vision Yolculuğu — Gün 9: Gesture Volume Control Sistemlerinde Test, Debug
Gerçek zamanlı Computer Vision projelerinde yalnızca çalışan bir sistem geliştirmek yeterli değildir. Önemli olan nokta; sistemin stabil… Continue reading on Me
Medium · Python
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Detecté 1.882
Detecté 1.882 plantas de maíz desde un dron con YOLOv8 — sin GPU, sin etiquetado manual, y con 100% de precisión en campo Continue reading on Medium »
Medium · Deep Learning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Detecté 1.882
Detecté 1.882 plantas de maíz desde un dron con YOLOv8 — sin GPU, sin etiquetado manual, y con 100% de precisión en campo Continue reading on Medium »
Reddit r/deeplearning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
CCTV Shoplifting Detection Dataset (Keypoints + VLM annotations) [Synthetic]
submitted by /u/MiserableDonkey1974 [link] [comments]
Reddit r/deeplearning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Pls suggest best resources to learn semantic segmentation
I want to learn it for road extraction....so please suggest the best resources submitted by /u/NoAnybody8034 [link]
Medium · Deep Learning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
From LeNet to ViT: The Evolution of Deep Learning Vision Architectures (And Why It’s Redefining…
A technical walkthrough of how convolutional networks gave way to Vision Transformers — and what that means for document intelligence. Continue reading on Mediu

Dev.to · Andrew Judd
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Less Than a Penny Per Document
People hear "I replaced my OCR pipeline with a vision model" and the first thing they ask about is cost. Fair question. I assumed it would be expensive too. Und
Reddit r/learnprogramming
👁️ Computer Vision
⚡ AI Lesson
1mo ago
Two Dimensional Transformation Visualiser
hey everyone, I’m a first-year BSc Mathematics student currently applying computer graphics and mathematical concepts in AI. to understand transformation matric

Medium · Machine Learning
👁️ Computer Vision
⚡ AI Lesson
1mo ago
From Computer Vision Demo to Real Industrial Tool
From Computer Vision Demo to Real Industrial Tool Continue reading on Medium »
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
AssetGen: Deployable 3D Asset Generation at Interactive Speed
arXiv:2605.26137v1 Announce Type: cross Abstract: While 3D generation is progressing rapidly, recent work has often focused on obtaining high-resolution assets,
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
1mo ago
E$^3$C: Video Generation with 3D Environmental Memory and Ego-Exo Human Pose Control
arXiv:2605.26316v1 Announce Type: cross Abstract: Controllable and physically grounded egocentric video generation is essential for embodied agents to reason ab
DeepCamp AI