Prototype-Grounded Concept Models for Verifiable Concept Alignment

📰 ArXiv cs.AI

arXiv:2604.16076v1 Announce Type: cross Abstract: Concept Bottleneck Models (CBMs) aim to improve interpretability in Deep Learning by structuring predictions through human-understandable concepts, but they provide no way to verify whether learned concepts align with the human's intended meaning, hurting interpretability. We introduce Prototype-Grounded Concept Models (PGCMs), which ground concepts in learned visual prototypes: image parts that serve as explicit evidence for the concepts. This g

Published 20 Apr 2026
Read full paper → ← Back to Reads