Why Can't I Open My Drawer? Mitigating Object-Driven Shortcuts in Zero-Shot Compositional Action Recognition

📰 ArXiv cs.AI

Mitigating object-driven shortcuts in zero-shot compositional action recognition to improve model performance

advanced Published 8 Apr 2026
Action Steps
  1. Identify sparse compositional supervision as a potential cause of object-driven shortcuts
  2. Recognize verb-object learning asymmetry as a factor contributing to shortcut learning
  3. Develop strategies to mitigate object-driven shortcuts, such as using temporal evidence instead of relying on labeled object classes
  4. Implement and evaluate these strategies in zero-shot compositional action recognition models
Who Needs to Know This

Machine learning researchers and engineers working on action recognition tasks can benefit from this research to improve their models' accuracy and robustness

Key Insight

💡 Object-driven shortcuts can hinder model performance in zero-shot compositional action recognition, and addressing sparse compositional supervision and verb-object learning asymmetry can help

Share This
🤖 Mitigating object-driven shortcuts in zero-shot action recognition #AI #ML
Read full paper → ← Back to Reads