1,258 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 1,258 articles · Updated every 3 hours · View all news

All ⚡ AI Lessons (4907) ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIWeaviate Blog
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?
arXiv:2603.25823v1 Announce Type: cross Abstract: Beneath the stunning visual fidelity of modern AIGC models lies a "logical desert", where systems fail tasks t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
A Compression Perspective on Simplicity Bias
arXiv:2603.25839v1 Announce Type: cross Abstract: Deep neural networks exhibit a simplicity bias, a well-documented tendency to favor simple functions over comp
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
GazeQwen: Lightweight Gaze-Conditioned LLM Modulation for Streaming Video Understanding
arXiv:2603.25841v1 Announce Type: cross Abstract: Current multimodal large language models (MLLMs) cannot effectively utilize eye-gaze information for video und
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
Why Safety Probes Catch Liars But Miss Fanatics
arXiv:2603.25861v1 Announce Type: cross Abstract: Activation-based probes have emerged as a promising approach for detecting deceptively aligned AI systems by i
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 8h ago
Methods for Knowledge Graph Construction from Text Collections: Development and Applications
arXiv:2603.25862v1 Announce Type: cross Abstract: Virtually every sector of society is experiencing a dramatic growth in the volume of unstructured textual data
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 8h ago
Dynamic LIBRAS Gesture Recognition via CNN over Spatiotemporal Matrix Representation
arXiv:2603.25863v1 Announce Type: cross Abstract: This paper proposes a method for dynamic hand gesture recognition based on the composition of two models: the
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks
arXiv:2603.25864v1 Announce Type: cross Abstract: Graphical User Interface (GUI) agents have the potential to assist users in interacting with complex software
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 8h ago
Spectral Coherence Index: A Model-Free Metric for Protein Structural Ensemble Quality Assessment
arXiv:2603.25880v1 Announce Type: cross Abstract: Protein structural ensembles from NMR spectroscopy capture biologically important conformational heterogeneity
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
On Integrating Resilience and Human Oversight into LLM-Assisted Modeling Workflows for Digital Twins
arXiv:2603.25898v1 Announce Type: cross Abstract: LLM-assisted modeling holds the potential to rapidly build executable Digital Twins of complex systems from on
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 8h ago
Decoding Defensive Coverage Responsibilities in American Football Using Factorized Attention Based Transformer Models
arXiv:2603.25901v1 Announce Type: cross Abstract: Defensive coverage schemes in the National Football League (NFL) represent complex tactical patterns requiring
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
Good Scores, Bad Data: A Metric for Multimodal Coherence
arXiv:2603.25924v1 Announce Type: cross Abstract: Multimodal AI systems are evaluated by downstream task accuracy, but high accuracy does not mean the underlyin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
DiReCT: Disentangled Regularization of Contrastive Trajectories for Physics-Refined Video Generation
arXiv:2603.25931v1 Announce Type: cross Abstract: Flow-matching video generators produce temporally coherent, high-fidelity outputs yet routinely violate elemen
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 8h ago
DenseSwinV2: Channel Attentive Dual Branch CNN Transformer Learning for Cassava Leaf Disease Classification
arXiv:2603.25935v1 Announce Type: cross Abstract: This work presents a new Hybrid Dense SwinV2, a two-branch framework that jointly leverages densely connected
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
Reinforcing Structured Chain-of-Thought for Video Understanding
arXiv:2603.25942v1 Announce Type: cross Abstract: Multi-modal Large Language Models (MLLMs) show promise in video understanding. However, their reasoning often
ArXiv cs.AI 📄 Paper 8h ago
Can Small Models Reason About Legal Documents? A Comparative Study
arXiv:2603.25944v1 Announce Type: cross Abstract: Large language models show promise for legal applications, but deploying frontier models raises concerns about
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 8h ago
Collision-Aware Vision-Language Learning for End-to-End Driving with Multimodal Infraction Datasets
arXiv:2603.25946v1 Announce Type: cross Abstract: High infraction rates remain the primary bottleneck for end-to-end (E2E) autonomous driving, as evidenced by t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models
arXiv:2603.25960v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed in medical settings, yet their sensitivity to prompt fo
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 8h ago
Do Neurons Dream of Primitive Operators? Wake-Sleep Compression Rediscovers Schank's Event Semantics
arXiv:2603.25975v1 Announce Type: cross Abstract: We show that they do. Schank's conceptual dependency theory proposed that all events decompose into primitive
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
Policy-Guided World Model Planning for Language-Conditioned Visual Navigation
arXiv:2603.25981v1 Announce Type: cross Abstract: Navigating to a visually specified goal given natural language instructions remains a fundamental challenge in
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 8h ago
Longitudinal Boundary Sharpness Coefficient Slopes Predict Time to Alzheimer's Disease Conversion in Mild Cognitive Impairment: A Survival Analysis Using the ADNI Cohort
arXiv:2603.26007v1 Announce Type: cross Abstract: Predicting whether someone with mild cognitive impairment (MCI) will progress to Alzheimer's disease (AD) is c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 8h ago
FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants
arXiv:2603.26008v1 Announce Type: cross Abstract: While powerful in image-conditioned generation, multimodal large language models (MLLMs) can display uneven pe
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 8h ago
VLAgeBench: Benchmarking Large Vision-Language Models for Zero-Shot Human Age Estimation
arXiv:2603.26015v1 Announce Type: cross Abstract: Human age estimation from facial images represents a challenging computer vision task with significant applica
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 8h ago
Unlabeled Cross-Center Automatic Analysis for TAAD: An Integrated Framework from Segmentation to Clinical Features
arXiv:2603.26019v1 Announce Type: cross Abstract: Type A Aortic Dissection (TAAD) is a life-threatening cardiovascular emergency that demands rapid and precise
ArXiv cs.AI 🖌️ UI/UX Design 📄 Paper ⚡ AI Lesson 8h ago
Designing Fatigue-Aware VR Interfaces via Biomechanical Models
arXiv:2603.26031v1 Announce Type: cross Abstract: Prolonged mid-air interaction in virtual reality (VR) causes arm fatigue and discomfort, negatively affecting