📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 8,253 articles · Updated every 3 hours

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago

Multi-view Graph Convolutional Network with Fully Leveraging Consistency via Granular-ball-based Topology Construction, Feature Enhancement and Interactive Fusion

arXiv:2603.26729v1 Announce Type: cross Abstract: The effective utilization of consistency is crucial for multi-view learning. GCNs leverage node connections to

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Contextual inference from single objects in Vision-Language models

arXiv:2603.26731v1 Announce Type: cross Abstract: How much scene context a single object carries is a well-studied question in human scene perception, yet how t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Distilled Large Language Model-Driven Dynamic Sparse Expert Activation Mechanism

arXiv:2603.26735v1 Announce Type: cross Abstract: High inter-class similarity, extreme scale variation, and limited computational budgets hinder reliable visual

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1mo ago

Ordinal Semantic Segmentation Applied to Medical and Odontological Images

arXiv:2603.26736v1 Announce Type: cross Abstract: Semantic segmentation consists of assigning a semantic label to each pixel according to predefined classes. Th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Beyond Static Visual Tokens: Structured Sequential Visual Chain-of-Thought Reasoning

arXiv:2603.26737v1 Announce Type: cross Abstract: Current multimodal LLMs encode images as static visual prefixes and rely on text-based reasoning, lacking goal

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

SleepVLM: Explainable and Rule-Grounded Sleep Staging via a Vision-Language Model

arXiv:2603.26738v1 Announce Type: cross Abstract: While automated sleep staging has achieved expert-level accuracy, its clinical adoption is hindered by a lack

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Quantum Fuzzy Sets Revisited: Density Matrices, Decoherence, and the Q-Matrix Framework

arXiv:2603.26739v1 Announce Type: cross Abstract: In 2006 we proposed Quantum Fuzzy Sets, observing that states of a quantum register could serve as characteris

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Language-Conditioned World Modeling for Visual Navigation

arXiv:2603.26741v1 Announce Type: cross Abstract: We study language-conditioned visual navigation (LCVN), in which an embodied agent is asked to follow a natura

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Steering Sparse Autoencoder Latents to Control Dynamic Head Pruning in Vision Transformers (Student Abstract)

arXiv:2603.26743v1 Announce Type: cross Abstract: Dynamic head pruning in Vision Transformers (ViTs) improves efficiency by removing redundant attention heads,

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

LARD 2.0: Enhanced Datasets and Benchmarking for Autonomous Landing Systems

arXiv:2603.26748v1 Announce Type: cross Abstract: This paper addresses key challenges in the development of autonomous landing systems, focusing on dataset limi

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

Training-Free Diffusion-Driven Modeling of Pareto Set Evolution for Dynamic Multiobjective Optimization

arXiv:2603.26749v1 Announce Type: cross Abstract: Dynamic multiobjective optimization problems (DMOPs) feature time-varying objectives, which cause the Pareto o

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1mo ago

Generating Synthetic Wildlife Health Data from Camera Trap Imagery: A Pipeline for Alopecia and Body Condition Training Data

arXiv:2603.26754v1 Announce Type: cross Abstract: No publicly available, ML ready datasets exist for wildlife health conditions in camera trap imagery, creating

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1mo ago

Tiny-ViT: A Compact Vision Transformer for Efficient and Explainable Potato Leaf Disease Classification

arXiv:2603.26761v1 Announce Type: cross Abstract: Early and precise identification of plant diseases, especially in potato crops, is important to ensure the heal

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1mo ago

Aesthetic Assessment of Chinese Handwritings Based on Vision Language Models

arXiv:2603.26768v1 Announce Type: cross Abstract: The handwriting of Chinese characters is a fundamental aspect of learning the Chinese language. Previous autom

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Edge Reliability Gap in Vision-Language Models: Quantifying Failure Modes of Compressed VLMs Under Visual Corruption

arXiv:2603.26769v1 Announce Type: cross Abstract: The rapid compression of large vision-language models (VLMs) for edge deployment raises an underexplored quest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Content to Audience: A Multimodal Annotation Framework for Broadcast Television Analytics

arXiv:2603.26772v1 Announce Type: cross Abstract: Automated semantic annotation of broadcast television content presents distinctive challenges, combining struc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Learning to Select Visual In-Context Demonstrations

arXiv:2603.26775v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) adapt to visual tasks via in-context learning (ICL), which relies hea

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

TED: Training-Free Experience Distillation for Multimodal Reasoning

arXiv:2603.26778v1 Announce Type: cross Abstract: Knowledge distillation is typically realized by transferring a teacher model's knowledge into a student's para

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Limits of Imagery Reasoning in Frontier LLM Models

arXiv:2603.26779v1 Announce Type: cross Abstract: Large Language Models (LLMs) have demonstrated impressive reasoning capabilities, yet they struggle with spati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Can We Change the Stroke Size for Easier Diffusion?

arXiv:2603.26783v1 Announce Type: cross Abstract: Diffusion models can be challenged in the low signal-to-noise regime, where they have to make pixel-level pred

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Step Toward Federated Pretraining of Multimodal Large Language Models

arXiv:2603.26786v1 Announce Type: cross Abstract: The rapid evolution of Multimodal Large Language Models (MLLMs) is bottlenecked by the saturation of high-qual

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CRISP: Characterizing Relative Impact of Scholarly Publications

arXiv:2603.26791v1 Announce Type: cross Abstract: Assessing a cited paper's impact is typically done by analyzing its citation context in isolation within the c

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

A Firefly Algorithm for Mixed-Variable Optimization Based on Hybrid Distance Modeling

arXiv:2603.26792v1 Announce Type: cross Abstract: Several real-world optimization problems involve mixed-variable search spaces, where continuous, ordinal, and

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago

PhyDCM: A Reproducible Open-Source Framework for AI-Assisted Brain Tumor Classification from Multi-Sequence MRI

arXiv:2603.26794v1 Announce Type: cross Abstract: MRI-based medical imaging has become indispensable in modern clinical diagnosis, particularly for brain tumor