8,253 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 8,253 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (21843) ArXiv cs.AIDev.to AIMedium · AIMedium · ProgrammingForbes InnovationMedium · Machine Learning
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Beyond Static Visual Tokens: Structured Sequential Visual Chain-of-Thought Reasoning
arXiv:2603.26737v1 Announce Type: cross Abstract: Current multimodal LLMs encode images as static visual prefixes and rely on text-based reasoning, lacking goal
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
SleepVLM: Explainable and Rule-Grounded Sleep Staging via a Vision-Language Model
arXiv:2603.26738v1 Announce Type: cross Abstract: While automated sleep staging has achieved expert-level accuracy, its clinical adoption is hindered by a lack
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Quantum Fuzzy Sets Revisited: Density Matrices, Decoherence, and the Q-Matrix Framework
arXiv:2603.26739v1 Announce Type: cross Abstract: In 2006 we proposed Quantum Fuzzy Sets, observing that states of a quantum register could serve as characteris
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Language-Conditioned World Modeling for Visual Navigation
arXiv:2603.26741v1 Announce Type: cross Abstract: We study language-conditioned visual navigation (LCVN), in which an embodied agent is asked to follow a natura
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Steering Sparse Autoencoder Latents to Control Dynamic Head Pruning in Vision Transformers (Student Abstract)
arXiv:2603.26743v1 Announce Type: cross Abstract: Dynamic head pruning in Vision Transformers (ViTs) improves efficiency by removing redundant attention heads,
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago
LARD 2.0: Enhanced Datasets and Benchmarking for Autonomous Landing Systems
arXiv:2603.26748v1 Announce Type: cross Abstract: This paper addresses key challenges in the development of autonomous landing systems, focusing on dataset limi
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago
Training-Free Diffusion-Driven Modeling of Pareto Set Evolution for Dynamic Multiobjective Optimization
arXiv:2603.26749v1 Announce Type: cross Abstract: Dynamic multiobjective optimization problems (DMOPs) feature time-varying objectives, which cause the Pareto o
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1mo ago
Generating Synthetic Wildlife Health Data from Camera Trap Imagery: A Pipeline for Alopecia and Body Condition Training Data
arXiv:2603.26754v1 Announce Type: cross Abstract: No publicly available, ML ready datasets exist for wildlife health conditions in camera trap imagery, creating
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1mo ago
Tiny-ViT: A Compact Vision Transformer for Efficient and Explainable Potato Leaf Disease Classification
arXiv:2603.26761v1 Announce Type: cross Abstract: Early and precise identification of plant diseases, especially in potato crops is important to ensure the heal
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1mo ago
Aesthetic Assessment of Chinese Handwritings Based on Vision Language Models
arXiv:2603.26768v1 Announce Type: cross Abstract: The handwriting of Chinese characters is a fundamental aspect of learning the Chinese language. Previous autom
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Edge Reliability Gap in Vision-Language Models: Quantifying Failure Modes of Compressed VLMs Under Visual Corruption
arXiv:2603.26769v1 Announce Type: cross Abstract: The rapid compression of large vision-language models (VLMs) for edge deployment raises an underexplored quest
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
From Content to Audience: A Multimodal Annotation Framework for Broadcast Television Analytics
arXiv:2603.26772v1 Announce Type: cross Abstract: Automated semantic annotation of broadcast television content presents distinctive challenges, combining struc
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Learning to Select Visual In-Context Demonstrations
arXiv:2603.26775v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) adapt to visual tasks via in-context learning (ICL), which relies hea
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
TED: Training-Free Experience Distillation for Multimodal Reasoning
arXiv:2603.26778v1 Announce Type: cross Abstract: Knowledge distillation is typically realized by transferring a teacher model's knowledge into a student's para
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Limits of Imagery Reasoning in Frontier LLM Models
arXiv:2603.26779v1 Announce Type: cross Abstract: Large Language Models (LLMs) have demonstrated impressive reasoning capabilities, yet they struggle with spati
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Can We Change the Stroke Size for Easier Diffusion?
arXiv:2603.26783v1 Announce Type: cross Abstract: Diffusion models can be challenged in the low signal-to-noise regime, where they have to make pixel-level pred
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
A Step Toward Federated Pretraining of Multimodal Large Language Models
arXiv:2603.26786v1 Announce Type: cross Abstract: The rapid evolution of Multimodal Large Language Models (MLLMs) is bottlenecked by the saturation of high-qual
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
CRISP: Characterizing Relative Impact of Scholarly Publications
arXiv:2603.26791v1 Announce Type: cross Abstract: Assessing a cited paper's impact is typically done by analyzing its citation context in isolation within the c
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago
A Firefly Algorithm for Mixed-Variable Optimization Based on Hybrid Distance Modeling
arXiv:2603.26792v1 Announce Type: cross Abstract: Several real-world optimization problems involve mixed-variable search spaces, where continuous, ordinal, and
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1mo ago
PhyDCM: A Reproducible Open-Source Framework for AI-Assisted Brain Tumor Classification from Multi-Sequence MRI
arXiv:2603.26794v1 Announce Type: cross Abstract: MRI-based medical imaging has become indispensable in modern clinical diagnosis, particularly for brain tumor