3,299 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,299 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (12148) ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Unifying VLM-Guided Flow Matching and Spectral Anomaly Detection for Interpretable Veterinary Diagnosis
arXiv:2604.05482v1 Announce Type: cross Abstract: Automatic diagnosis of canine pneumothorax is challenged by data scarcity and the need for trustworthy models.
ArXiv cs.AI 🛠️ AI Tools & Apps 📄 Paper ⚡ AI Lesson 1w ago
Learned Elevation Models as a Lightweight Alternative to LiDAR for Radio Environment Map Estimation
arXiv:2604.05520v1 Announce Type: cross Abstract: Next-generation wireless systems such as 6G operate at higher frequency bands, making signal propagation highl
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
Controllable Singing Style Conversion with Boundary-Aware Information Bottleneck
arXiv:2604.05526v1 Announce Type: cross Abstract: This paper presents the submission of the S4 team to the Singing Voice Conversion Challenge 2025 (SVCC2025)-a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Turbulence-like 5/3 spectral scaling in contextual representations of language as a complex system
arXiv:2604.05536v1 Announce Type: cross Abstract: Natural language is a complex system that exhibits robust statistical regularities. Here, we represent text as
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
FastDiSS: Few-step Match Many-step Diffusion Language Model on Sequence-to-Sequence Generation--Full Version
arXiv:2604.05551v1 Announce Type: cross Abstract: Self-conditioning has been central to the success of continuous diffusion language models, as it allows models
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue
arXiv:2604.05552v1 Announce Type: cross Abstract: Large Language Models demonstrate outstanding performance in many language tasks but still face fundamental ch
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
Foundations for Agentic AI Investigations from the Forensic Analysis of OpenClaw
arXiv:2604.05589v1 Announce Type: cross Abstract: Agentic Al systems are increasingly deployed as personal assistants and are likely to become a common object o
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
AI-Driven Modular Services for Accessible Multilingual Education in Immersive Extended Reality Settings: Integrating Speech Processing, Translation, and Sign Language Rendering
arXiv:2604.05591v1 Announce Type: cross Abstract: This work introduces a modular platform that brings together six AI services, automatic speech recognition via
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
INTERACT: An AI-Driven Extended Reality Framework for Accesible Communication Featuring Real-Time Sign Language Interpretation and Emotion Recognition
arXiv:2604.05605v1 Announce Type: cross Abstract: Video conferencing has become central to professional collaboration, yet most platforms offer limited support
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1w ago
Evaluation of Randomization through Style Transfer for Enhanced Domain Generalization
arXiv:2604.05616v1 Announce Type: cross Abstract: Deep learning models for computer vision often suffer from poor generalization when deployed in real-world set
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 1w ago
Semantic-Topological Graph Reasoning for Language-Guided Pulmonary Screening
arXiv:2604.05620v1 Announce Type: cross Abstract: Medical image segmentation driven by free-text clinical instructions is a critical frontier in computer-aided
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Analogical Reasoning as a Doctor: A Foundation Model for Gastrointestinal Endoscopy Diagnosis
arXiv:2604.05649v1 Announce Type: cross Abstract: Gastrointestinal diseases impose a growing global health burden, and endoscopy is a primary tool for early dia
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Multiscale Physics-Informed Neural Network for Complex Fluid Flows with Long-Range Dependencies
arXiv:2604.05652v1 Announce Type: cross Abstract: Fluid flows are governed by the nonlinear Navier-Stokes equations, which can manifest multiscale dynamics even
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals
arXiv:2604.05655v1 Announce Type: cross Abstract: This work characterizes large language models' chain-of-thought generation as a structured trajectory through
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
SnapFlow: One-Step Action Generation for Flow-Matching VLAs via Progressive Self-Distillation
arXiv:2604.05656v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models based on flow matching -- such as pi0, pi0.5, and SmolVLA -- achieve state
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Rectified Schr\"odinger Bridge Matching for Few-Step Visual Navigation
arXiv:2604.05673v1 Announce Type: cross Abstract: Visual navigation is a core challenge in Embodied AI, requiring autonomous agents to translate high-dimensiona
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
From Incomplete Architecture to Quantified Risk: Multimodal LLM-Driven Security Assessment for Cyber-Physical Systems
arXiv:2604.05674v1 Announce Type: cross Abstract: Cyber-physical systems often contend with incomplete architectural documentation or outdated information resul
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion
arXiv:2604.05688v1 Announce Type: cross Abstract: Key-Value (KV) cache memory and bandwidth increasingly dominate large language model inference cost in long-co
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1w ago
CRFT: Consistent-Recurrent Feature Flow Transformer for Cross-Modal Image Registration
arXiv:2604.05689v1 Announce Type: cross Abstract: We present Consistent-Recurrent Feature Flow Transformer (CRFT), a unified coarse-to-fine framework based on f
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1w ago
SemLink: A Semantic-Aware Automated Test Oracle for Hyperlink Verification using Siamese Sentence-BERT
arXiv:2604.05711v1 Announce Type: cross Abstract: Web applications rely heavily on hyperlinks to connect disparate information resources. However, the dynamic n