📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 7,014 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (19189) ArXiv cs.AI Dev.to AI Dev.to · FORUM WEB Forbes Innovation Medium · Programming Medium · AI

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

AI-Driven Modular Services for Accessible Multilingual Education in Immersive Extended Reality Settings: Integrating Speech Processing, Translation, and Sign Language Rendering

arXiv:2604.05591v1 Announce Type: cross Abstract: This work introduces a modular platform that brings together six AI services, automatic speech recognition via

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago

INTERACT: An AI-Driven Extended Reality Framework for Accesible Communication Featuring Real-Time Sign Language Interpretation and Emotion Recognition

arXiv:2604.05605v1 Announce Type: cross Abstract: Video conferencing has become central to professional collaboration, yet most platforms offer limited support

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2w ago

Evaluation of Randomization through Style Transfer for Enhanced Domain Generalization

arXiv:2604.05616v1 Announce Type: cross Abstract: Deep learning models for computer vision often suffer from poor generalization when deployed in real-world set

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2w ago

Semantic-Topological Graph Reasoning for Language-Guided Pulmonary Screening

arXiv:2604.05620v1 Announce Type: cross Abstract: Medical image segmentation driven by free-text clinical instructions is a critical frontier in computer-aided

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Analogical Reasoning as a Doctor: A Foundation Model for Gastrointestinal Endoscopy Diagnosis

arXiv:2604.05649v1 Announce Type: cross Abstract: Gastrointestinal diseases impose a growing global health burden, and endoscopy is a primary tool for early dia

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Multiscale Physics-Informed Neural Network for Complex Fluid Flows with Long-Range Dependencies

arXiv:2604.05652v1 Announce Type: cross Abstract: Fluid flows are governed by the nonlinear Navier-Stokes equations, which can manifest multiscale dynamics even

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals

arXiv:2604.05655v1 Announce Type: cross Abstract: This work characterizes large language models' chain-of-thought generation as a structured trajectory through

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago

SnapFlow: One-Step Action Generation for Flow-Matching VLAs via Progressive Self-Distillation

arXiv:2604.05656v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models based on flow matching -- such as pi0, pi0.5, and SmolVLA -- achieve state

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Rectified Schr\"odinger Bridge Matching for Few-Step Visual Navigation

arXiv:2604.05673v1 Announce Type: cross Abstract: Visual navigation is a core challenge in Embodied AI, requiring autonomous agents to translate high-dimensiona

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

From Incomplete Architecture to Quantified Risk: Multimodal LLM-Driven Security Assessment for Cyber-Physical Systems

arXiv:2604.05674v1 Announce Type: cross Abstract: Cyber-physical systems often contend with incomplete architectural documentation or outdated information resul

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion

arXiv:2604.05688v1 Announce Type: cross Abstract: Key-Value (KV) cache memory and bandwidth increasingly dominate large language model inference cost in long-co

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 2w ago

CRFT: Consistent-Recurrent Feature Flow Transformer for Cross-Modal Image Registration

arXiv:2604.05689v1 Announce Type: cross Abstract: We present Consistent-Recurrent Feature Flow Transformer (CRFT), a unified coarse-to-fine framework based on f

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago

SemLink: A Semantic-Aware Automated Test Oracle for Hyperlink Verification using Siamese Sentence-BERT

arXiv:2604.05711v1 Announce Type: cross Abstract: Web applications rely heavily on hyperlinks to connect disparate information resources. However, the dynamic n

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Hackers or Hallucinators? A Comprehensive Analysis of LLM-Based Automated Penetration Testing

arXiv:2604.05719v1 Announce Type: cross Abstract: The rapid advancement of Large Language Models (LLMs) has created new opportunities for Automated Penetration

ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2w ago

On the Robustness of Diffusion-Based Image Compression to Bit-Flip Errors

arXiv:2604.05743v1 Announce Type: cross Abstract: Modern image compression methods are typically optimized for the rate--distortion--perception trade-off, where

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

CAKE: Cloud Architecture Knowledge Evaluation of Large Language Models

arXiv:2604.05755v1 Announce Type: cross Abstract: In today's software architecture, large language models (LLMs) serve as software architecture co-pilots. Howev

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

What Models Know, How Well They Know It: Knowledge-Weighted Fine-Tuning for Learning When to Say "I Don't Know"

arXiv:2604.05779v1 Announce Type: cross Abstract: While large language models (LLMs) demonstrate strong capabilities across diverse user queries, they still suf

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago

"OK Aura, Be Fair With Me": Demographics-Agnostic Training for Bias Mitigation in Wake-up Word Detection

arXiv:2604.05830v1 Announce Type: cross Abstract: Voice-based interfaces are widely used; however, achieving fair Wake-up Word detection across diverse speaker

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

EEG-MFTNet: An Enhanced EEGNet Architecture with Multi-Scale Temporal Convolutions and Transformer Fusion for Cross-Session Motor Imagery Decoding

arXiv:2604.05843v1 Announce Type: cross Abstract: Brain-computer interfaces (BCIs) enable direct communication between the brain and external devices, providing

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago

Evaluating Learner Representations for Differentiation Prior to Instructional Outcomes

arXiv:2604.05848v1 Announce Type: cross Abstract: Learner representations play a central role in educational AI systems, yet it is often unclear whether they pr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Neural Network Pruning via QUBO Optimization

arXiv:2604.05856v1 Announce Type: cross Abstract: Neural network pruning can be formulated as a combinatorial optimization problem, yet most existing approaches

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Swiss-Bench 003: Evaluating LLM Reliability and Adversarial Security for Swiss Regulatory Contexts

arXiv:2604.05872v1 Announce Type: cross Abstract: The deployment of large language models (LLMs) in Swiss financial and regulatory contexts demands empirical ev

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago

Automatic dental superimposition of 3D intraorals and 2D photographs for human identification

arXiv:2604.05877v1 Announce Type: cross Abstract: Dental comparison is considered a primary identification method, at the level of fingerprints and DNA profilin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Selective Aggregation of Attention Maps Improves Diffusion-Based Visual Interpretation

arXiv:2604.05906v1 Announce Type: cross Abstract: Numerous studies on text-to-image (T2I) generative models have utilized cross-attention maps to boost applicat