📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 7,014 articles · Updated every 3 hours

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
AI-Driven Modular Services for Accessible Multilingual Education in Immersive Extended Reality Settings: Integrating Speech Processing, Translation, and Sign Language Rendering
arXiv:2604.05591v1 Announce Type: cross Abstract: This work introduces a modular platform that brings together six AI services, automatic speech recognition via
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
INTERACT: An AI-Driven Extended Reality Framework for Accessible Communication Featuring Real-Time Sign Language Interpretation and Emotion Recognition
arXiv:2604.05605v1 Announce Type: cross Abstract: Video conferencing has become central to professional collaboration, yet most platforms offer limited support
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2w ago
Evaluation of Randomization through Style Transfer for Enhanced Domain Generalization
arXiv:2604.05616v1 Announce Type: cross Abstract: Deep learning models for computer vision often suffer from poor generalization when deployed in real-world set
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2w ago
Semantic-Topological Graph Reasoning for Language-Guided Pulmonary Screening
arXiv:2604.05620v1 Announce Type: cross Abstract: Medical image segmentation driven by free-text clinical instructions is a critical frontier in computer-aided
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Analogical Reasoning as a Doctor: A Foundation Model for Gastrointestinal Endoscopy Diagnosis
arXiv:2604.05649v1 Announce Type: cross Abstract: Gastrointestinal diseases impose a growing global health burden, and endoscopy is a primary tool for early dia
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Multiscale Physics-Informed Neural Network for Complex Fluid Flows with Long-Range Dependencies
arXiv:2604.05652v1 Announce Type: cross Abstract: Fluid flows are governed by the nonlinear Navier-Stokes equations, which can manifest multiscale dynamics even
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals
arXiv:2604.05655v1 Announce Type: cross Abstract: This work characterizes large language models' chain-of-thought generation as a structured trajectory through
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
SnapFlow: One-Step Action Generation for Flow-Matching VLAs via Progressive Self-Distillation
arXiv:2604.05656v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models based on flow matching -- such as pi0, pi0.5, and SmolVLA -- achieve state
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Rectified Schrödinger Bridge Matching for Few-Step Visual Navigation
arXiv:2604.05673v1 Announce Type: cross Abstract: Visual navigation is a core challenge in Embodied AI, requiring autonomous agents to translate high-dimensiona
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
From Incomplete Architecture to Quantified Risk: Multimodal LLM-Driven Security Assessment for Cyber-Physical Systems
arXiv:2604.05674v1 Announce Type: cross Abstract: Cyber-physical systems often contend with incomplete architectural documentation or outdated information resul
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion
arXiv:2604.05688v1 Announce Type: cross Abstract: Key-Value (KV) cache memory and bandwidth increasingly dominate large language model inference cost in long-co
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 2w ago
CRFT: Consistent-Recurrent Feature Flow Transformer for Cross-Modal Image Registration
arXiv:2604.05689v1 Announce Type: cross Abstract: We present Consistent-Recurrent Feature Flow Transformer (CRFT), a unified coarse-to-fine framework based on f
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
SemLink: A Semantic-Aware Automated Test Oracle for Hyperlink Verification using Siamese Sentence-BERT
arXiv:2604.05711v1 Announce Type: cross Abstract: Web applications rely heavily on hyperlinks to connect disparate information resources. However, the dynamic n
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Hackers or Hallucinators? A Comprehensive Analysis of LLM-Based Automated Penetration Testing
arXiv:2604.05719v1 Announce Type: cross Abstract: The rapid advancement of Large Language Models (LLMs) has created new opportunities for Automated Penetration
ArXiv cs.AI 💻 AI-Assisted Coding 📄 Paper ⚡ AI Lesson 2w ago
On the Robustness of Diffusion-Based Image Compression to Bit-Flip Errors
arXiv:2604.05743v1 Announce Type: cross Abstract: Modern image compression methods are typically optimized for the rate-distortion-perception trade-off, where
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
CAKE: Cloud Architecture Knowledge Evaluation of Large Language Models
arXiv:2604.05755v1 Announce Type: cross Abstract: In today's software architecture, large language models (LLMs) serve as software architecture co-pilots. Howev
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
What Models Know, How Well They Know It: Knowledge-Weighted Fine-Tuning for Learning When to Say "I Don't Know"
arXiv:2604.05779v1 Announce Type: cross Abstract: While large language models (LLMs) demonstrate strong capabilities across diverse user queries, they still suf
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
"OK Aura, Be Fair With Me": Demographics-Agnostic Training for Bias Mitigation in Wake-up Word Detection
arXiv:2604.05830v1 Announce Type: cross Abstract: Voice-based interfaces are widely used; however, achieving fair Wake-up Word detection across diverse speaker
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
EEG-MFTNet: An Enhanced EEGNet Architecture with Multi-Scale Temporal Convolutions and Transformer Fusion for Cross-Session Motor Imagery Decoding
arXiv:2604.05843v1 Announce Type: cross Abstract: Brain-computer interfaces (BCIs) enable direct communication between the brain and external devices, providing
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
Evaluating Learner Representations for Differentiation Prior to Instructional Outcomes
arXiv:2604.05848v1 Announce Type: cross Abstract: Learner representations play a central role in educational AI systems, yet it is often unclear whether they pr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Neural Network Pruning via QUBO Optimization
arXiv:2604.05856v1 Announce Type: cross Abstract: Neural network pruning can be formulated as a combinatorial optimization problem, yet most existing approaches
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Swiss-Bench 003: Evaluating LLM Reliability and Adversarial Security for Swiss Regulatory Contexts
arXiv:2604.05872v1 Announce Type: cross Abstract: The deployment of large language models (LLMs) in Swiss financial and regulatory contexts demands empirical ev
ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 2w ago
Automatic dental superimposition of 3D intraorals and 2D photographs for human identification
arXiv:2604.05877v1 Announce Type: cross Abstract: Dental comparison is considered a primary identification method, at the level of fingerprints and DNA profilin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Selective Aggregation of Attention Maps Improves Diffusion-Based Visual Interpretation
arXiv:2604.05906v1 Announce Type: cross Abstract: Numerous studies on text-to-image (T2I) generative models have utilized cross-attention maps to boost applicat