5,060 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (12729) ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog
ArXiv cs.AI 📄 Paper 2d ago
ARGen: Affect-Reinforced Generative Augmentation towards Vision-based Dynamic Emotion Perception
arXiv:2604.12255v1 Announce Type: cross Abstract: Dynamic facial expression recognition in the wild remains challenging due to data scarcity and long-tail distr
ArXiv cs.AI 📄 Paper 2d ago
Coding-Free and Privacy-Preserving MCP Framework for Clinical Agentic Research Intelligence System
arXiv:2604.12258v1 Announce Type: cross Abstract: Clinical research involves labor-intensive processes such as study design, cohort construction, model developm
ArXiv cs.AI 📄 Paper 2d ago
CascadeDebate: Multi-Agent Deliberation for Cost-Aware LLM Cascades
arXiv:2604.12262v1 Announce Type: cross Abstract: Cascaded LLM systems coordinate models of varying sizes with human experts to balance accuracy, cost, and abst
ArXiv cs.AI 📄 Paper 2d ago
MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer
arXiv:2604.12281v1 Announce Type: cross Abstract: Style transfer aims to render a content image with the visual characteristics of a reference style while prese
ArXiv cs.AI 📄 Paper 2d ago
Local-Splitter: A Measurement Study of Seven Tactics for Reducing Cloud LLM Token Usage on Coding-Agent Workloads
arXiv:2604.12301v1 Announce Type: cross Abstract: We present a systematic measurement study of seven tactics for reducing cloud LLM token usage when a small loc
ArXiv cs.AI 📄 Paper 2d ago
GCA Framework: A Gulf-Grounded Dataset and Agentic Pipeline for Climate Decision Support
arXiv:2604.12306v1 Announce Type: cross Abstract: Climate decision-making in the Gulf increasingly demands systems that can translate heterogeneous scientific a
ArXiv cs.AI 📄 Paper 2d ago
Is Vibe Coding the Future? An Empirical Assessment of LLM Generated Codes for Construction Safety
arXiv:2604.12311v1 Announce Type: cross Abstract: The emergence of vibe coding, a paradigm where non-technical users instruct Large Language Models (LLMs) to ge
ArXiv cs.AI 📄 Paper 2d ago
EgoEsportsQA: An Egocentric Video Benchmark for Perception and Reasoning in Esports
arXiv:2604.12320v1 Announce Type: cross Abstract: While video large language models (Video-LLMs) excel in understanding slow-paced, real-world egocentric videos
ArXiv cs.AI 📄 Paper 2d ago
Black-Box Optimization From Small Offline Datasets via Meta Learning with Synthetic Tasks
arXiv:2604.12325v1 Announce Type: cross Abstract: We consider the problem of offline black-box optimization, where the goal is to discover optimal designs (e.g.
ArXiv cs.AI 📄 Paper 2d ago
GeM-EA: A Generative and Meta-learning Enhanced Evolutionary Algorithm for Streaming Data-Driven Optimization
arXiv:2604.12336v1 Announce Type: cross Abstract: Streaming Data-Driven Optimization (SDDO) problems arise in many applications where data arrive continuously a
ArXiv cs.AI 📄 Paper 2d ago
FRTSearch: Unified Detection and Parameter Inference of Fast Radio Transients using Instance Segmentation
arXiv:2604.12344v1 Announce Type: cross Abstract: The exponential growth of data from modern radio telescopes presents a significant challenge to traditional si
ArXiv cs.AI 📄 Paper 2d ago
Scaffold-Conditioned Preference Triplets for Controllable Molecular Optimization with Large Language Models
arXiv:2604.12350v1 Announce Type: cross Abstract: Molecular property optimization is central to drug discovery, yet many deep learning methods rely on black-box
ArXiv cs.AI 📄 Paper 2d ago
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv:2604.12374v1 Announce Type: cross Abstract: We describe the pre-training, post-training, and quantization of Nemotron 3 Super, a 120 billion (active 12 bi
ArXiv cs.AI 📄 Paper 2d ago
Cooperative Memory Paging with Keyword Bookmarks for Long-Horizon LLM Conversations
arXiv:2604.12376v1 Announce Type: cross Abstract: When LLM conversations grow beyond the context window, old content must be evicted -- but how does the model r
ArXiv cs.AI 📄 Paper 2d ago
SCRIPT: A Subcharacter Compositional Representation Injection Module for Korean Pre-Trained Language Models
arXiv:2604.12377v1 Announce Type: cross Abstract: Korean is a morphologically rich language with a featural writing system in which each character is systematic
ArXiv cs.AI 📄 Paper 2d ago
Beyond Output Correctness: Benchmarking and Evaluating Large Language Model Reasoning in Coding Tasks
arXiv:2604.12379v1 Announce Type: cross Abstract: Large language models (LLMs) increasingly rely on explicit reasoning to solve coding tasks, yet evaluating the
ArXiv cs.AI 📄 Paper 2d ago
Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models
arXiv:2604.12391v1 Announce Type: cross Abstract: In this paper, we present Chain-of-Models Pre-Training (CoM-PT), a novel performance-lossless training acceler
ArXiv cs.AI 📄 Paper 2d ago
Security and Resilience in Autonomous Vehicles: A Proactive Design Approach
arXiv:2604.12408v1 Announce Type: cross Abstract: Autonomous vehicles (AVs) promise efficient, clean and cost-effective transportation systems, but their relian
ArXiv cs.AI 📄 Paper 2d ago
RACF: A Resilient Autonomous Car Framework with Object Distance Correction
arXiv:2604.12418v1 Announce Type: cross Abstract: Autonomous vehicles are increasingly deployed in safety-critical applications, where sensing failures or cyber
ArXiv cs.AI 📄 Paper 2d ago
Decoding by Perturbation: Mitigating MLLM Hallucinations via Dynamic Textual Perturbation
arXiv:2604.12424v1 Announce Type: cross Abstract: Multimodal Large Language Models frequently suffer from inference hallucinations, partially stemming from lang