📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (13641) ArXiv cs.AI Dev.to · FORUM WEB Dev.to AI Forbes Innovation OpenAI News Medium · Programming

A Multiparty Homomorphic Encryption Approach to Confidential Federated Kaplan Meier Survival Analysis

arXiv:2412.20495v2 Announce Type: replace-cross Abstract: The proliferation of real-world health data enables multi-institutional survival studies, yet privacy

ArXiv cs.AI 📄 Paper 4d ago

Influencing Humans to Conform to Preference Models for RLHF

arXiv:2501.06416v3 Announce Type: replace-cross Abstract: Designing a reinforcement learning from human feedback (RLHF) algorithm to approximate a human's unobs

ArXiv cs.AI 📄 Paper 4d ago

Curriculum-based Sample Efficient Reinforcement Learning for Robust Stabilization of a Quadrotor

arXiv:2501.18490v3 Announce Type: replace-cross Abstract: This article introduces a novel sample-efficient curriculum learning (CL) approach for training an end

ArXiv cs.AI 📄 Paper 4d ago

Integrating Semi-Supervised and Active Learning for Semantic Segmentation

arXiv:2501.19227v2 Announce Type: replace-cross Abstract: In this paper, we propose a novel active learning approach integrated with an improved semi-supervised

ArXiv cs.AI 📄 Paper 4d ago

Large Language Models Can Help Mitigate Barren Plateaus in Quantum Neural Networks

arXiv:2502.13166v3 Announce Type: replace-cross Abstract: In the era of noisy intermediate-scale quantum (NISQ) computing, Quantum Neural Networks (QNNs) have e

ArXiv cs.AI 📄 Paper 4d ago

ExPath: Targeted Pathway Inference for Biological Knowledge Bases via Graph Learning and Explanation

arXiv:2502.18026v3 Announce Type: replace-cross Abstract: Retrieving targeted pathways in biological knowledge bases, particularly when incorporating wet-lab ex

ArXiv cs.AI 📄 Paper 4d ago

Learning to Play Piano in the Real World

arXiv:2503.15481v3 Announce Type: replace-cross Abstract: Towards the grand challenge of achieving human-level manipulation in robots, playing piano is a compel

ArXiv cs.AI 📄 Paper 4d ago

AccidentSim: Generating Vehicle Collision Videos with Physically Realistic Collision Trajectories from Real-World Accident Reports

arXiv:2503.20654v4 Announce Type: replace-cross Abstract: Collecting real-world vehicle accident videos for autonomous driving research is challenging due to th

ArXiv cs.AI 📄 Paper 4d ago

If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs

arXiv:2503.23514v2 Announce Type: replace-cross Abstract: Large language models (LLMs) can carry out human-like dialogue, but unlike humans, they are stateless

ArXiv cs.AI 📄 Paper 4d ago

TARAC: Mitigating Hallucination in LVLMs via Temporal Attention Real-time Accumulative Connection

arXiv:2504.04099v2 Announce Type: replace-cross Abstract: Large Vision-Language Models have demonstrated remarkable capabilities, yet they suffer from hallucina

ArXiv cs.AI 📄 Paper 4d ago

Optimizing Large Language Models: Metrics, Energy Efficiency, and Case Study Insights

arXiv:2504.06307v2 Announce Type: replace-cross Abstract: The rapid adoption of large language models (LLMs) has led to significant energy consumption and carbo

ArXiv cs.AI 📄 Paper 4d ago

Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning

arXiv:2504.13818v4 Announce Type: replace-cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has emerged as the leading approach for enhancin

ArXiv cs.AI 📄 Paper 4d ago

LOOPE: Learnable Optimal Patch Order in Positional Embeddings for Vision Transformers

arXiv:2504.14386v2 Announce Type: replace-cross Abstract: Positional embeddings (PE) play a crucial role in Vision Transformers (ViTs) by providing spatial info

ArXiv cs.AI 📄 Paper 4d ago

Non-stationary Diffusion For Probabilistic Time Series Forecasting

arXiv:2505.04278v3 Announce Type: replace-cross Abstract: Due to the dynamics of underlying physics and external influences, the uncertainty of time series ofte

ArXiv cs.AI 📄 Paper 4d ago

Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

arXiv:2505.04842v2 Announce Type: replace-cross Abstract: Prevalent reinforcement learning~(RL) methods for fine-tuning LLM reasoners, such as GRPO or Leave-one

ArXiv cs.AI 📄 Paper 4d ago

Auto-regressive transformation for image alignment

arXiv:2505.04864v2 Announce Type: replace-cross Abstract: Existing methods for image alignment struggle in cases involving feature-sparse regions, extreme scale

ArXiv cs.AI 📄 Paper 4d ago

Variational Visual Question Answering for Uncertainty-Aware Selective Prediction

arXiv:2505.09591v3 Announce Type: replace-cross Abstract: Despite remarkable progress in recent years, Vision Language Models (VLMs) remain prone to overconfide

ArXiv cs.AI 📄 Paper 4d ago

TokUR: Token-Level Uncertainty Estimation for Large Language Model Reasoning

arXiv:2505.11737v4 Announce Type: replace-cross Abstract: While Large Language Models (LLMs) have demonstrated impressive capabilities, their output quality rem

ArXiv cs.AI 📄 Paper 4d ago

Sat2Sound: A Unified Framework for Zero-Shot Soundscape Mapping

arXiv:2505.13777v2 Announce Type: replace-cross Abstract: We present Sat2Sound, a unified multimodal framework for geospatial soundscape understanding, designed

ArXiv cs.AI 📄 Paper 4d ago

SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence

arXiv:2505.17012v3 Announce Type: replace-cross Abstract: Existing evaluations of multimodal large language models (MLLMs) on spatial intelligence are typically

ArXiv cs.AI 📄 Paper 4d ago

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

arXiv:2505.17022v2 Announce Type: replace-cross Abstract: Visual generation models have made remarkable progress in creating realistic images from text prompts,

ArXiv cs.AI 📄 Paper 4d ago

Tuning Language Models for Robust Prediction of Diverse User Behaviors

arXiv:2505.17682v2 Announce Type: replace-cross Abstract: Predicting user behavior is essential for intelligent assistant services, yet deep learning models oft

ArXiv cs.AI 📄 Paper 4d ago

Learning World Models for Interactive Video Generation

arXiv:2505.21996v3 Announce Type: replace-cross Abstract: Foundational world models must be both interactive and preserve spatiotemporal coherence for effective

ArXiv cs.AI 📄 Paper 4d ago

Towards Reasonable Concept Bottleneck Models

arXiv:2506.05014v2 Announce Type: replace-cross Abstract: We propose a novel, flexible, and efficient framework for designing Concept Bottleneck Models (CBMs) t