📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (13937) ArXiv cs.AI Dev.to AI Dev.to · FORUM WEB Forbes Innovation OpenAI News Medium · Programming

Mitigating Extrinsic Gender Bias for Bangla Classification Tasks

arXiv:2411.10636v2 Announce Type: replace-cross Abstract: In this study, we investigate extrinsic gender bias in Bangla pretrained language models, a largely un

ArXiv cs.AI 📄 Paper 6d ago

OmniPrism: Learning Disentangled Visual Concept for Image Generation

arXiv:2412.12242v2 Announce Type: replace-cross Abstract: Creative visual concept generation often draws inspiration from specific concepts in a reference image

ArXiv cs.AI 📄 Paper 6d ago

Neurons Speak in Ranges: Breaking Free from Discrete Neuronal Attribution

arXiv:2502.06809v3 Announce Type: replace-cross Abstract: Pervasive polysemanticity in large language models (LLMs) undermines discrete neuron-concept attributi

ArXiv cs.AI 📄 Paper 6d ago

AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents Advances Understanding of Human Behaviors and Society

arXiv:2502.08691v2 Announce Type: replace-cross Abstract: Understanding human behavior and society is a central focus in social sciences, with the rise of gener

ArXiv cs.AI 📄 Paper 6d ago

Constraining Sequential Model Editing with Editing Anchor Compression

arXiv:2503.00035v2 Announce Type: replace-cross Abstract: Large language models (LLMs) struggle with hallucinations due to false or outdated knowledge. Given th

ArXiv cs.AI 📄 Paper 6d ago

Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models

arXiv:2505.12509v3 Announce Type: replace-cross Abstract: Post-hoc explanations provide transparency and are essential for guiding model optimization, such as p

ArXiv cs.AI 📄 Paper 6d ago

Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

arXiv:2505.18600v3 Announce Type: replace-cross Abstract: Modern single-image super-resolution (SISR) models deliver photo-realistic results at the scale factor

ArXiv cs.AI 📄 Paper 6d ago

Gen-n-Val: Agentic Image Data Generation and Validation

arXiv:2506.04676v2 Announce Type: replace-cross Abstract: The data scarcity, label noise, and long-tailed category imbalance remain important and unresolved cha

ArXiv cs.AI 📄 Paper 6d ago

Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations

arXiv:2506.09067v2 Announce Type: replace-cross Abstract: Generative medical vision-language models~(Med-VLMs) are primarily designed to generate complex textua

ArXiv cs.AI 📄 Paper 6d ago

Listener-Rewarded Thinking in VLMs for Image Preferences

arXiv:2506.22832v3 Announce Type: replace-cross Abstract: Training robust and generalizable reward models for human visual preferences is essential for aligning

ArXiv cs.AI 📄 Paper 6d ago

Provable Post-Training Quantization: Theoretical Analysis of OPTQ and Qronos

arXiv:2508.04853v2 Announce Type: replace-cross Abstract: Post-training quantization (PTQ) has become a crucial tool for reducing the memory and compute costs o

ArXiv cs.AI 📄 Paper 6d ago

VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding

arXiv:2508.06869v4 Announce Type: replace-cross Abstract: Multimodal large language models (MLLMs) demonstrate exceptional performance in vision-language tasks,

ArXiv cs.AI 📄 Paper 6d ago

Mitigating Domain Drift in Multi Species Segmentation with DINOv2: A Cross-Domain Evaluation in Herbicide Research Trials

arXiv:2508.07514v4 Announce Type: replace-cross Abstract: Reliable plant species and damage segmentation for herbicide field research trials requires models tha

ArXiv cs.AI 📄 Paper 6d ago

Investigating Multimodal Large Language Models to Support Usability Evaluation

arXiv:2508.16165v2 Announce Type: replace-cross Abstract: Usability evaluation is an essential method to support the design of effective and intuitive user inte

ArXiv cs.AI 📄 Paper 6d ago

AR-KAN: Autoregressive-Weight-Enhanced Kolmogorov-Arnold Network for Time Series Forecasting

arXiv:2509.02967v3 Announce Type: replace-cross Abstract: Traditional neural networks struggle to capture the spectral structure of complex signals. Fourier neu

ArXiv cs.AI 📄 Paper 6d ago

STCast: Adaptive Boundary Alignment for Global and Regional Weather Forecasting

arXiv:2509.25210v3 Announce Type: replace-cross Abstract: To gain finer regional forecasts, many works have explored the regional integration from the global at

ArXiv cs.AI 📄 Paper 6d ago

On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs

arXiv:2509.25214v3 Announce Type: replace-cross Abstract: As increasingly large pre-trained models are released, deploying them on edge devices for privacy-pres

ArXiv cs.AI 📄 Paper 6d ago

Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search

arXiv:2509.26435v2 Announce Type: replace-cross Abstract: Controllable summarization moves beyond generic outputs toward human-aligned summaries guided by speci

ArXiv cs.AI 📄 Paper 6d ago

Traj2Action: A Co-Denoising Framework for Trajectory-Guided Human-to-Robot Skill Transfer

arXiv:2510.00491v3 Announce Type: replace-cross Abstract: Learning diverse manipulation skills for real-world robots is severely bottlenecked by the reliance on

ArXiv cs.AI 📄 Paper 6d ago

Unmasking Puppeteers: Leveraging Biometric Leakage to Disarm Impersonation in AI-based Videoconferencing

arXiv:2510.03548v3 Announce Type: replace-cross Abstract: AI-based talking-head videoconferencing systems reduce bandwidth by sending a compact pose-expression

ArXiv cs.AI 📄 Paper 6d ago

Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

arXiv:2510.06499v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have achieved remarkable success through imitation learning on vast text

ArXiv cs.AI 📄 Paper 6d ago

Dejavu: Towards Experience Feedback Learning for Embodied Intelligence

arXiv:2510.10181v3 Announce Type: replace-cross Abstract: Embodied agents face a fundamental limitation: once deployed in real-world environments, they cannot e

ArXiv cs.AI 📄 Paper 6d ago

RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation

arXiv:2510.17640v3 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models have demonstrated remarkable performance on complex tasks through

ArXiv cs.AI 📄 Paper 6d ago

LLM4Delay: Flight Delay Prediction via Cross-Modality Adaptation of Large Language Models and Aircraft Trajectory Representation

arXiv:2510.23636v3 Announce Type: replace-cross Abstract: Flight delay prediction has become a key focus in air traffic management (ATM), as delays reflect inef