📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 5,060 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (13937)
ArXiv cs.AIDev.to AIDev.to · FORUM WEBForbes InnovationOpenAI NewsMedium · Programming
ArXiv cs.AI
📄 Paper
6d ago
Mitigating Extrinsic Gender Bias for Bangla Classification Tasks
arXiv:2411.10636v2 Announce Type: replace-cross Abstract: In this study, we investigate extrinsic gender bias in Bangla pretrained language models, a largely un
ArXiv cs.AI
📄 Paper
6d ago
OmniPrism: Learning Disentangled Visual Concept for Image Generation
arXiv:2412.12242v2 Announce Type: replace-cross Abstract: Creative visual concept generation often draws inspiration from specific concepts in a reference image
ArXiv cs.AI
📄 Paper
6d ago
Neurons Speak in Ranges: Breaking Free from Discrete Neuronal Attribution
arXiv:2502.06809v3 Announce Type: replace-cross Abstract: Pervasive polysemanticity in large language models (LLMs) undermines discrete neuron-concept attributi
ArXiv cs.AI
📄 Paper
6d ago
AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents Advances Understanding of Human Behaviors and Society
arXiv:2502.08691v2 Announce Type: replace-cross Abstract: Understanding human behavior and society is a central focus in social sciences, with the rise of gener
ArXiv cs.AI
📄 Paper
6d ago
Constraining Sequential Model Editing with Editing Anchor Compression
arXiv:2503.00035v2 Announce Type: replace-cross Abstract: Large language models (LLMs) struggle with hallucinations due to false or outdated knowledge. Given th
ArXiv cs.AI
📄 Paper
6d ago
Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models
arXiv:2505.12509v3 Announce Type: replace-cross Abstract: Post-hoc explanations provide transparency and are essential for guiding model optimization, such as p
ArXiv cs.AI
📄 Paper
6d ago
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment
arXiv:2505.18600v3 Announce Type: replace-cross Abstract: Modern single-image super-resolution (SISR) models deliver photo-realistic results at the scale factor
ArXiv cs.AI
📄 Paper
6d ago
Gen-n-Val: Agentic Image Data Generation and Validation
arXiv:2506.04676v2 Announce Type: replace-cross Abstract: The data scarcity, label noise, and long-tailed category imbalance remain important and unresolved cha
ArXiv cs.AI
📄 Paper
6d ago
Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations
arXiv:2506.09067v2 Announce Type: replace-cross Abstract: Generative medical vision-language models~(Med-VLMs) are primarily designed to generate complex textua
ArXiv cs.AI
📄 Paper
6d ago
Listener-Rewarded Thinking in VLMs for Image Preferences
arXiv:2506.22832v3 Announce Type: replace-cross Abstract: Training robust and generalizable reward models for human visual preferences is essential for aligning
ArXiv cs.AI
📄 Paper
6d ago
Provable Post-Training Quantization: Theoretical Analysis of OPTQ and Qronos
arXiv:2508.04853v2 Announce Type: replace-cross Abstract: Post-training quantization (PTQ) has become a crucial tool for reducing the memory and compute costs o
ArXiv cs.AI
📄 Paper
6d ago
VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding
arXiv:2508.06869v4 Announce Type: replace-cross Abstract: Multimodal large language models (MLLMs) demonstrate exceptional performance in vision-language tasks,
ArXiv cs.AI
📄 Paper
6d ago
Mitigating Domain Drift in Multi Species Segmentation with DINOv2: A Cross-Domain Evaluation in Herbicide Research Trials
arXiv:2508.07514v4 Announce Type: replace-cross Abstract: Reliable plant species and damage segmentation for herbicide field research trials requires models tha
ArXiv cs.AI
📄 Paper
6d ago
Investigating Multimodal Large Language Models to Support Usability Evaluation
arXiv:2508.16165v2 Announce Type: replace-cross Abstract: Usability evaluation is an essential method to support the design of effective and intuitive user inte
ArXiv cs.AI
📄 Paper
6d ago
AR-KAN: Autoregressive-Weight-Enhanced Kolmogorov-Arnold Network for Time Series Forecasting
arXiv:2509.02967v3 Announce Type: replace-cross Abstract: Traditional neural networks struggle to capture the spectral structure of complex signals. Fourier neu
ArXiv cs.AI
📄 Paper
6d ago
STCast: Adaptive Boundary Alignment for Global and Regional Weather Forecasting
arXiv:2509.25210v3 Announce Type: replace-cross Abstract: To gain finer regional forecasts, many works have explored the regional integration from the global at
ArXiv cs.AI
📄 Paper
6d ago
On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs
arXiv:2509.25214v3 Announce Type: replace-cross Abstract: As increasingly large pre-trained models are released, deploying them on edge devices for privacy-pres
ArXiv cs.AI
📄 Paper
6d ago
Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search
arXiv:2509.26435v2 Announce Type: replace-cross Abstract: Controllable summarization moves beyond generic outputs toward human-aligned summaries guided by speci
ArXiv cs.AI
📄 Paper
6d ago
Traj2Action: A Co-Denoising Framework for Trajectory-Guided Human-to-Robot Skill Transfer
arXiv:2510.00491v3 Announce Type: replace-cross Abstract: Learning diverse manipulation skills for real-world robots is severely bottlenecked by the reliance on
ArXiv cs.AI
📄 Paper
6d ago
Unmasking Puppeteers: Leveraging Biometric Leakage to Disarm Impersonation in AI-based Videoconferencing
arXiv:2510.03548v3 Announce Type: replace-cross Abstract: AI-based talking-head videoconferencing systems reduce bandwidth by sending a compact pose-expression
ArXiv cs.AI
📄 Paper
6d ago
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
arXiv:2510.06499v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have achieved remarkable success through imitation learning on vast text
ArXiv cs.AI
📄 Paper
6d ago
Dejavu: Towards Experience Feedback Learning for Embodied Intelligence
arXiv:2510.10181v3 Announce Type: replace-cross Abstract: Embodied agents face a fundamental limitation: once deployed in real-world environments, they cannot e
ArXiv cs.AI
📄 Paper
6d ago
RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation
arXiv:2510.17640v3 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models have demonstrated remarkable performance on complex tasks through
ArXiv cs.AI
📄 Paper
6d ago
LLM4Delay: Flight Delay Prediction via Cross-Modality Adaptation of Large Language Models and Aircraft Trajectory Representation
arXiv:2510.23636v3 Announce Type: replace-cross Abstract: Flight delay prediction has become a key focus in air traffic management (ATM), as delays reflect inef
DeepCamp AI