📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 1,213 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs
arXiv:2603.22293v1 Announce Type: cross Abstract: Search-augmented large language models (LLMs) trained with reinforcement learning (RL) have achieved strong re
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Efficient Embedding-based Synthetic Data Generation for Complex Reasoning Tasks
arXiv:2603.22294v1 Announce Type: cross Abstract: Synthetic Data Generation (SDG), leveraging Large Language Models (LLMs), has recently been recognized and bro
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs
arXiv:2603.22295v1 Announce Type: cross Abstract: Large language models appear to develop internal representations of emotion -- "emotion circuits," "emotion ne
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores
arXiv:2603.22299v1 Announce Type: cross Abstract: Large language models (LLMs) are often confidently wrong, making reliable uncertainty estimation (UE) essentia
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
Scaling Attention via Feature Sparsity
arXiv:2603.22300v1 Announce Type: cross Abstract: Scaling Transformers to ultra-long contexts is bottlenecked by the $O(n^2 d)$ cost of self-attention. Existing
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Latent Semantic Manifolds in Large Language Models
arXiv:2603.22301v1 Announce Type: cross Abstract: Large Language Models (LLMs) perform internal computations in continuous vector spaces yet produce discrete to
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Sample Transform Cost-Based Training-Free Hallucination Detector for Large Language Models
arXiv:2603.22303v1 Announce Type: cross Abstract: Hallucinations in large language models (LLMs) remain a central obstacle to trustworthy deployment, motivating
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
CN-Buzz2Portfolio: A Chinese-Market Dataset and Benchmark for LLM-Based Macro and Sector Asset Allocation from Daily Trending Financial News
arXiv:2603.22305v1 Announce Type: cross Abstract: Large Language Models (LLMs) are rapidly transitioning from static Natural Language Processing (NLP) tasks inc
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
UniFluids: Unified Neural Operator Learning with Conditional Flow-matching
arXiv:2603.22309v1 Announce Type: cross Abstract: Partial differential equation (PDE) simulation holds extensive significance in scientific research. Currently,
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
A Multi-Modal CNN-LSTM Framework with Multi-Head Attention and Focal Loss for Real-Time Elderly Fall Detection
arXiv:2603.22313v1 Announce Type: cross Abstract: The increasing global aging population has intensified the demand for reliable health monitoring systems, part
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
Enhancing AI-Based Tropical Cyclone Track and Intensity Forecasting via Systematic Bias Correction
arXiv:2603.22314v1 Announce Type: cross Abstract: Tropical cyclones (TCs) pose severe threats to life, infrastructure, and economies in tropical and subtropical
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Emergency Preemption Without Online Exploration: A Decision Transformer Approach
arXiv:2603.22315v1 Announce Type: cross Abstract: Emergency vehicle (EV) response time is a critical determinant of survival outcomes, yet deployed signal preem
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography
arXiv:2603.22316v1 Announce Type: cross Abstract: Group dance generation from music requires synchronizing multiple dancers while maintaining spatial coordinati
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Geometric Mixture-of-Experts with Curvature-Guided Adaptive Routing for Graph Representation Learning
arXiv:2603.22317v1 Announce Type: cross Abstract: Graph-structured data typically exhibits complex topological heterogeneity, making it difficult to model accur
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
Sparsely-Supervised Data Assimilation via Physics-Informed Schr\"odinger Bridge
arXiv:2603.22319v1 Announce Type: cross Abstract: Data assimilation (DA) for systems governed by partial differential equations (PDE) aims to reconstruct full s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs
arXiv:2603.22321v1 Announce Type: cross Abstract: The recent advancements introduced by Large Language Models (LLMs) have transformed how Artificial Intelligenc
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
AEGIS: An Operational Infrastructure for Post-Market Governance of Adaptive Medical AI Under US and EU Regulations
arXiv:2603.22322v1 Announce Type: cross Abstract: Machine learning systems deployed in medical devices require governance frameworks that ensure safety while en
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
A Multi-Task Targeted Learning Framework for Lithium-Ion Battery State-of-Health and Remaining Useful Life
arXiv:2603.22323v1 Announce Type: cross Abstract: Accurately predicting the state-of-health (SOH) and remaining useful life (RUL) of lithium-ion batteries is cr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
DAQ: Delta-Aware Quantization for Post-Training LLM Weight Compression
arXiv:2603.22324v1 Announce Type: cross Abstract: We introduce Delta-Aware Quantization (DAQ), a data-free post-training quantization framework that preserves t
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
Hybrid Associative Memories
arXiv:2603.22325v1 Announce Type: cross Abstract: Recurrent neural networks (RNNs) and self-attention are both widely used sequence-mixing layers that maintain
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
A Direct Classification Approach for Reliable Wind Ramp Event Forecasting under Severe Class Imbalance
arXiv:2603.22326v1 Announce Type: cross Abstract: Decision support systems are essential for maintaining grid stability in low-carbon power systems, such as win
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI
arXiv:2603.22327v1 Announce Type: cross Abstract: Systematic literature reviews are essential for synthesizing scientific evidence but are costly, difficult to
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
6d ago
Beyond the Mean: Distribution-Aware Loss Functions for Bimodal Regression
arXiv:2603.22328v1 Announce Type: cross Abstract: Despite the strong predictive performance achieved by machine learning models across many application domains,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
6d ago
Trained Persistent Memory for Frozen Decoder-Only LLMs
arXiv:2603.22329v1 Announce Type: cross Abstract: Decoder-only language models are stateless: hidden representations are discarded after every forward pass and
DeepCamp AI