AI News — Latest Developments & Breakthroughs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

arXiv:2603.22293v1 Announce Type: cross Abstract: Search-augmented large language models (LLMs) trained with reinforcement learning (RL) have achieved strong re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Efficient Embedding-based Synthetic Data Generation for Complex Reasoning Tasks

arXiv:2603.22294v1 Announce Type: cross Abstract: Synthetic Data Generation (SDG), leveraging Large Language Models (LLMs), has recently been recognized and bro

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs

arXiv:2603.22295v1 Announce Type: cross Abstract: Large language models appear to develop internal representations of emotion -- "emotion circuits," "emotion ne

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores

arXiv:2603.22299v1 Announce Type: cross Abstract: Large language models (LLMs) are often confidently wrong, making reliable uncertainty estimation (UE) essentia

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Scaling Attention via Feature Sparsity

arXiv:2603.22300v1 Announce Type: cross Abstract: Scaling Transformers to ultra-long contexts is bottlenecked by the $O(n^2 d)$ cost of self-attention. Existing

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Latent Semantic Manifolds in Large Language Models

arXiv:2603.22301v1 Announce Type: cross Abstract: Large Language Models (LLMs) perform internal computations in continuous vector spaces yet produce discrete to

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Sample Transform Cost-Based Training-Free Hallucination Detector for Large Language Models

arXiv:2603.22303v1 Announce Type: cross Abstract: Hallucinations in large language models (LLMs) remain a central obstacle to trustworthy deployment, motivating

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

CN-Buzz2Portfolio: A Chinese-Market Dataset and Benchmark for LLM-Based Macro and Sector Asset Allocation from Daily Trending Financial News

arXiv:2603.22305v1 Announce Type: cross Abstract: Large Language Models (LLMs) are rapidly transitioning from static Natural Language Processing (NLP) tasks inc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

UniFluids: Unified Neural Operator Learning with Conditional Flow-matching

arXiv:2603.22309v1 Announce Type: cross Abstract: Partial differential equation (PDE) simulation holds extensive significance in scientific research. Currently,

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

A Multi-Modal CNN-LSTM Framework with Multi-Head Attention and Focal Loss for Real-Time Elderly Fall Detection

arXiv:2603.22313v1 Announce Type: cross Abstract: The increasing global aging population has intensified the demand for reliable health monitoring systems, part

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Enhancing AI-Based Tropical Cyclone Track and Intensity Forecasting via Systematic Bias Correction

arXiv:2603.22314v1 Announce Type: cross Abstract: Tropical cyclones (TCs) pose severe threats to life, infrastructure, and economies in tropical and subtropical

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Emergency Preemption Without Online Exploration: A Decision Transformer Approach

arXiv:2603.22315v1 Announce Type: cross Abstract: Emergency vehicle (EV) response time is a critical determinant of survival outcomes, yet deployed signal preem

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography

arXiv:2603.22316v1 Announce Type: cross Abstract: Group dance generation from music requires synchronizing multiple dancers while maintaining spatial coordinati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Geometric Mixture-of-Experts with Curvature-Guided Adaptive Routing for Graph Representation Learning

arXiv:2603.22317v1 Announce Type: cross Abstract: Graph-structured data typically exhibits complex topological heterogeneity, making it difficult to model accur

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Sparsely-Supervised Data Assimilation via Physics-Informed Schrödinger Bridge

arXiv:2603.22319v1 Announce Type: cross Abstract: Data assimilation (DA) for systems governed by partial differential equations (PDE) aims to reconstruct full s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs

arXiv:2603.22321v1 Announce Type: cross Abstract: The recent advancements introduced by Large Language Models (LLMs) have transformed how Artificial Intelligenc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

AEGIS: An Operational Infrastructure for Post-Market Governance of Adaptive Medical AI Under US and EU Regulations

arXiv:2603.22322v1 Announce Type: cross Abstract: Machine learning systems deployed in medical devices require governance frameworks that ensure safety while en

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

A Multi-Task Targeted Learning Framework for Lithium-Ion Battery State-of-Health and Remaining Useful Life

arXiv:2603.22323v1 Announce Type: cross Abstract: Accurately predicting the state-of-health (SOH) and remaining useful life (RUL) of lithium-ion batteries is cr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

DAQ: Delta-Aware Quantization for Post-Training LLM Weight Compression

arXiv:2603.22324v1 Announce Type: cross Abstract: We introduce Delta-Aware Quantization (DAQ), a data-free post-training quantization framework that preserves t

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Hybrid Associative Memories

arXiv:2603.22325v1 Announce Type: cross Abstract: Recurrent neural networks (RNNs) and self-attention are both widely used sequence-mixing layers that maintain

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

A Direct Classification Approach for Reliable Wind Ramp Event Forecasting under Severe Class Imbalance

arXiv:2603.22326v1 Announce Type: cross Abstract: Decision support systems are essential for maintaining grid stability in low-carbon power systems, such as win

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

arXiv:2603.22327v1 Announce Type: cross Abstract: Systematic literature reviews are essential for synthesizing scientific evidence but are costly, difficult to

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Beyond the Mean: Distribution-Aware Loss Functions for Bimodal Regression

arXiv:2603.22328v1 Announce Type: cross Abstract: Despite the strong predictive performance achieved by machine learning models across many application domains,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Trained Persistent Memory for Frozen Decoder-Only LLMs

arXiv:2603.22329v1 Announce Type: cross Abstract: Decoder-only language models are stateless: hidden representations are discarded after every forward pass and
