📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 8,253 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (21843) ArXiv cs.AI Dev.to AI Medium · AI Medium · Programming Forbes Innovation Medium · Machine Learning

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Limits of Inference Scaling Through Resampling

arXiv:2411.17501v3 Announce Type: replace-cross Abstract: Recent research has generated hope that inference scaling, such as resampling solutions until they pas

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Physics-Informed Evolution: An Evolutionary Framework for Solving Quantum Control Problems Involving the Schr\"odinger Equation

arXiv:2502.05228v3 Announce Type: replace-cross Abstract: Physics-informed Neural Networks (PINNs) show that embedding physical laws directly into the learning

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The LLM Bottleneck: Why Open-Source Vision LLMs Struggle with Hierarchical Visual Recognition

arXiv:2505.24840v2 Announce Type: replace-cross Abstract: This paper reveals that many open-source large language models (LLMs) lack hierarchical knowledge abou

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents

arXiv:2506.12104v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are increasingly central to agentic systems due to their strong reasoning

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Instruction Following by Principled Boosting Attention of Large Language Models

arXiv:2506.13734v3 Announce Type: replace-cross Abstract: Large language models' behavior is often shaped by instructions such as system prompts, refusal bounda

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

BMFM-RNA: whole-cell expression decoding improves transcriptomic foundation models

arXiv:2506.14861v2 Announce Type: replace-cross Abstract: Transcriptomic foundation models pretrained with masked language modeling can achieve low pretraining

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago

U-DREAM: Unsupervised Dereverberation guided by a Reverberation Model

arXiv:2507.14237v2 Announce Type: replace-cross Abstract: This paper explores the outcome of training state-of-the-art dereverberation models with supervision s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning

arXiv:2507.19737v2 Announce Type: replace-cross Abstract: The vulnerability of cities has increased with urbanization and climate change, making it more importa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CodeNER: Code Prompting for Named Entity Recognition

arXiv:2507.20423v4 Announce Type: replace-cross Abstract: Recent studies have explored various approaches for treating candidate named entity spans as both sour

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago

Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

arXiv:2508.00307v2 Announce Type: replace-cross Abstract: We introduce a U-net model for 360{\deg} acoustic source localization formulated as a spherical semant

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation

arXiv:2508.09223v2 Announce Type: replace-cross Abstract: Test-time adaptation allows pretrained models to adjust to incoming data streams, addressing distribut

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Mapping the Course for Prompt-based Structured Prediction

arXiv:2508.15090v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have demonstrated strong performance in a wide-range of language tasks wi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Information Dynamics of Generative Diffusion

arXiv:2508.19897v4 Announce Type: replace-cross Abstract: Generative diffusion models have emerged as a powerful class of models in machine learning, yet a unif

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago

MedShift: Implicit Conditional Transport for X-Ray Domain Adaptation

arXiv:2508.21435v2 Announce Type: replace-cross Abstract: Synthetic medical data offers a scalable solution for training robust models, but significant domain g

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

End-to-End Low-Level Neural Control of an Industrial-Grade 6D Magnetic Levitation System

arXiv:2509.01388v2 Announce Type: replace-cross Abstract: Magnetic levitation is poised to revolutionize industrial automation by integrating flexible in-machin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

GeoResponder: Towards Building Geospatial LLMs for Time-Critical Disaster Response

arXiv:2509.19354v3 Announce Type: replace-cross Abstract: LLMs excel at linguistic tasks but lack the inner geospatial capabilities needed for time-critical dis

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

arXiv:2509.24296v2 Announce Type: replace-cross Abstract: The rapid advancement of Diffusion Large Language Models (dLLMs) introduces unprecedented vulnerabilit

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago

CQA-Eval: Designing Reliable Evaluations of Multi-paragraph Clinical QA under Resource Constraints

arXiv:2510.10415v2 Announce Type: replace-cross Abstract: Evaluating multi-paragraph clinical question answering (QA) systems is resource-intensive and challeng

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago

Constrained Diffusion for Protein Design with Hard Structural Constraints

arXiv:2510.14989v2 Announce Type: replace-cross Abstract: Diffusion models offer a powerful means of capturing the manifold of realistic protein structures, ena

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

arXiv:2510.24821v3 Announce Type: replace-cross Abstract: We propose Ming-Flash-Omni, an upgraded version of Ming-Omni, built upon a sparser Mixture-of-Experts

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1mo ago

Generative deep learning for foundational video translation in ultrasound

arXiv:2511.03255v2 Announce Type: replace-cross Abstract: Deep learning (DL) has the potential to revolutionize image acquisition and interpretation across medi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Foundry: Distilling 3D Foundation Models for the Edge

arXiv:2511.20721v2 Announce Type: replace-cross Abstract: Foundation models pre-trained with self-supervised learning (SSL) on large-scale datasets have become

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A cross-species neural foundation model for end-to-end speech decoding

arXiv:2511.21740v4 Announce Type: replace-cross Abstract: Speech brain-computer interfaces (BCIs) aim to restore communication for people with paralysis by tran

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Epistemic Bias Injection: Biasing LLMs via Selective Context Retrieval

arXiv:2512.00804v2 Announce Type: replace-cross Abstract: When answering user queries, LLMs often retrieve knowledge from external sources stored in retrieval-a