8,253 articles

📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 8,253 articles · Updated every 3 hours · View all reads

All ⚡ AI Lessons (21843) ArXiv cs.AIDev.to AIMedium · AIMedium · ProgrammingForbes InnovationMedium · Machine Learning
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Instruction Following by Principled Boosting Attention of Large Language Models
arXiv:2506.13734v3 Announce Type: replace-cross Abstract: Large language models' behavior is often shaped by instructions such as system prompts, refusal bounda
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
BMFM-RNA: whole-cell expression decoding improves transcriptomic foundation models
arXiv:2506.14861v2 Announce Type: replace-cross Abstract: Transcriptomic foundation models pretrained with masked language modeling can achieve low pretraining
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago
U-DREAM: Unsupervised Dereverberation guided by a Reverberation Model
arXiv:2507.14237v2 Announce Type: replace-cross Abstract: This paper explores the outcome of training state-of-the-art dereverberation models with supervision s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning
arXiv:2507.19737v2 Announce Type: replace-cross Abstract: The vulnerability of cities has increased with urbanization and climate change, making it more importa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
CodeNER: Code Prompting for Named Entity Recognition
arXiv:2507.20423v4 Announce Type: replace-cross Abstract: Recent studies have explored various approaches for treating candidate named entity spans as both sour
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago
Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
arXiv:2508.00307v2 Announce Type: replace-cross Abstract: We introduce a U-net model for 360{\deg} acoustic source localization formulated as a spherical semant
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation
arXiv:2508.09223v2 Announce Type: replace-cross Abstract: Test-time adaptation allows pretrained models to adjust to incoming data streams, addressing distribut
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Mapping the Course for Prompt-based Structured Prediction
arXiv:2508.15090v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have demonstrated strong performance in a wide-range of language tasks wi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
The Information Dynamics of Generative Diffusion
arXiv:2508.19897v4 Announce Type: replace-cross Abstract: Generative diffusion models have emerged as a powerful class of models in machine learning, yet a unif
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago
MedShift: Implicit Conditional Transport for X-Ray Domain Adaptation
arXiv:2508.21435v2 Announce Type: replace-cross Abstract: Synthetic medical data offers a scalable solution for training robust models, but significant domain g
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
End-to-End Low-Level Neural Control of an Industrial-Grade 6D Magnetic Levitation System
arXiv:2509.01388v2 Announce Type: replace-cross Abstract: Magnetic levitation is poised to revolutionize industrial automation by integrating flexible in-machin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
GeoResponder: Towards Building Geospatial LLMs for Time-Critical Disaster Response
arXiv:2509.19354v3 Announce Type: replace-cross Abstract: LLMs excel at linguistic tasks but lack the inner geospatial capabilities needed for time-critical dis
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models
arXiv:2509.24296v2 Announce Type: replace-cross Abstract: The rapid advancement of Diffusion Large Language Models (dLLMs) introduces unprecedented vulnerabilit
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago
CQA-Eval: Designing Reliable Evaluations of Multi-paragraph Clinical QA under Resource Constraints
arXiv:2510.10415v2 Announce Type: replace-cross Abstract: Evaluating multi-paragraph clinical question answering (QA) systems is resource-intensive and challeng
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1mo ago
Constrained Diffusion for Protein Design with Hard Structural Constraints
arXiv:2510.14989v2 Announce Type: replace-cross Abstract: Diffusion models offer a powerful means of capturing the manifold of realistic protein structures, ena
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation
arXiv:2510.24821v3 Announce Type: replace-cross Abstract: We propose Ming-Flash-Omni, an upgraded version of Ming-Omni, built upon a sparser Mixture-of-Experts
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 1mo ago
Generative deep learning for foundational video translation in ultrasound
arXiv:2511.03255v2 Announce Type: replace-cross Abstract: Deep learning (DL) has the potential to revolutionize image acquisition and interpretation across medi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Foundry: Distilling 3D Foundation Models for the Edge
arXiv:2511.20721v2 Announce Type: replace-cross Abstract: Foundation models pre-trained with self-supervised learning (SSL) on large-scale datasets have become
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
A cross-species neural foundation model for end-to-end speech decoding
arXiv:2511.21740v4 Announce Type: replace-cross Abstract: Speech brain-computer interfaces (BCIs) aim to restore communication for people with paralysis by tran
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Epistemic Bias Injection: Biasing LLMs via Selective Context Retrieval
arXiv:2512.00804v2 Announce Type: replace-cross Abstract: When answering user queries, LLMs often retrieve knowledge from external sources stored in retrieval-a