📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 1,213 articles · Updated every 3 hours · View all news
All
⚡ AI Lessons (4973)
ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIWeaviate Blog
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Instruction Following by Principled Boosting Attention of Large Language Models
arXiv:2506.13734v3 Announce Type: replace-cross Abstract: Large language models' behavior is often shaped by instructions such as system prompts, refusal bounda
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
BMFM-RNA: whole-cell expression decoding improves transcriptomic foundation models
arXiv:2506.14861v2 Announce Type: replace-cross Abstract: Transcriptomic foundation models pretrained with masked language modeling can achieve low pretraining
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
3d ago
U-DREAM: Unsupervised Dereverberation guided by a Reverberation Model
arXiv:2507.14237v2 Announce Type: replace-cross Abstract: This paper explores the outcome of training state-of-the-art dereverberation models with supervision s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning
arXiv:2507.19737v2 Announce Type: replace-cross Abstract: The vulnerability of cities has increased with urbanization and climate change, making it more importa
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
CodeNER: Code Prompting for Named Entity Recognition
arXiv:2507.20423v4 Announce Type: replace-cross Abstract: Recent studies have explored various approaches for treating candidate named entity spans as both sour
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
3d ago
Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
arXiv:2508.00307v2 Announce Type: replace-cross Abstract: We introduce a U-net model for 360{\deg} acoustic source localization formulated as a spherical semant
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation
arXiv:2508.09223v2 Announce Type: replace-cross Abstract: Test-time adaptation allows pretrained models to adjust to incoming data streams, addressing distribut
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Mapping the Course for Prompt-based Structured Prediction
arXiv:2508.15090v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have demonstrated strong performance in a wide-range of language tasks wi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
The Information Dynamics of Generative Diffusion
arXiv:2508.19897v4 Announce Type: replace-cross Abstract: Generative diffusion models have emerged as a powerful class of models in machine learning, yet a unif
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
3d ago
MedShift: Implicit Conditional Transport for X-Ray Domain Adaptation
arXiv:2508.21435v2 Announce Type: replace-cross Abstract: Synthetic medical data offers a scalable solution for training robust models, but significant domain g
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
End-to-End Low-Level Neural Control of an Industrial-Grade 6D Magnetic Levitation System
arXiv:2509.01388v2 Announce Type: replace-cross Abstract: Magnetic levitation is poised to revolutionize industrial automation by integrating flexible in-machin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
GeoResponder: Towards Building Geospatial LLMs for Time-Critical Disaster Response
arXiv:2509.19354v3 Announce Type: replace-cross Abstract: LLMs excel at linguistic tasks but lack the inner geospatial capabilities needed for time-critical dis
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models
arXiv:2509.24296v2 Announce Type: replace-cross Abstract: The rapid advancement of Diffusion Large Language Models (dLLMs) introduces unprecedented vulnerabilit
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
3d ago
CQA-Eval: Designing Reliable Evaluations of Multi-paragraph Clinical QA under Resource Constraints
arXiv:2510.10415v2 Announce Type: replace-cross Abstract: Evaluating multi-paragraph clinical question answering (QA) systems is resource-intensive and challeng
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
3d ago
Constrained Diffusion for Protein Design with Hard Structural Constraints
arXiv:2510.14989v2 Announce Type: replace-cross Abstract: Diffusion models offer a powerful means of capturing the manifold of realistic protein structures, ena
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation
arXiv:2510.24821v3 Announce Type: replace-cross Abstract: We propose Ming-Flash-Omni, an upgraded version of Ming-Omni, built upon a sparser Mixture-of-Experts
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
3d ago
Generative deep learning for foundational video translation in ultrasound
arXiv:2511.03255v2 Announce Type: replace-cross Abstract: Deep learning (DL) has the potential to revolutionize image acquisition and interpretation across medi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Foundry: Distilling 3D Foundation Models for the Edge
arXiv:2511.20721v2 Announce Type: replace-cross Abstract: Foundation models pre-trained with self-supervised learning (SSL) on large-scale datasets have become
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
A cross-species neural foundation model for end-to-end speech decoding
arXiv:2511.21740v4 Announce Type: replace-cross Abstract: Speech brain-computer interfaces (BCIs) aim to restore communication for people with paralysis by tran
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
Epistemic Bias Injection: Biasing LLMs via Selective Context Retrieval
arXiv:2512.00804v2 Announce Type: replace-cross Abstract: When answering user queries, LLMs often retrieve knowledge from external sources stored in retrieval-a
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
3d ago
Constant-Time Motion Planning with Manipulation Behaviors
arXiv:2512.00939v2 Announce Type: replace-cross Abstract: Recent progress in contact-rich robotic manipulation has been striking, yet most deployed systems rema
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
3d ago
ByteStorm: a multi-step data-driven approach for Tropical Cyclones detection and tracking
arXiv:2512.07885v2 Announce Type: replace-cross Abstract: Accurate tropical cyclones (TCs) tracking represents a critical challenge in the context of weather an
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
SWAA: Sliding Window Attention Adaptation for Efficient and Quality Preserving Long Context Processing
arXiv:2512.10411v5 Announce Type: replace-cross Abstract: The quadratic complexity of self attention in Transformer based LLMs renders long context inference pr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3d ago
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
arXiv:2512.14698v2 Announce Type: replace-cross Abstract: This paper does not introduce a novel method but instead establishes a straightforward, incremental, y
DeepCamp AI