📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 1,754 articles · Updated every 3 hours · View all news
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation
arXiv:2510.24821v3 Announce Type: replace-cross Abstract: We propose Ming-Flash-Omni, an upgraded version of Ming-Omni, built upon a sparser Mixture-of-Experts
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
4d ago
Generative deep learning for foundational video translation in ultrasound
arXiv:2511.03255v2 Announce Type: replace-cross Abstract: Deep learning (DL) has the potential to revolutionize image acquisition and interpretation across medi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Foundry: Distilling 3D Foundation Models for the Edge
arXiv:2511.20721v2 Announce Type: replace-cross Abstract: Foundation models pre-trained with self-supervised learning (SSL) on large-scale datasets have become
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
A cross-species neural foundation model for end-to-end speech decoding
arXiv:2511.21740v4 Announce Type: replace-cross Abstract: Speech brain-computer interfaces (BCIs) aim to restore communication for people with paralysis by tran
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Epistemic Bias Injection: Biasing LLMs via Selective Context Retrieval
arXiv:2512.00804v2 Announce Type: replace-cross Abstract: When answering user queries, LLMs often retrieve knowledge from external sources stored in retrieval-a
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
4d ago
Constant-Time Motion Planning with Manipulation Behaviors
arXiv:2512.00939v2 Announce Type: replace-cross Abstract: Recent progress in contact-rich robotic manipulation has been striking, yet most deployed systems rema
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
4d ago
ByteStorm: a multi-step data-driven approach for Tropical Cyclones detection and tracking
arXiv:2512.07885v2 Announce Type: replace-cross Abstract: Accurate tropical cyclones (TCs) tracking represents a critical challenge in the context of weather an
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
SWAA: Sliding Window Attention Adaptation for Efficient and Quality Preserving Long Context Processing
arXiv:2512.10411v5 Announce Type: replace-cross Abstract: The quadratic complexity of self attention in Transformer based LLMs renders long context inference pr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
arXiv:2512.14698v2 Announce Type: replace-cross Abstract: This paper does not introduce a novel method but instead establishes a straightforward, incremental, y
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
4d ago
IDESplat: Iterative Depth Probability Estimation for Generalizable 3D Gaussian Splatting
arXiv:2601.03824v3 Announce Type: replace-cross Abstract: Generalizable 3D Gaussian Splatting aims to directly predict Gaussian parameters using a feed-forward
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Context Matters: Peer-Aware Student Behavioral Engagement Measurement via VLM Action Parsing and LLM Sequence Classification
arXiv:2601.06394v2 Announce Type: replace-cross Abstract: Understanding student behavior in the classroom is essential to improve both pedagogical quality and s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts
arXiv:2601.08881v2 Announce Type: replace-cross Abstract: Unified image generation and editing models suffer from severe task interference in dense diffusion tr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Information Access of the Oppressed: A Problem-Posing Framework for Envisioning Emancipatory Information Access Platforms
arXiv:2601.09600v2 Announce Type: replace-cross Abstract: Online information access (IA) platforms are targets of authoritarian capture. We explore the question
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
4d ago
SciCoQA: Quality Assurance for Scientific Paper--Code Alignment
arXiv:2601.12910v2 Announce Type: replace-cross Abstract: We present SciCoQA, a dataset for detecting discrepancies between scientific publications and their co
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
4d ago
Gradient Regularized Natural Gradients
arXiv:2601.18420v2 Announce Type: replace-cross Abstract: Gradient regularization (GR) has been shown to improve the generalizability of trained models. While N
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
4d ago
Temporal Sepsis Modeling: a Fully Interpretable Relational Way
arXiv:2601.21747v2 Announce Type: replace-cross Abstract: Sepsis remains one of the most complex and heterogeneous syndromes in intensive care, characterized by
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
4d ago
Towards Exploratory and Focused Manipulation with Bimanual Active Perception: A New Problem, Benchmark and Strategy
arXiv:2602.01939v3 Announce Type: replace-cross Abstract: Recently, active vision has reemerged as an important concept for manipulation, since visual occlusion
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
4d ago
Monocular Normal Estimation via Shading Sequence Estimation
arXiv:2602.09929v5 Announce Type: replace-cross Abstract: Monocular normal estimation aims to estimate the normal map from a single RGB image of an object under
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia
arXiv:2602.18455v2 Announce Type: replace-cross Abstract: Search engines increasingly display LLM-generated answers shown above organic links, shifting search f
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
4d ago
The Landscape of AI in Science Education: What is Changing and How to Respond
arXiv:2602.18469v2 Announce Type: replace-cross Abstract: This introductory chapter explores the transformative role of artificial intelligence (AI) in reshapin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis
arXiv:2602.20951v2 Announce Type: replace-cross Abstract: Despite recent advances in diffusion models, AI generated images still often contain visual artifacts
ArXiv cs.AI
👁️ Computer Vision
📄 Paper
⚡ AI Lesson
4d ago
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
arXiv:2603.00141v3 Announce Type: replace-cross Abstract: Image Chain-of-Thought (Image-CoT) is a test-time scaling paradigm that improves image generation by e
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Why Adam Can Beat SGD: Second-Moment Normalization Yields Sharper Tails
arXiv:2603.03099v3 Announce Type: replace-cross Abstract: Despite Adam demonstrating faster empirical convergence than SGD in many applications, much of the exi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Graph-of-Mark: Promote Spatial Reasoning in Multimodal Language Models with Graph-Based Visual Prompting
arXiv:2603.06663v2 Announce Type: replace-cross Abstract: Recent advances in training-free visual prompting, such as Set-of-Mark, have emerged as a promising di
DeepCamp AI