AI News — Latest Developments & Breakthroughs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

arXiv:2510.24821v3 Announce Type: replace-cross Abstract: We propose Ming-Flash-Omni, an upgraded version of Ming-Omni, built upon a sparser Mixture-of-Experts

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 4d ago

Generative deep learning for foundational video translation in ultrasound

arXiv:2511.03255v2 Announce Type: replace-cross Abstract: Deep learning (DL) has the potential to revolutionize image acquisition and interpretation across medi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Foundry: Distilling 3D Foundation Models for the Edge

arXiv:2511.20721v2 Announce Type: replace-cross Abstract: Foundation models pre-trained with self-supervised learning (SSL) on large-scale datasets have become

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

A cross-species neural foundation model for end-to-end speech decoding

arXiv:2511.21740v4 Announce Type: replace-cross Abstract: Speech brain-computer interfaces (BCIs) aim to restore communication for people with paralysis by tran

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Epistemic Bias Injection: Biasing LLMs via Selective Context Retrieval

arXiv:2512.00804v2 Announce Type: replace-cross Abstract: When answering user queries, LLMs often retrieve knowledge from external sources stored in retrieval-a

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

Constant-Time Motion Planning with Manipulation Behaviors

arXiv:2512.00939v2 Announce Type: replace-cross Abstract: Recent progress in contact-rich robotic manipulation has been striking, yet most deployed systems rema

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

ByteStorm: a multi-step data-driven approach for Tropical Cyclones detection and tracking

arXiv:2512.07885v2 Announce Type: replace-cross Abstract: Accurate tropical cyclones (TCs) tracking represents a critical challenge in the context of weather an

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

SWAA: Sliding Window Attention Adaptation for Efficient and Quality Preserving Long Context Processing

arXiv:2512.10411v5 Announce Type: replace-cross Abstract: The quadratic complexity of self attention in Transformer based LLMs renders long context inference pr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

arXiv:2512.14698v2 Announce Type: replace-cross Abstract: This paper does not introduce a novel method but instead establishes a straightforward, incremental, y

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 4d ago

IDESplat: Iterative Depth Probability Estimation for Generalizable 3D Gaussian Splatting

arXiv:2601.03824v3 Announce Type: replace-cross Abstract: Generalizable 3D Gaussian Splatting aims to directly predict Gaussian parameters using a feed-forward

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Context Matters: Peer-Aware Student Behavioral Engagement Measurement via VLM Action Parsing and LLM Sequence Classification

arXiv:2601.06394v2 Announce Type: replace-cross Abstract: Understanding student behavior in the classroom is essential to improve both pedagogical quality and s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

arXiv:2601.08881v2 Announce Type: replace-cross Abstract: Unified image generation and editing models suffer from severe task interference in dense diffusion tr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Information Access of the Oppressed: A Problem-Posing Framework for Envisioning Emancipatory Information Access Platforms

arXiv:2601.09600v2 Announce Type: replace-cross Abstract: Online information access (IA) platforms are targets of authoritarian capture. We explore the question

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

SciCoQA: Quality Assurance for Scientific Paper–Code Alignment

arXiv:2601.12910v2 Announce Type: replace-cross Abstract: We present SciCoQA, a dataset for detecting discrepancies between scientific publications and their co

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

Gradient Regularized Natural Gradients

arXiv:2601.18420v2 Announce Type: replace-cross Abstract: Gradient regularization (GR) has been shown to improve the generalizability of trained models. While N

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

Temporal Sepsis Modeling: a Fully Interpretable Relational Way

arXiv:2601.21747v2 Announce Type: replace-cross Abstract: Sepsis remains one of the most complex and heterogeneous syndromes in intensive care, characterized by

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 4d ago

Towards Exploratory and Focused Manipulation with Bimanual Active Perception: A New Problem, Benchmark and Strategy

arXiv:2602.01939v3 Announce Type: replace-cross Abstract: Recently, active vision has reemerged as an important concept for manipulation, since visual occlusion

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 4d ago

Monocular Normal Estimation via Shading Sequence Estimation

arXiv:2602.09929v5 Announce Type: replace-cross Abstract: Monocular normal estimation aims to estimate the normal map from a single RGB image of an object under

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia

arXiv:2602.18455v2 Announce Type: replace-cross Abstract: Search engines increasingly display LLM-generated answers shown above organic links, shifting search f

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 4d ago

The Landscape of AI in Science Education: What is Changing and How to Respond

arXiv:2602.18469v2 Announce Type: replace-cross Abstract: This introductory chapter explores the transformative role of artificial intelligence (AI) in reshapin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis

arXiv:2602.20951v2 Announce Type: replace-cross Abstract: Despite recent advances in diffusion models, AI generated images still often contain visual artifacts

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 4d ago

From Scale to Speed: Adaptive Test-Time Scaling for Image Editing

arXiv:2603.00141v3 Announce Type: replace-cross Abstract: Image Chain-of-Thought (Image-CoT) is a test-time scaling paradigm that improves image generation by e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Why Adam Can Beat SGD: Second-Moment Normalization Yields Sharper Tails

arXiv:2603.03099v3 Announce Type: replace-cross Abstract: Despite Adam demonstrating faster empirical convergence than SGD in many applications, much of the exi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago

Graph-of-Mark: Promote Spatial Reasoning in Multimodal Language Models with Graph-Based Visual Prompting

arXiv:2603.06663v2 Announce Type: replace-cross Abstract: Recent advances in training-free visual prompting, such as Set-of-Mark, have emerged as a promising di
