AI News — Latest Developments & Breakthroughs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

Before We Trust Them: Decision-Making Failures in Navigation of Foundation Models

arXiv:2601.05529v4 Announce Type: replace Abstract: High success rates on navigation-related tasks do not necessarily translate into reliable decision making by

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

arXiv:2601.08323v3 Announce Type: replace Abstract: Equipping agents with memory is essential for solving real-world long-horizon problems. However, most existi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

See, Symbolize, Act: Grounding VLMs with Spatial Representations for Better Gameplay

arXiv:2603.11601v2 Announce Type: replace Abstract: Vision-Language Models (VLMs) excel at describing visual scenes, yet struggle to translate perception into p

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 10h ago

Draft-and-Prune: Improving the Reliability of Auto-formalization for Logical Reasoning

arXiv:2603.17233v2 Announce Type: replace Abstract: Auto-formalization (AF) translates natural-language reasoning problems into solver-executable programs, enab

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

Governance-Aware Vector Subscriptions for Multi-Agent Knowledge Ecosystems

arXiv:2603.20833v2 Announce Type: replace Abstract: As AI agent ecosystems grow, agents need mechanisms to monitor relevant knowledge in real time. Semantic pub

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly

arXiv:2405.00181v3 Announce Type: replace-cross Abstract: Video anomaly understanding (VAU) aims to automatically comprehend unusual occurrences in videos, ther

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 10h ago

Complexity-Aware Deep Symbolic Regression with Robust Risk-Seeking Policy Gradients

arXiv:2406.06751v3 Announce Type: replace-cross Abstract: We propose a novel deep symbolic regression approach to enhance the robustness and interpretability of

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

CGRA4ML: A Hardware/Software Framework to Implement Neural Networks for Scientific Edge Computing

arXiv:2408.15561v4 Announce Type: replace-cross Abstract: The scientific community increasingly relies on machine learning (ML) for near-sensor processing, leve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation

arXiv:2502.00262v4 Announce Type: replace-cross Abstract: Autonomous driving systems face significant challenges in handling unpredictable edge-case scenarios,

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 10h ago

Biogeochemistry-Informed Neural Network (BINN) for Improving Accuracy of Model Prediction and Scientific Understanding of Soil Organic Carbon

arXiv:2502.00672v3 Announce Type: replace-cross Abstract: The increasing availability of large-scale observational data and the rapid development of artificial

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 10h ago

Hierarchical and Multimodal Data for Daily Activity Understanding

arXiv:2504.17696v4 Announce Type: replace-cross Abstract: Daily Activity Recordings for Artificial Intelligence (DARai, pronounced "Dahr-ree") is a multimodal,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation

arXiv:2505.20353v3 Announce Type: replace-cross Abstract: Diffusion Transformers (DiT) are powerful generative models but remain computationally intensive due t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

StreamDiT: Real-Time Streaming Text-to-Video Generation

arXiv:2507.03745v4 Announce Type: replace-cross Abstract: Recently, great progress has been achieved in text-to-video (T2V) generation by scaling transformer-ba

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

PepThink-R1: LLM for Interpretable Cyclic Peptide Optimization with CoT SFT and Reinforcement Learning

arXiv:2508.14765v3 Announce Type: replace-cross Abstract: Designing therapeutic peptides with tailored properties is hindered by the vastness of sequence space,

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 10h ago

ExtrinSplat: Decoupling Geometry and Semantics for Open-Vocabulary Understanding in 3D Gaussian Splatting

arXiv:2509.22225v2 Announce Type: replace-cross Abstract: Lifting 2D open-vocabulary understanding into 3D Gaussian Splatting (3DGS) scenes is a critical challe

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 10h ago

GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings

arXiv:2510.01448v2 Announce Type: replace-cross Abstract: Worldwide visual geo-localization aims to determine the geographic location of an image anywhere on Ea

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

Attention-Aligned Reasoning for Large Language Models

arXiv:2510.03223v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) tend to generate a long reasoning chain when solving complex tasks. Howev

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 10h ago

Multi-Dimensional Autoscaling of Stream Processing Services on Edge Devices

arXiv:2510.06882v2 Announce Type: replace-cross Abstract: Edge devices have limited resources, which inevitably leads to situations where stream processing serv

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding

arXiv:2511.00810v3 Announce Type: replace-cross Abstract: Graphical user interface (GUI) grounding is a key capability for computer-use agents, mapping natural-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

Route Experts by Sequence, not by Token

arXiv:2511.06494v2 Announce Type: replace-cross Abstract: Mixture-of-Experts (MoE) architectures scale large language models (LLMs) by activating only a subset

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 10h ago

Binary Verification for Zero-Shot Vision

arXiv:2511.10983v2 Announce Type: replace-cross Abstract: We propose a training-free, binary verification workflow for zero-shot vision with off-the-shelf VLMs.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

Any4D: Open-Prompt 4D Generation from Natural Language and Images

arXiv:2511.18746v2 Announce Type: replace-cross Abstract: While video-generation-based embodied world models have gained increasing attention, their reliance on

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

Aligning LLMs with Biomedical Knowledge using Balanced Fine-Tuning

arXiv:2511.21075v2 Announce Type: replace-cross Abstract: Aligning Large Language Models (LLMs) with biomedical knowledge requires understanding both concepts a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 10h ago

StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos

arXiv:2512.01707v2 Announce Type: replace-cross Abstract: Streaming video understanding requires models not only to process temporally incoming frames, but also

📰 ArXiv cs.AI