AI News — Latest Developments & Breakthroughs

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

UniCA: Unified Covariate Adaptation for Time Series Foundation Model

arXiv:2506.22039v2 Announce Type: replace-cross Abstract: Time Series Foundation Models (TSFMs) have achieved remarkable success through large-scale pretraining

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

RedTopic: Toward Topic-Diverse Red Teaming of Large Language Models

arXiv:2507.00026v2 Announce Type: replace-cross Abstract: As large language models (LLMs) are increasingly deployed as black-box components in real-world applic

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 6d ago

MS-DGCNN++: Multi-Scale Dynamic Graph Convolution with Scale-Dependent Normalization for Robust LiDAR Tree Species Classification

arXiv:2507.12602v2 Announce Type: replace-cross Abstract: Graph-based deep learning on LiDAR point clouds encodes geometry through edge features, yet standard i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Graph Structure Learning with Privacy Guarantees for Open Graph Data

arXiv:2507.19116v3 Announce Type: replace-cross Abstract: Publishing open graph data while preserving individual privacy remains challenging when data publisher

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

From Product Hilbert Spaces to the Generalized Koopman Operator and the Nonlinear Fundamental Lemma

arXiv:2508.07494v2 Announce Type: replace-cross Abstract: The generalization of the Koopman operator to systems with control input and the derivation of a nonli

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

From Context to Intent: Reasoning-Guided Function-Level Code Completion

arXiv:2508.09537v2 Announce Type: replace-cross Abstract: The growing capabilities of Large Language Models (LLMs) have led to their widespread adoption for fun

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

From Noisy Labels to Intrinsic Structure: A Geometric-Structural Dual-Guided Framework for Noise-Robust Medical Image Segmentation

arXiv:2509.02419v2 Announce Type: replace-cross Abstract: The effectiveness of convolutional neural networks in medical image segmentation relies on large-scale

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

From Editor to Dense Geometry Estimator

arXiv:2509.04338v2 Announce Type: replace-cross Abstract: Leveraging visual priors from pre-trained text-to-image (T2I) generative models has shown success in d

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

DreamAudio: Customized Text-to-Audio Generation with Diffusion Models

arXiv:2509.06027v2 Announce Type: replace-cross Abstract: With the development of large-scale diffusion-based and language-modeling-based generative models, imp

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Selective Classifier-free Guidance for Zero-shot Text-to-speech

arXiv:2509.19668v2 Announce Type: replace-cross Abstract: In zero-shot text-to-speech, achieving a balance between fidelity to the target speaker and adherence

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

MARS: toward more efficient multi-agent collaboration for LLM reasoning

arXiv:2509.20502v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have achieved impressive results in natural language understanding, yet t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

VL-KnG: Persistent Spatiotemporal Knowledge Graphs from Egocentric Video for Embodied Scene Understanding

arXiv:2510.01483v2 Announce Type: replace-cross Abstract: Vision-language models (VLMs) demonstrate strong image-level scene understanding but often lack persis

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Generating Findings for Jaw Cysts in Dental Panoramic Radiographs Using a GPT-Based VLM: A Preliminary Study on Building a Two-Stage Self-Correction Loop with Structured Output (SLSO) Framework

arXiv:2510.02001v4 Announce Type: replace-cross Abstract: Vision-language models (VLMs) such as GPT (Generative Pre-Trained Transformer) have shown potential fo

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Counterfactual Identifiability via Dynamic Optimal Transport

arXiv:2510.08294v2 Announce Type: replace-cross Abstract: We address the open question of counterfactual identification for high-dimensional multivariate outcom

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 6d ago

Happiness is Sharing a Vocabulary: A Study of Transliteration Methods

arXiv:2510.10827v2 Announce Type: replace-cross Abstract: Transliteration has emerged as a promising means to bridge the gap between various languages in multil

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents

arXiv:2510.14967v2 Announce Type: replace-cross Abstract: Large language model (LLM)-based agents are increasingly trained with reinforcement learning (RL) to e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

arXiv:2510.15994v2 Announce Type: replace-cross Abstract: The Model Context Protocol (MCP) standardizes how large language model (LLM) agents discover, describe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

GUIrilla: A Scalable Framework for Automated Desktop UI Exploration

arXiv:2510.16051v2 Announce Type: replace-cross Abstract: The performance and generalization of foundation models for interactive systems critically depend on t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Gaze-VLM:Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding

arXiv:2510.21356v2 Announce Type: replace-cross Abstract: Eye gaze offers valuable cues about attention, short-term intent, and future actions, making it a powe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Quantifying Systemic Vulnerability in the Foundation Model Industry

arXiv:2510.23421v2 Announce Type: replace-cross Abstract: The foundation model industry exhibits unprecedented concentration in critical inputs: semiconductors,

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 6d ago

Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench

arXiv:2510.26865v2 Announce Type: replace-cross Abstract: Reading measurement instruments is effortless for humans and requires relatively little domain experti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs

arXiv:2511.05919v3 Announce Type: replace-cross Abstract: LLMs are now an integral part of information retrieval. As such, their role as question answering chat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 6d ago

MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding

arXiv:2511.12449v2 Announce Type: replace-cross Abstract: Recent Multimodal Large Language Models (MLLMs) have significantly advanced e-commerce product underst

ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 6d ago

Pedestrian Crossing Intention Prediction Using Multimodal Fusion Network

arXiv:2511.20008v2 Announce Type: replace-cross Abstract: Pedestrian crossing intention prediction is essential for the deployment of autonomous vehicles (AVs)

📰 ArXiv cs.AI