📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 3,204 articles · Updated every 3 hours · View all reads

arXiv:2604.09554v1 Announce Type: new Abstract: Optimism for accelerating scientific discovery with AI continues to grow. Current applications of AI in scientif

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1h ago

Linear Programming for Multi-Criteria Assessment with Cardinal and Ordinal Data: A Pessimistic Virtual Gap Analysis

arXiv:2604.09555v1 Announce Type: new Abstract: Multi-criteria Analysis (MCA) is used to rank alternatives based on various criteria. Key MCA methods, such as M

ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1h ago

Seven simple steps for log analysis in AI systems

arXiv:2604.09563v1 Announce Type: new Abstract: AI systems produce large volumes of logs as they interact with tools and users. Analysing these logs can help un

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization

arXiv:2604.09574v1 Announce Type: new Abstract: The rise of autonomous GUI agents has triggered adversarial countermeasures from digital platforms, yet existing

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1h ago

AHC: Meta-Learned Adaptive Compression for Continual Object Detection on Memory-Constrained Microcontrollers

arXiv:2604.09576v1 Announce Type: new Abstract: Deploying continual object detection on microcontrollers (MCUs) with under 100KB memory requires efficient featu

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

Explainable Planning for Hybrid Systems

arXiv:2604.09578v1 Announce Type: new Abstract: The recent advancement in artificial intelligence (AI) technologies facilitates a paradigm shift toward automati

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

Help Without Being Asked: A Deployed Proactive Agent System for On-Call Support with Continuous Self-Improvement

arXiv:2604.09579v1 Announce Type: new Abstract: In large-scale cloud service platforms, thousands of customer tickets are generated daily and are typically hand

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1h ago

OOWM: Structuring Embodied Reasoning and Planning via Object-Oriented Programmatic World Modeling

arXiv:2604.09580v1 Announce Type: new Abstract: Standard Chain-of-Thought (CoT) prompting empowers Large Language Models (LLMs) with reasoning capabilities, yet

ArXiv cs.AI 🖌️ UI/UX Design 📄 Paper ⚡ AI Lesson 1h ago

OpeFlo: Automated UX Evaluation via Simulated Human Web Interaction with GUI Grounding

arXiv:2604.09581v1 Announce Type: new Abstract: Evaluating web usability typically requires time-consuming user studies and expert reviews, which often limits i

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1h ago

Factorizing formal contexts from closures of necessity operators

arXiv:2604.09582v1 Announce Type: new Abstract: Factorizing datasets is an interesting process in a multitude of approaches, but many times it is not possible o

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

Agentic Exploration of PDE Spaces using Latent Foundation Models for Parameterized Simulations

arXiv:2604.09584v1 Announce Type: new Abstract: Flow physics and more broadly physical phenomena governed by partial differential equations (PDEs), are inherent

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

MobiFlow: Real-World Mobile Agent Benchmarking through Trajectory Fusion

arXiv:2604.09587v1 Announce Type: new Abstract: Mobile agents can autonomously complete user-assigned tasks through GUI interactions. However, existing mainstre

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

Persistent Identity in AI Agents: A Multi-Anchor Architecture for Resilient Memory and Continuity

arXiv:2604.09588v1 Announce Type: new Abstract: Modern AI agents suffer from a fundamental identity problem: when context windows overflow and conversation hist

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

DeepReviewer 2.0: A Traceable Agentic System for Auditable Scientific Peer Review

arXiv:2604.09590v1 Announce Type: new Abstract: Automated peer review is often framed as generating fluent critique, yet reviewers and area chairs need judgment

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

Spatial Competence Benchmark

arXiv:2604.09594v1 Announce Type: new Abstract: Spatial competence is the quality of maintaining a consistent internal representation of an environment and usin

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

DERM-3R: A Resource-Efficient Multimodal Agents Framework for Dermatologic Diagnosis and Treatment in Real-World Clinical Settings

arXiv:2604.09596v1 Announce Type: new Abstract: Dermatologic diseases impose a large and growing global burden, affecting billions and substantially reducing qu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1h ago

CID-TKG: Collaborative Historical Invariance and Evolutionary Dynamics Learning for Temporal Knowledge Graph Reasoning

arXiv:2604.09600v1 Announce Type: new Abstract: Temporal knowledge graph (TKG) reasoning aims to infer future facts at unseen timestamps from temporally evolvin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1h ago

Hubble: An LLM-Driven Agentic Framework for Safe and Automated Alpha Factor Discovery

arXiv:2604.09601v1 Announce Type: new Abstract: Discovering predictive alpha factors in quantitative finance remains a formidable challenge due to the vast comb

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1h ago

LLMs for Text-Based Exploration and Navigation Under Partial Observability

arXiv:2604.09604v1 Announce Type: new Abstract: Exploration and goal-directed navigation in unknown layouts are central to inspection, logistics, and search-and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1h ago

Evaluating Reliability Gaps in Large Language Model Safety via Repeated Prompt Sampling

arXiv:2604.09606v1 Announce Type: new Abstract: Traditional benchmarks for large language models (LLMs), such as HELM and AIR-BENCH, primarily assess safety ris

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

Unifying Ontology Construction and Semantic Alignment for Deterministic Enterprise Reasoning at Scale

arXiv:2604.09608v1 Announce Type: new Abstract: While enterprises amass vast quantities of data, much of it remains chaotic and effectively dormant, preventing

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1h ago

General-purpose LLMs as Models of Human Driver Behavior: The Case of Simplified Merging

arXiv:2604.09609v1 Announce Type: new Abstract: Human behavior models are essential as behavior references and for simulating human agents in virtual safety ass

ArXiv cs.AI 🤖 AI Agents & Automation 📄 Paper ⚡ AI Lesson 1h ago

Beyond Theory of Mind in Robotics

arXiv:2604.09612v1 Announce Type: new Abstract: Theory of Mind, the capacity to explain and predict behavior by inferring hidden mental states, has become the d

ArXiv cs.AI 📐 ML Fundamentals 📄 Paper ⚡ AI Lesson 1h ago

The Geometry of Knowing: From Possibilistic Ignorance to Probabilistic Certainty -- A Measure-Theoretic Framework for Epistemic Convergence

arXiv:2604.09614v1 Announce Type: new Abstract: This paper develops a measure-theoretic framework establishing when and how a possibilistic representation of in