📰 ArXiv cs.AI
Articles from ArXiv cs.AI · 3,204 articles · Updated every 3 hours · View all reads
All
⚡ AI Lessons (10765)
ArXiv cs.AIDev.to · FORUM WEBDev.to AIForbes InnovationOpenAI NewsHugging Face Blog
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
LABBench2: An Improved Benchmark for AI Systems Performing Biology Research
arXiv:2604.09554v1 Announce Type: new Abstract: Optimism for accelerating scientific discovery with AI continues to grow. Current applications of AI in scientif
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1h ago
Linear Programming for Multi-Criteria Assessment with Cardinal and Ordinal Data: A Pessimistic Virtual Gap Analysis
arXiv:2604.09555v1 Announce Type: new Abstract: Multi-criteria Analysis (MCA) is used to rank alternatives based on various criteria. Key MCA methods, such as M
ArXiv cs.AI
📄 Paper
⚡ AI Lesson
1h ago
Seven simple steps for log analysis in AI systems
arXiv:2604.09563v1 Announce Type: new Abstract: AI systems produce large volumes of logs as they interact with tools and users. Analysing these logs can help un
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization
arXiv:2604.09574v1 Announce Type: new Abstract: The rise of autonomous GUI agents has triggered adversarial countermeasures from digital platforms, yet existing
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1h ago
AHC: Meta-Learned Adaptive Compression for Continual Object Detection on Memory-Constrained Microcontrollers
arXiv:2604.09576v1 Announce Type: new Abstract: Deploying continual object detection on microcontrollers (MCUs) with under 100KB memory requires efficient featu
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
Explainable Planning for Hybrid Systems
arXiv:2604.09578v1 Announce Type: new Abstract: The recent advancement in artificial intelligence (AI) technologies facilitates a paradigm shift toward automati
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
Help Without Being Asked: A Deployed Proactive Agent System for On-Call Support with Continuous Self-Improvement
arXiv:2604.09579v1 Announce Type: new Abstract: In large-scale cloud service platforms, thousands of customer tickets are generated daily and are typically hand
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1h ago
OOWM: Structuring Embodied Reasoning and Planning via Object-Oriented Programmatic World Modeling
arXiv:2604.09580v1 Announce Type: new Abstract: Standard Chain-of-Thought (CoT) prompting empowers Large Language Models (LLMs) with reasoning capabilities, yet
ArXiv cs.AI
🖌️ UI/UX Design
📄 Paper
⚡ AI Lesson
1h ago
OpeFlo: Automated UX Evaluation via Simulated Human Web Interaction with GUI Grounding
arXiv:2604.09581v1 Announce Type: new Abstract: Evaluating web usability typically requires time-consuming user studies and expert reviews, which often limits i
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1h ago
Factorizing formal contexts from closures of necessity operators
arXiv:2604.09582v1 Announce Type: new Abstract: Factorizing datasets is an interesting process in a multitude of approaches, but many times it is not possible o
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
Agentic Exploration of PDE Spaces using Latent Foundation Models for Parameterized Simulations
arXiv:2604.09584v1 Announce Type: new Abstract: Flow physics and more broadly physical phenomena governed by partial differential equations (PDEs), are inherent
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
MobiFlow: Real-World Mobile Agent Benchmarking through Trajectory Fusion
arXiv:2604.09587v1 Announce Type: new Abstract: Mobile agents can autonomously complete user-assigned tasks through GUI interactions. However, existing mainstre
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
Persistent Identity in AI Agents: A Multi-Anchor Architecture for Resilient Memory and Continuity
arXiv:2604.09588v1 Announce Type: new Abstract: Modern AI agents suffer from a fundamental identity problem: when context windows overflow and conversation hist
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
DeepReviewer 2.0: A Traceable Agentic System for Auditable Scientific Peer Review
arXiv:2604.09590v1 Announce Type: new Abstract: Automated peer review is often framed as generating fluent critique, yet reviewers and area chairs need judgment
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
Spatial Competence Benchmark
arXiv:2604.09594v1 Announce Type: new Abstract: Spatial competence is the quality of maintaining a consistent internal representation of an environment and usin
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
DERM-3R: A Resource-Efficient Multimodal Agents Framework for Dermatologic Diagnosis and Treatment in Real-World Clinical Settings
arXiv:2604.09596v1 Announce Type: new Abstract: Dermatologic diseases impose a large and growing global burden, affecting billions and substantially reducing qu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1h ago
CID-TKG: Collaborative Historical Invariance and Evolutionary Dynamics Learning for Temporal Knowledge Graph Reasoning
arXiv:2604.09600v1 Announce Type: new Abstract: Temporal knowledge graph (TKG) reasoning aims to infer future facts at unseen timestamps from temporally evolvin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1h ago
Hubble: An LLM-Driven Agentic Framework for Safe and Automated Alpha Factor Discovery
arXiv:2604.09601v1 Announce Type: new Abstract: Discovering predictive alpha factors in quantitative finance remains a formidable challenge due to the vast comb
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1h ago
LLMs for Text-Based Exploration and Navigation Under Partial Observability
arXiv:2604.09604v1 Announce Type: new Abstract: Exploration and goal-directed navigation in unknown layouts are central to inspection, logistics, and search-and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1h ago
Evaluating Reliability Gaps in Large Language Model Safety via Repeated Prompt Sampling
arXiv:2604.09606v1 Announce Type: new Abstract: Traditional benchmarks for large language models (LLMs), such as HELM and AIR-BENCH, primarily assess safety ris
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
Unifying Ontology Construction and Semantic Alignment for Deterministic Enterprise Reasoning at Scale
arXiv:2604.09608v1 Announce Type: new Abstract: While enterprises amass vast quantities of data, much of it remains chaotic and effectively dormant, preventing
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1h ago
General-purpose LLMs as Models of Human Driver Behavior: The Case of Simplified Merging
arXiv:2604.09609v1 Announce Type: new Abstract: Human behavior models are essential as behavior references and for simulating human agents in virtual safety ass
ArXiv cs.AI
🤖 AI Agents & Automation
📄 Paper
⚡ AI Lesson
1h ago
Beyond Theory of Mind in Robotics
arXiv:2604.09612v1 Announce Type: new Abstract: Theory of Mind, the capacity to explain and predict behavior by inferring hidden mental states, has become the d
ArXiv cs.AI
📐 ML Fundamentals
📄 Paper
⚡ AI Lesson
1h ago
The Geometry of Knowing: From Possibilistic Ignorance to Probabilistic Certainty -- A Measure-Theoretic Framework for Epistemic Convergence
arXiv:2604.09614v1 Announce Type: new Abstract: This paper develops a measure-theoretic framework establishing when and how a possibilistic representation of in
DeepCamp AI