📰 ArXiv cs.AI

Articles from ArXiv cs.AI · 2,044 articles · Updated every 3 hours

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface
arXiv:2603.22519v1 Announce Type: cross Abstract: Textual Large Language Models (LLMs) provide a simple and familiar interface: a string of text is used for bot
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs
arXiv:2603.22528v1 Announce Type: cross Abstract: Large Language Models (LLMs) combined with Retrieval-Augmented Generation (RAG) and knowledge graphs offer new
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
arXiv:2603.22529v1 Announce Type: cross Abstract: Multimodal AI agents are increasingly automating complex real-world workflows that involve online web executio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving
arXiv:2603.22577v1 Announce Type: cross Abstract: Large Language Models (LLMs) have demonstrated potential in code generation, yet they struggle with the multi-
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?
arXiv:2603.22582v1 Announce Type: cross Abstract: Chain-of-thought (CoT) reasoning has been proposed as a transparency mechanism for large language models in sa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
flexvec: SQL Vector Retrieval with Programmatic Embedding Modulation
arXiv:2603.22587v1 Announce Type: cross Abstract: As AI agents become the primary consumers of retrieval APIs, there is an opportunity to expose more of the ret
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Language Models Can Explain Visual Features via Steering
arXiv:2603.22593v1 Announce Type: cross Abstract: Sparse Autoencoders uncover thousands of features in vision models, yet explaining these features without requ
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Do Consumers Accept AIs as Moral Compliance Agents?
arXiv:2603.22617v1 Announce Type: cross Abstract: Consumers are generally resistant to Artificial Intelligence (AI) involvement in moral decision-making, percei
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
Causal Discovery in Action: Learning Chain-Reaction Mechanisms from Interventions
arXiv:2603.22620v1 Announce Type: cross Abstract: Causal discovery is challenging in general dynamical systems because, without strong structural assumptions, t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
To Agree or To Be Right? The Grounding-Sycophancy Tradeoff in Medical Vision-Language Models
arXiv:2603.22623v1 Announce Type: cross Abstract: Vision-language models (VLMs) adapted to the medical domain have shown strong performance on visual question a
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
Toward Faithful Segmentation Attribution via Benchmarking and Dual-Evidence Fusion
arXiv:2603.22624v1 Announce Type: cross Abstract: Attribution maps for semantic segmentation are almost always judged by visual plausibility. Yet looking convin
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
LGSE: Lexically Grounded Subword Embedding Initialization for Low-Resource Language Adaptation
arXiv:2603.22629v1 Announce Type: cross Abstract: Adapting pretrained language models to low-resource, morphologically rich languages remains a significant chal
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
Learning to Trust: How Humans Mentally Recalibrate AI Confidence Signals
arXiv:2603.22634v1 Announce Type: cross Abstract: Productive human-AI collaboration requires appropriate reliance, yet contemporary AI systems are often miscali
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research
arXiv:2603.22648v1 Announce Type: cross Abstract: There are different goals for literature research, from understanding an unfamiliar topic to generate hypothes
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
Generalizing Dynamics Modeling More Easily from Representation Perspective
arXiv:2603.22655v1 Announce Type: cross Abstract: Learning system dynamics from observations is a critical problem in many applications over various real-world
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
Vision-based Deep Learning Analysis of Unordered Biomedical Tabular Datasets via Optimal Spatial Cartography
arXiv:2603.22675v1 Announce Type: cross Abstract: Tabular data are central to biomedical research, from liquid biopsy and bulk and single-cell transcriptomics t
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
WiFi2Cap: Semantic Action Captioning from Wi-Fi CSI via Limb-Level Semantic Alignment
arXiv:2603.22690v1 Announce Type: cross Abstract: Privacy-preserving semantic understanding of human activities is important for indoor sensing, yet existing Wi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
PopResume: Causal Fairness Evaluation of LLM/VLM Resume Screeners with Population-Representative Dataset
arXiv:2603.22714v1 Announce Type: cross Abstract: We present PopResume, a population-representative resume dataset for causal fairness auditing of LLM- and VLM-
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
KALAVAI: Predicting When Independent Specialist Fusion Works -- A Quantitative Model for Post-Hoc Cooperative LLM Training
arXiv:2603.22755v1 Announce Type: cross Abstract: Independently trained domain specialists can be fused post-hoc into a single model that outperforms any indivi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona
arXiv:2603.22765v1 Announce Type: cross Abstract: Data scarcity remains a persistent challenge in low-resource domains. While existing data augmentation methods
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
From Overload to Convergence: Supporting Multi-Issue Human-AI Negotiation with Bayesian Visualization
arXiv:2603.22766v1 Announce Type: cross Abstract: As AI systems increasingly mediate negotiations, understanding how the number of negotiated issues impacts hum
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
From Arithmetic to Logic: The Resilience of Logic and Lookup-Based Neural Networks Under Parameter Bit-Flips
arXiv:2603.22770v1 Announce Type: cross Abstract: The deployment of deep neural networks (DNNs) in safety-critical edge environments necessitates robustness aga
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1w ago
KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao
arXiv:2603.22779v1 Announce Type: cross Abstract: Large Language Models (LLMs) are equipped with profound semantic knowledge, making them a natural choice for i
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 1w ago
Exposure-Normalized Bed and Chair Fall Rates via Continuous AI Monitoring
arXiv:2603.22785v1 Announce Type: cross Abstract: This retrospective cohort study used continuous AI monitoring to estimate fall rates by exposure time rather t