6,209 articles

📰 AI News

6,209 articles · Updated every 3 hours

All ⚡ AI Lessons (4931) ArXiv cs.AIOpenAI NewsHugging Face BlogForbes InnovationDev.to AIWeaviate Blog
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
arXiv:2405.00181v3 Announce Type: replace-cross Abstract: Video anomaly understanding (VAU) aims to automatically comprehend unusual occurrences in videos, ther
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 15h ago
Complexity-Aware Deep Symbolic Regression with Robust Risk-Seeking Policy Gradients
arXiv:2406.06751v3 Announce Type: replace-cross Abstract: We propose a novel deep symbolic regression approach to enhance the robustness and interpretability of
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
CGRA4ML: A Hardware/Software Framework to Implement Neural Networks for Scientific Edge Computing
arXiv:2408.15561v4 Announce Type: replace-cross Abstract: The scientific community increasingly relies on machine learning (ML) for near-sensor processing, leve
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation
arXiv:2502.00262v4 Announce Type: replace-cross Abstract: Autonomous driving systems face significant challenges in handling unpredictable edge-case scenarios,
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 15h ago
Biogeochemistry-Informed Neural Network (BINN) for Improving Accuracy of Model Prediction and Scientific Understanding of Soil Organic Carbon
arXiv:2502.00672v3 Announce Type: replace-cross Abstract: The increasing availability of large-scale observational data and the rapid development of artificial
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 15h ago
Hierarchical and Multimodal Data for Daily Activity Understanding
arXiv:2504.17696v4 Announce Type: replace-cross Abstract: Daily Activity Recordings for Artificial Intelligence (DARai, pronounced "Dahr-ree") is a multimodal,
ArXiv cs.AI 📄 Paper 15h ago
The Accountability Paradox: How Platform API Restrictions Undermine AI Transparency Mandates
arXiv:2505.11577v3 Announce Type: replace-cross Abstract: Recent application programming interface (API) restrictions on major social media platforms challenge
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation
arXiv:2505.20353v3 Announce Type: replace-cross Abstract: Diffusion Transformers (DiT) are powerful generative models but remain computationally intensive due t
ArXiv cs.AI 📄 Paper 15h ago
Evidence-based diagnostic reasoning with multi-agent copilot for human pathology
arXiv:2506.20964v2 Announce Type: replace-cross Abstract: Pathology is experiencing rapid digital transformation driven by whole-slide imaging and artificial in
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
StreamDiT: Real-Time Streaming Text-to-Video Generation
arXiv:2507.03745v4 Announce Type: replace-cross Abstract: Recently, great progress has been achieved in text-to-video (T2V) generation by scaling transformer-ba
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
PepThink-R1: LLM for Interpretable Cyclic Peptide Optimization with CoT SFT and Reinforcement Learning
arXiv:2508.14765v3 Announce Type: replace-cross Abstract: Designing therapeutic peptides with tailored properties is hindered by the vastness of sequence space,
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 15h ago
ExtrinSplat: Decoupling Geometry and Semantics for Open-Vocabulary Understanding in 3D Gaussian Splatting
arXiv:2509.22225v2 Announce Type: replace-cross Abstract: Lifting 2D open-vocabulary understanding into 3D Gaussian Splatting (3DGS) scenes is a critical challe
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 15h ago
GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings
arXiv:2510.01448v2 Announce Type: replace-cross Abstract: Worldwide visual geo-localization aims to determine the geographic location of an image anywhere on Ea
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
Attention-Aligned Reasoning for Large Language Models
arXiv:2510.03223v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) tend to generate a long reasoning chain when solving complex tasks. Howev
ArXiv cs.AI 📄 Paper ⚡ AI Lesson 15h ago
Multi-Dimensional Autoscaling of Stream Processing Services on Edge Devices
arXiv:2510.06882v2 Announce Type: replace-cross Abstract: Edge devices have limited resources, which inevitably leads to situations where stream processing serv
ArXiv cs.AI 📄 Paper 15h ago
Gelina: Unified Speech and Gesture Synthesis via Interleaved Token Prediction
arXiv:2510.12834v3 Announce Type: replace-cross Abstract: Human communication is multimodal, with speech and gestures tightly coupled, yet most computational me
ArXiv cs.AI 📄 Paper 15h ago
Generating the Modal Worker: A Cross-Model Audit of Race and Gender in LLM-Generated Personas Across 41 Occupations
arXiv:2510.21011v2 Announce Type: replace-cross Abstract: As generative AI tools are increasingly used to portray people in professional roles, understanding th
ArXiv cs.AI 📄 Paper 15h ago
Compositional Image Synthesis with Inference-Time Scaling
arXiv:2510.24133v2 Announce Type: replace-cross Abstract: Despite their impressive realism, modern text-to-image models still struggle with compositionality, of
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding
arXiv:2511.00810v3 Announce Type: replace-cross Abstract: Graphical user interface (GUI) grounding is a key capability for computer-use agents, mapping natural-
ArXiv cs.AI 📄 Paper 15h ago
Causal Graph Neural Networks for Healthcare
arXiv:2511.02531v5 Announce Type: replace-cross Abstract: Healthcare artificial intelligence systems often degrade in performance when deployed across instituti
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
Route Experts by Sequence, not by Token
arXiv:2511.06494v2 Announce Type: replace-cross Abstract: Mixture-of-Experts (MoE) architectures scale large language models (LLMs) by activating only a subset
ArXiv cs.AI 👁️ Computer Vision 📄 Paper ⚡ AI Lesson 15h ago
Binary Verification for Zero-Shot Vision
arXiv:2511.10983v2 Announce Type: replace-cross Abstract: We propose a training-free, binary verification workflow for zero-shot vision with off-the-shelf VLMs.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
Any4D: Open-Prompt 4D Generation from Natural Language and Images
arXiv:2511.18746v2 Announce Type: replace-cross Abstract: While video-generation-based embodied world models have gained increasing attention, their reliance on
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 15h ago
Aligning LLMs with Biomedical Knowledge using Balanced Fine-Tuning
arXiv:2511.21075v2 Announce Type: replace-cross Abstract: Aligning Large Language Models (LLMs) with biomedical knowledge requires understanding both concepts a