Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,326 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Sketch2Simulation: Automating Flowsheet Generation via Multi Agent Large Language Models
arXiv:2603.24629v1 Announce Type: cross Abstract: Converting process sketches into executable simulation models remains a major bottleneck in process systems en
TRAJEVAL: Decomposing Code Agent Trajectories for Fine-Grained Diagnosis
arXiv:2603.24631v1 Announce Type: cross Abstract: Code agents can autonomously resolve GitHub issues, yet when they fail, current evaluation provides no visibil
Dual-Graph Multi-Agent Reinforcement Learning for Handover Optimization
arXiv:2603.24634v1 Announce Type: cross Abstract: HandOver (HO) control in cellular networks is governed by a set of HO control parameters that are traditionall
DyMRL: Dynamic Multispace Representation Learning for Multimodal Event Forecasting in Knowledge Graph
arXiv:2603.24636v1 Announce Type: cross Abstract: Accurate representation of multimodal knowledge is crucial for event forecasting in real-world scenarios. Howe
Experiential Reflective Learning for Self-Improving LLM Agents
arXiv:2603.24639v1 Announce Type: cross Abstract: Recent advances in large language models (LLMs) have enabled the development of autonomous agents capable of c
Scalable Object Relation Encoding for Better 3D Spatial Reasoning in Large Language Models
arXiv:2603.24721v1 Announce Type: cross Abstract: Spatial reasoning focuses on locating target objects based on spatial relations in 3D scenes, which plays a cr
Decentralized Task Scheduling in Distributed Systems: A Deep Reinforcement Learning Approach
arXiv:2603.24738v1 Announce Type: cross Abstract: Efficient task scheduling in large-scale distributed systems presents significant challenges due to dynamic wo
Grokking as a Falsifiable Finite-Size Transition
arXiv:2603.24746v1 Announce Type: cross Abstract: Grokking -- the delayed onset of generalization after early memorization -- is often described with phase-tran
Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset
arXiv:2603.24772v1 Announce Type: cross Abstract: Clinical documentation is a critical factor for patient safety, diagnosis, and continuity of care. The adminis
From Untestable to Testable: Metamorphic Testing in the Age of LLMs
arXiv:2603.24774v1 Announce Type: cross Abstract: This article discusses the challenges of testing software systems with increasingly integrated AI and LLM func
Dissecting Model Failures in Abdominal Aortic Aneurysm Segmentation through Explainability-Driven Analysis
arXiv:2603.24801v1 Announce Type: cross Abstract: Computed tomography image segmentation of complex abdominal aortic aneurysms (AAA) often fails because the mod
GoldiCLIP: The Goldilocks Approach for Balancing Explicit Supervision for Language-Image Pretraining
arXiv:2603.24804v1 Announce Type: cross Abstract: Until recently, the success of large-scale vision-language models (VLMs) has primarily relied on billion-sampl
FODMP: Fast One-Step Diffusion of Movement Primitives Generation for Time-Dependent Robot Actions
arXiv:2603.24806v1 Announce Type: cross Abstract: Diffusion models are increasingly used for robot learning, but current designs face a clear trade-off. Action-
Generative Adversarial Perturbations with Cross-paradigm Transferability on Localized Crowd Counting
arXiv:2603.24821v1 Announce Type: cross Abstract: State-of-the-art crowd counting and localization are primarily modeled using two paradigms: density maps and p
Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models
arXiv:2603.24844v1 Announce Type: cross Abstract: Given a question, a language model (LM) implicitly encodes a distribution over possible answers. In practice,
NeuroVLM-Bench: Evaluation of Vision-Enabled Large Language Models for Clinical Reasoning in Neurological Disorders
arXiv:2603.24846v1 Announce Type: cross Abstract: Recent advances in multimodal large language models enable new possibilities for image-based decision support.
AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective
arXiv:2603.24857v1 Announce Type: cross Abstract: As machine learning (ML) systems expand in both scale and functionality, the security landscape has become inc
More Than "Means to an End": Supporting Reasoning with Transparently Designed AI Data Science Processes
arXiv:2603.24877v1 Announce Type: cross Abstract: Generative artificial intelligence (AI) tools can now help people perform complex data science tasks regardles
Surrogates, Spikes, and Sparsity: Performance Analysis and Characterization of SNN Hyperparameters on Hardware
arXiv:2603.24891v1 Announce Type: cross Abstract: Spiking Neural Networks (SNNs) offer inherent advantages for low-power inference through sparse, event-driven
Sovereign AI at the Front Door of Care: A Physically Unidirectional Architecture for Secure Clinical Intelligence
arXiv:2603.24898v1 Announce Type: cross Abstract: We present a Sovereign AI architecture for clinical triage in which all inference is performed on-device and i
Shaping the Future of Mathematics in the Age of AI
arXiv:2603.24914v1 Announce Type: cross Abstract: Artificial intelligence is transforming mathematics at a speed and scale that demand active engagement from th
Evaluating adaptive and generative AI-based feedback and recommendations in a knowledge-graph-integrated programming learning system
arXiv:2603.24940v1 Announce Type: cross Abstract: This paper introduces the design and development of a framework that integrates a large language model (LLM) w
Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators
arXiv:2603.24986v1 Announce Type: cross Abstract: Large language model based health agents are increasingly used by health consumers and clinicians to interpret
Learning Rollout from Sampling: An R1-Style Tokenized Traffic Simulation Model
arXiv:2603.24989v1 Announce Type: cross Abstract: Learning diverse and high-fidelity traffic simulations from human driving demonstrations is crucial for autono
Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models
arXiv:2603.25015v1 Announce Type: cross Abstract: System prompt instructions that cooperate in English compete in Spanish, with the same semantic content, but o
Closing the Confidence-Faithfulness Gap in Large Language Models
arXiv:2603.25052v1 Announce Type: cross Abstract: Large language models (LLMs) tend to verbalize confidence scores that are largely detached from their actual a
The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities
arXiv:2603.25056v1 Announce Type: cross Abstract: System prompt configuration can make the difference between near-total phishing blindness and near-perfect det
TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visualization
arXiv:2603.25063v1 Announce Type: cross Abstract: Recent agentic systems demonstrate that large language models can generate scientific visualizations from natu
Pixelis: Reasoning in Pixels, from Seeing to Acting
arXiv:2603.25091v1 Announce Type: cross Abstract: Most vision-language systems are static observers: they describe pixels, do not act, and cannot safely improve
Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization
arXiv:2603.25099v1 Announce Type: cross Abstract: We present a framework in which a large language model (LLM) acts as an online adaptive controller for SIMP to
Layer-Specific Lipschitz Modulation for Fault-Tolerant Multimodal Representation Learning
arXiv:2603.25103v1 Announce Type: cross Abstract: Modern multimodal systems deployed in industrial and safety-critical environments must remain reliable under p
Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
arXiv:2603.25112v1 Announce Type: cross Abstract: Standard evaluation of LLM confidence relies on calibration metrics (ECE, Brier score) that conflate two disti
Reinforcement learning for quantum processes with memory
arXiv:2603.25138v1 Announce Type: cross Abstract: In reinforcement learning, an agent interacts sequentially with an environment to maximize a reward, receiving
SAVe: Self-Supervised Audio-visual Deepfake Detection Exploiting Visual Artifacts and Audio-visual Misalignment
arXiv:2603.25140v1 Announce Type: cross Abstract: Multimodal deepfakes can exhibit subtle visual artifacts and cross-modal inconsistencies, which remain challen
FD$^2$: A Dedicated Framework for Fine-Grained Dataset Distillation
arXiv:2603.25144v1 Announce Type: cross Abstract: Dataset distillation (DD) compresses a large training set into a small synthetic set, reducing storage and tra
Factors Influencing the Quality of AI-Generated Code: A Synthesis of Empirical Evidence
arXiv:2603.25146v1 Announce Type: cross Abstract: Context: The rapid adoption of AI-assisted code generation tools, such as large language models (LLMs), is tra
Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models
arXiv:2603.25155v1 Announce Type: cross Abstract: Multimodal large language models are promising for clinical visual question answering tasks, but scaling to 3D
PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented Generation Systems
arXiv:2603.25164v1 Announce Type: cross Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across a wide range of applications. How
Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model
arXiv:2603.25184v1 Announce Type: cross Abstract: Reinforcement learning (RL) has become essential for post-training large language models (LLMs) in reasoning t
Probing the Lack of Stable Internal Beliefs in LLMs
arXiv:2603.25187v1 Announce Type: cross Abstract: Persona-driven large language models (LLMs) require consistent behavioral tendencies across interactions to si
A Decade-Scale Benchmark Evaluating LLMs' Clinical Practice Guidelines Detection and Adherence in Multi-turn Conversations
arXiv:2603.25196v1 Announce Type: cross Abstract: Clinical practice guidelines (CPGs) play a pivotal role in ensuring evidence-based decision-making and improvi
Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction
arXiv:2603.25209v1 Announce Type: cross Abstract: Generating long videos using pre-trained video diffusion models, which are typically trained on short clips, p
A Wireless World Model for AI-Native 6G Networks
arXiv:2603.25216v1 Announce Type: cross Abstract: Integrating AI into the physical layer is a cornerstone of 6G networks. However, current data-driven approache
WebTestBench: Evaluating Computer-Use Agents towards End-to-End Automated Web Testing
arXiv:2603.25226v1 Announce Type: cross Abstract: The emergence of Large Language Models (LLMs) has catalyzed a paradigm shift in programming, giving rise to "v
FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA
arXiv:2603.25243v1 Announce Type: cross Abstract: Large language models and autonomous agents are increasingly explored for EDA automation, but many existing in
Activation Matters: Test-time Activated Negative Labels for OOD Detection with Vision-Language Models
arXiv:2603.25250v1 Announce Type: cross Abstract: Out-of-distribution (OOD) detection aims to identify samples that deviate from in-distribution (ID). One popul
MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Elucidation
arXiv:2603.25253v1 Announce Type: cross Abstract: Large language models (LLMs) hold considerable potential for advancing scientific discovery, yet systematic as
CRAFT: Grounded Multi-Agent Coordination Under Partial Information
arXiv:2603.25268v1 Announce Type: cross Abstract: We introduce CRAFT, a multi-agent benchmark for evaluating pragmatic communication in large language models un
DeepCamp AI