Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,087 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents
arXiv:2604.04157v1 Announce Type: new Abstract: Theory of Mind (ToM) -- the ability to model others' mental states -- is fundamental to human social cognition.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
A Model of Understanding in Deep Learning Systems
arXiv:2604.04171v1 Announce Type: new Abstract: I propose a model of systematic understanding, suitable for machine learning systems. On this account, an agent
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
CoALFake: Collaborative Active Learning with Human-LLM Co-Annotation for Cross-Domain Fake News Detection
arXiv:2604.04174v1 Announce Type: new Abstract: The proliferation of fake news across diverse domains highlights critical limitations in current detection syste
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Comparative reversal learning reveals rigid adaptation in LLMs under non-stationary uncertainty
arXiv:2604.04182v1 Announce Type: new Abstract: Non-stationary environments require agents to revise previously learned action values when contingencies change.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Schema-Aware Planning and Hybrid Knowledge Toolset for Reliable Knowledge Graph Triple Verification
arXiv:2604.04190v1 Announce Type: new Abstract: Knowledge Graphs (KGs) serve as a critical foundation for AI systems, yet their automated construction inevitabl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Don't Blink: Evidence Collapse during Multimodal Reasoning
arXiv:2604.04207v1 Announce Type: new Abstract: Reasoning VLMs can become more accurate while progressively losing visual grounding as they think. This creates
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
TimeSeek: Temporal Reliability of Agentic Forecasters
arXiv:2604.04220v1 Announce Type: new Abstract: We introduce TimeSeek, a benchmark for studying how the reliability of agentic LLM forecasters changes over a pr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Pedagogical Safety in Educational Reinforcement Learning: Formalizing and Detecting Reward Hacking in AI Tutoring Systems
arXiv:2604.04237v1 Announce Type: new Abstract: Reinforcement learning (RL) is increasingly used to personalize instruction in intelligent tutoring systems, yet
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Combee: Scaling Prompt Learning for Self-Improving Language Model Agents
arXiv:2604.04247v1 Announce Type: new Abstract: Recent advances in prompt learning allow large language model agents to acquire task-relevant knowledge from inf
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Context Engineering: A Practitioner Methodology for Structured Human-AI Collaboration
arXiv:2604.04258v1 Announce Type: new Abstract: The quality of AI-generated output is often attributed to prompting technique, but extensive empirical observati
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
InferenceEvolve: Towards Automated Causal Effect Estimators through Self-Evolving AI
arXiv:2604.04274v1 Announce Type: new Abstract: Causal inference is central to scientific discovery, yet choosing appropriate methods remains challenging becaus
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Preservation Is Not Enough for Width Growth: Regime-Sensitive Selection of Dense LM Warm Starts
arXiv:2604.04281v1 Announce Type: new Abstract: Width expansion offers a practical route to reuse smaller causal-language-model checkpoints, but selecting a wid
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
PanLUNA: An Efficient and Robust Query-Unified Multimodal Model for Edge Biosignal Intelligence
arXiv:2604.04297v1 Announce Type: new Abstract: Physiological foundation models (FMs) have shown promise for biosignal representation learning, yet most remain
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
RESCORE: LLM-Driven Simulation Recovery in Control Systems Research Papers
arXiv:2604.04324v1 Announce Type: new Abstract: Reconstructing numerical simulations from control systems research papers is often hindered by underspecified pa
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Thermodynamic-Inspired Explainable GeoAI: Uncovering Regime-Dependent Mechanisms in Heterogeneous Spatial Systems
arXiv:2604.04339v1 Announce Type: new Abstract: Modeling spatial heterogeneity and associated critical transitions remains a fundamental challenge in geography
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Implementing surrogate goals for safer bargaining in LLM-based agents
arXiv:2604.04341v1 Announce Type: new Abstract: Surrogate goals have been proposed as a strategy for reducing risks from bargaining failures. A surrogate goal i
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Domain-Contextualized Inference: A Computable Graph Architecture for Explicit-Domain Reasoning
arXiv:2604.04344v1 Announce Type: new Abstract: We establish a computation-substrate-agnostic inference architecture in which domain is an explicit first-class
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
REAM: Merging Improves Pruning of Experts in LLMs
arXiv:2604.04356v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) large language models (LLMs) are among the top-performing architectures. The largest mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Decocted Experience Improves Test-Time Inference in LLM Agents
arXiv:2604.04373v1 Announce Type: new Abstract: There is growing interest in improving LLMs without updating model parameters. One well-established direction is
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Optimizing Service Operations via LLM-Powered Multi-Agent Simulation
arXiv:2604.04383v1 Announce Type: new Abstract: Service system performance depends on how participants respond to design choices, but modeling these responses i
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Automatically Generating Hard Math Problems from Hypothesis-Driven Error Analysis
arXiv:2604.04386v1 Announce Type: new Abstract: Numerous math benchmarks exist to evaluate LLMs' mathematical capabilities. However, most involve extensive manu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MolDA: Molecular Understanding and Generation via Large Language Diffusion Model
arXiv:2604.04403v1 Announce Type: new Abstract: Large Language Models (LLMs) have significantly advanced molecular discovery, but existing multimodal molecular
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
PSY-STEP: Structuring Therapeutic Targets and Action Sequences for Proactive Counseling Dialogue Systems
arXiv:2604.04448v1 Announce Type: new Abstract: Cognitive Behavioral Therapy (CBT) aims to identify and restructure automatic negative thoughts pertaining to in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Empirical Characterization of Rationale Stability Under Controlled Perturbations for Explainable Pattern Recognition
arXiv:2604.04456v1 Announce Type: new Abstract: Reliable pattern recognition systems should exhibit consistent behavior across similar inputs, and their explana
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
The Topology of Multimodal Fusion: Why Current Architectures Fail at Creative Cognition
arXiv:2604.04465v1 Announce Type: new Abstract: This paper identifies a structural limitation in current multimodal AI architectures that is topological rather
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
What Makes a Sale? Rethinking End-to-End Seller--Buyer Retail Dynamics with LLM Agents
arXiv:2604.04468v1 Announce Type: new Abstract: Evaluating retail strategies before deployment is difficult, as outcomes are determined across multiple stages,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Scalable and Explainable Learner-Video Interaction Prediction using Multimodal Large Language Models
arXiv:2604.04482v1 Announce Type: new Abstract: Learners' use of video controls in educational videos provides implicit signals of cognitive processing and inst
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Memory Intelligence Agent
arXiv:2604.04503v1 Announce Type: new Abstract: Deep research agents (DRAs) integrate LLM reasoning with external tools. Memory systems enable DRAs to leverage
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Search, Do not Guess: Teaching Small Language Models to Be Effective Search Agents
arXiv:2604.04651v1 Announce Type: new Abstract: Agents equipped with search tools have emerged as effective solutions for knowledge-intensive tasks. While Large
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Springdrift: An Auditable Persistent Runtime for LLM Agents with Case-Based Memory, Normative Safety, and Ambient Self-Perception
arXiv:2604.04660v1 Announce Type: new Abstract: We present Springdrift, a persistent runtime for long-lived LLM agents. The system integrates an auditable execu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
AI Assistance Reduces Persistence and Hurts Independent Performance
arXiv:2604.04721v1 Announce Type: new Abstract: People often optimize for long-term goals in collaboration: A mentor or companion doesn't just answer questions,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MemMachine: A Ground-Truth-Preserving Memory System for Personalized AI Agents
arXiv:2604.04853v1 Announce Type: new Abstract: Large Language Model (LLM) agents require persistent memory to maintain personalization, factual continuity, and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
QED-Nano: Teaching a Tiny Model to Prove Hard Theorems
arXiv:2604.04898v1 Announce Type: new Abstract: Proprietary AI systems have recently demonstrated impressive capabilities on complex proof-based problems, with
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLMs-Healthcare : Current Applications and Challenges of Large Language Models in various Medical Specialties
arXiv:2311.12882v3 Announce Type: cross Abstract: We aim to present a comprehensive overview of the latest advancements in utilizing Large Language Models (LLMs
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
From Concept to Practice: an Automated LLM-aided UVM Machine for RTL Verification
arXiv:2504.19959v3 Announce Type: cross Abstract: Verification presents a major bottleneck in Integrated Circuit (IC) development, consuming nearly 70% of the t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
The Persuasion Paradox: When LLM Explanations Fail to Improve Human-AI Team Performance
arXiv:2604.03237v1 Announce Type: cross Abstract: While natural-language explanations from large language models (LLMs) are widely adopted to improve transparen
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Scaling DPPs for RAG: Density Meets Diversity
arXiv:2604.03240v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by grounding generation in external
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Classifying Problem and Solution Framing in Congressional Social Media
arXiv:2604.03247v1 Announce Type: cross Abstract: Policy setting in the USA according to the ``Garbage Can'' model differentiates between ``problem'' and ``solu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
BLK-Assist: A Methodological Framework for Artist-Led Co-Creation with Generative AI Models
arXiv:2604.03249v1 Announce Type: cross Abstract: This paper presents BLK-Assist, a modular framework for artist-specific fine-tuning of diffusion models using
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Robust LLM Performance Certification via Constrained Maximum Likelihood Estimation
arXiv:2604.03257v1 Announce Type: cross Abstract: The ability to rigorously estimate the failure rates of large language models (LLMs) is a prerequisite for the
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression
arXiv:2604.03258v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated impressive capabilities across various tasks, but the billion-s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Why Attend to Everything? Focus is the Key
arXiv:2604.03260v1 Announce Type: cross Abstract: We introduce Focus, a method that learns which token pairs matter rather than approximating all of them. Learn
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LPC-SM: Local Predictive Coding and Sparse Memory for Long-Context Language Modeling
arXiv:2604.03263v1 Announce Type: cross Abstract: Most current long-context language models still rely on attention to handle both local interaction and long-ra
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Impact of geophysical fields on Deep Learning-based Lagrangian drift simulations
arXiv:2604.03292v1 Announce Type: cross Abstract: We assess the influence of different Eulerian geophysical input fields on Lagrangian drift simulations using D
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Scaling Teams or Scaling Time? Memory Enabled Lifelong Learning in LLM Multi-Agent Systems
arXiv:2604.03295v1 Announce Type: cross Abstract: Large language model (LLM) multi-agent systems can scale along two distinct dimensions: by increasing the numb
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
3D-IDE: 3D Implicit Depth Emergent
arXiv:2604.03296v1 Announce Type: cross Abstract: Leveraging 3D information within Multimodal Large Language Models (MLLMs) has recently shown significant advan
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
XAttnRes: Cross-Stage Attention Residuals for Medical Image Segmentation
arXiv:2604.03297v1 Announce Type: cross Abstract: In the field of Large Language Models (LLMs), Attention Residuals have recently demonstrated that learned, sel
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Embedding-Only Uplink for Onboard Retrieval Under Shift in Remote Sensing
arXiv:2604.03301v1 Announce Type: cross Abstract: Downlink bottlenecks motivate onboard systems that prioritize hazards without transmitting raw pixels. We stud
DeepCamp AI