Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,480

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,393 Reads 5,087

Showing 5,087 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents

arXiv:2604.04157v1 Announce Type: new Abstract: Theory of Mind (ToM) -- the ability to model others' mental states -- is fundamental to human social cognition.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

A Model of Understanding in Deep Learning Systems

arXiv:2604.04171v1 Announce Type: new Abstract: I propose a model of systematic understanding, suitable for machine learning systems. On this account, an agent

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

CoALFake: Collaborative Active Learning with Human-LLM Co-Annotation for Cross-Domain Fake News Detection

arXiv:2604.04174v1 Announce Type: new Abstract: The proliferation of fake news across diverse domains highlights critical limitations in current detection syste

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Comparative reversal learning reveals rigid adaptation in LLMs under non-stationary uncertainty

arXiv:2604.04182v1 Announce Type: new Abstract: Non-stationary environments require agents to revise previously learned action values when contingencies change.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Schema-Aware Planning and Hybrid Knowledge Toolset for Reliable Knowledge Graph Triple Verification

arXiv:2604.04190v1 Announce Type: new Abstract: Knowledge Graphs (KGs) serve as a critical foundation for AI systems, yet their automated construction inevitabl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Don't Blink: Evidence Collapse during Multimodal Reasoning

arXiv:2604.04207v1 Announce Type: new Abstract: Reasoning VLMs can become more accurate while progressively losing visual grounding as they think. This creates

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

TimeSeek: Temporal Reliability of Agentic Forecasters

arXiv:2604.04220v1 Announce Type: new Abstract: We introduce TimeSeek, a benchmark for studying how the reliability of agentic LLM forecasters changes over a pr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Pedagogical Safety in Educational Reinforcement Learning: Formalizing and Detecting Reward Hacking in AI Tutoring Systems

arXiv:2604.04237v1 Announce Type: new Abstract: Reinforcement learning (RL) is increasingly used to personalize instruction in intelligent tutoring systems, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

arXiv:2604.04247v1 Announce Type: new Abstract: Recent advances in prompt learning allow large language model agents to acquire task-relevant knowledge from inf

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Context Engineering: A Practitioner Methodology for Structured Human-AI Collaboration

arXiv:2604.04258v1 Announce Type: new Abstract: The quality of AI-generated output is often attributed to prompting technique, but extensive empirical observati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

InferenceEvolve: Towards Automated Causal Effect Estimators through Self-Evolving AI

arXiv:2604.04274v1 Announce Type: new Abstract: Causal inference is central to scientific discovery, yet choosing appropriate methods remains challenging becaus

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Preservation Is Not Enough for Width Growth: Regime-Sensitive Selection of Dense LM Warm Starts

arXiv:2604.04281v1 Announce Type: new Abstract: Width expansion offers a practical route to reuse smaller causal-language-model checkpoints, but selecting a wid

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

PanLUNA: An Efficient and Robust Query-Unified Multimodal Model for Edge Biosignal Intelligence

arXiv:2604.04297v1 Announce Type: new Abstract: Physiological foundation models (FMs) have shown promise for biosignal representation learning, yet most remain

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

RESCORE: LLM-Driven Simulation Recovery in Control Systems Research Papers

arXiv:2604.04324v1 Announce Type: new Abstract: Reconstructing numerical simulations from control systems research papers is often hindered by underspecified pa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Thermodynamic-Inspired Explainable GeoAI: Uncovering Regime-Dependent Mechanisms in Heterogeneous Spatial Systems

arXiv:2604.04339v1 Announce Type: new Abstract: Modeling spatial heterogeneity and associated critical transitions remains a fundamental challenge in geography

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Implementing surrogate goals for safer bargaining in LLM-based agents

arXiv:2604.04341v1 Announce Type: new Abstract: Surrogate goals have been proposed as a strategy for reducing risks from bargaining failures. A surrogate goal i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Domain-Contextualized Inference: A Computable Graph Architecture for Explicit-Domain Reasoning

arXiv:2604.04344v1 Announce Type: new Abstract: We establish a computation-substrate-agnostic inference architecture in which domain is an explicit first-class

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

REAM: Merging Improves Pruning of Experts in LLMs

arXiv:2604.04356v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) large language models (LLMs) are among the top-performing architectures. The largest mo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Decocted Experience Improves Test-Time Inference in LLM Agents

arXiv:2604.04373v1 Announce Type: new Abstract: There is growing interest in improving LLMs without updating model parameters. One well-established direction is

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Optimizing Service Operations via LLM-Powered Multi-Agent Simulation

arXiv:2604.04383v1 Announce Type: new Abstract: Service system performance depends on how participants respond to design choices, but modeling these responses i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Automatically Generating Hard Math Problems from Hypothesis-Driven Error Analysis

arXiv:2604.04386v1 Announce Type: new Abstract: Numerous math benchmarks exist to evaluate LLMs' mathematical capabilities. However, most involve extensive manu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

MolDA: Molecular Understanding and Generation via Large Language Diffusion Model

arXiv:2604.04403v1 Announce Type: new Abstract: Large Language Models (LLMs) have significantly advanced molecular discovery, but existing multimodal molecular

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

PSY-STEP: Structuring Therapeutic Targets and Action Sequences for Proactive Counseling Dialogue Systems

arXiv:2604.04448v1 Announce Type: new Abstract: Cognitive Behavioral Therapy (CBT) aims to identify and restructure automatic negative thoughts pertaining to in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Empirical Characterization of Rationale Stability Under Controlled Perturbations for Explainable Pattern Recognition

arXiv:2604.04456v1 Announce Type: new Abstract: Reliable pattern recognition systems should exhibit consistent behavior across similar inputs, and their explana

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

The Topology of Multimodal Fusion: Why Current Architectures Fail at Creative Cognition

arXiv:2604.04465v1 Announce Type: new Abstract: This paper identifies a structural limitation in current multimodal AI architectures that is topological rather

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

What Makes a Sale? Rethinking End-to-End Seller--Buyer Retail Dynamics with LLM Agents

arXiv:2604.04468v1 Announce Type: new Abstract: Evaluating retail strategies before deployment is difficult, as outcomes are determined across multiple stages,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Scalable and Explainable Learner-Video Interaction Prediction using Multimodal Large Language Models

arXiv:2604.04482v1 Announce Type: new Abstract: Learners' use of video controls in educational videos provides implicit signals of cognitive processing and inst

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Memory Intelligence Agent

arXiv:2604.04503v1 Announce Type: new Abstract: Deep research agents (DRAs) integrate LLM reasoning with external tools. Memory systems enable DRAs to leverage

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Search, Do not Guess: Teaching Small Language Models to Be Effective Search Agents

arXiv:2604.04651v1 Announce Type: new Abstract: Agents equipped with search tools have emerged as effective solutions for knowledge-intensive tasks. While Large

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Springdrift: An Auditable Persistent Runtime for LLM Agents with Case-Based Memory, Normative Safety, and Ambient Self-Perception

arXiv:2604.04660v1 Announce Type: new Abstract: We present Springdrift, a persistent runtime for long-lived LLM agents. The system integrates an auditable execu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

AI Assistance Reduces Persistence and Hurts Independent Performance

arXiv:2604.04721v1 Announce Type: new Abstract: People often optimize for long-term goals in collaboration: A mentor or companion doesn't just answer questions,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

MemMachine: A Ground-Truth-Preserving Memory System for Personalized AI Agents

arXiv:2604.04853v1 Announce Type: new Abstract: Large Language Model (LLM) agents require persistent memory to maintain personalization, factual continuity, and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

arXiv:2604.04898v1 Announce Type: new Abstract: Proprietary AI systems have recently demonstrated impressive capabilities on complex proof-based problems, with

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LLMs-Healthcare : Current Applications and Challenges of Large Language Models in various Medical Specialties

arXiv:2311.12882v3 Announce Type: cross Abstract: We aim to present a comprehensive overview of the latest advancements in utilizing Large Language Models (LLMs

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

From Concept to Practice: an Automated LLM-aided UVM Machine for RTL Verification

arXiv:2504.19959v3 Announce Type: cross Abstract: Verification presents a major bottleneck in Integrated Circuit (IC) development, consuming nearly 70% of the t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

The Persuasion Paradox: When LLM Explanations Fail to Improve Human-AI Team Performance

arXiv:2604.03237v1 Announce Type: cross Abstract: While natural-language explanations from large language models (LLMs) are widely adopted to improve transparen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Scaling DPPs for RAG: Density Meets Diversity

arXiv:2604.03240v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by grounding generation in external

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Classifying Problem and Solution Framing in Congressional Social Media

arXiv:2604.03247v1 Announce Type: cross Abstract: Policy setting in the USA according to the ``Garbage Can'' model differentiates between ``problem'' and ``solu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

BLK-Assist: A Methodological Framework for Artist-Led Co-Creation with Generative AI Models

arXiv:2604.03249v1 Announce Type: cross Abstract: This paper presents BLK-Assist, a modular framework for artist-specific fine-tuning of diffusion models using

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Robust LLM Performance Certification via Constrained Maximum Likelihood Estimation

arXiv:2604.03257v1 Announce Type: cross Abstract: The ability to rigorously estimate the failure rates of large language models (LLMs) is a prerequisite for the

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression

arXiv:2604.03258v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated impressive capabilities across various tasks, but the billion-s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Why Attend to Everything? Focus is the Key

arXiv:2604.03260v1 Announce Type: cross Abstract: We introduce Focus, a method that learns which token pairs matter rather than approximating all of them. Learn

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LPC-SM: Local Predictive Coding and Sparse Memory for Long-Context Language Modeling

arXiv:2604.03263v1 Announce Type: cross Abstract: Most current long-context language models still rely on attention to handle both local interaction and long-ra

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Impact of geophysical fields on Deep Learning-based Lagrangian drift simulations

arXiv:2604.03292v1 Announce Type: cross Abstract: We assess the influence of different Eulerian geophysical input fields on Lagrangian drift simulations using D

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Scaling Teams or Scaling Time? Memory Enabled Lifelong Learning in LLM Multi-Agent Systems

arXiv:2604.03295v1 Announce Type: cross Abstract: Large language model (LLM) multi-agent systems can scale along two distinct dimensions: by increasing the numb

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

3D-IDE: 3D Implicit Depth Emergent

arXiv:2604.03296v1 Announce Type: cross Abstract: Leveraging 3D information within Multimodal Large Language Models (MLLMs) has recently shown significant advan

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

XAttnRes: Cross-Stage Attention Residuals for Medical Image Segmentation

arXiv:2604.03297v1 Announce Type: cross Abstract: In the field of Large Language Models (LLMs), Attention Residuals have recently demonstrated that learned, sel

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Embedding-Only Uplink for Onboard Retrieval Under Shift in Remote Sensing

arXiv:2604.03301v1 Announce Type: cross Abstract: Downlink bottlenecks motivate onboard systems that prioritize hazards without transmitting raw pixels. We stud