Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,480

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,393 Reads 5,087

Showing 5,087 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Cog-DRIFT: Exploration on Adaptively Reformulated Instances Enables Learning from Hard Reasoning Problems

arXiv:2604.04767v1 Announce Type: cross Abstract: Reinforcement learning from verifiable rewards (RLVR) has improved the reasoning abilities of LLMs, yet a fund

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

SkillX: Automatically Constructing Skill Knowledge Bases for Agents

arXiv:2604.04804v1 Announce Type: cross Abstract: Learning from experience is critical for building capable large language model (LLM) agents, yet prevailing se

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection

arXiv:2604.04815v1 Announce Type: cross Abstract: The rapid development of Large Language Models (LLMs) has transformed fake news detection and fact-checking ta

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not

arXiv:2604.04825v1 Announce Type: cross Abstract: Large language models achieve strong performance on many language tasks, yet it remains unclear whether they i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

InfBaGel: Human-Object-Scene Interaction Generation with Dynamic Perception and Iterative Refinement

arXiv:2604.04843v1 Announce Type: cross Abstract: Human-object-scene interactions (HOSI) generation has broad applications in embodied AI, simulation, and anima

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Prompt Framework

arXiv:2604.04852v1 Announce Type: cross Abstract: Chain-of-Thought (CoT) prompting has been used to enhance the reasoning capability of LLMs. However, its relia

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Noise Immunity in In-Context Tabular Learning: An Empirical Robustness Analysis of TabPFN's Attention Mechanisms

arXiv:2604.04868v1 Announce Type: cross Abstract: Tabular foundation models (TFMs) such as TabPFN (Tabular Prior-Data Fitted Network) are designed to generalize

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Entropy Modulation

arXiv:2604.04894v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has significantly advanced the reasoning capabilities of

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Agentic Federated Learning: The Future of Distributed Training Orchestration

arXiv:2604.04895v1 Announce Type: cross Abstract: Although Federated Learning (FL) promises privacy and distributed collaboration, its effectiveness in real-wor

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Vero: An Open RL Recipe for General Visual Reasoning

arXiv:2604.04917v1 Announce Type: cross Abstract: What does it take to build a visual reasoner that works across charts, science, spatial understanding, and ope

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Your Pre-trained Diffusion Model Secretly Knows Restoration

arXiv:2604.04924v1 Announce Type: cross Abstract: Pre-trained diffusion models have enabled significant advancements in All-in-One Restoration (AiOR), offering

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Early Stopping for Large Reasoning Models via Confidence Dynamics

arXiv:2604.04930v1 Announce Type: cross Abstract: Large reasoning models rely on long chain-of-thought generation to solve complex problems, but extended reason

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning

arXiv:2302.00797v4 Announce Type: replace Abstract: Opponent modeling methods typically involve two crucial steps: building a belief distribution over opponents

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible

arXiv:2411.06498v2 Announce Type: replace Abstract: A recent paper (van Rooij et al. 2024) claims to have proved that achieving human-like intelligence using le

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

arXiv:2502.13388v3 Announce Type: replace Abstract: StarCraft II is a complex and dynamic real-time strategy (RTS) game environment, which is very suitable for

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models

arXiv:2506.17585v3 Announce Type: replace Abstract: Trustworthy language models should provide both correct and verifiable answers. However, citations generated

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Similarity Field Theory: A Mathematical Framework for Intelligence

arXiv:2509.18218v5 Announce Type: replace Abstract: We posit that transforming similarity relations form the structural basis of comprehensible dynamic systems.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics

arXiv:2510.09901v2 Announce Type: replace Abstract: Computing has long served as a cornerstone of scientific discovery. Recently, a paradigm shift has emerged w

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval

arXiv:2511.14130v2 Announce Type: replace Abstract: With the rapid progress of large language models (LLMs), financial information retrieval has become a critic

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

The Drill-Down and Fabricate Test (DDFT): A Protocol for Measuring Epistemic Robustness in Language Models

arXiv:2512.23850v2 Announce Type: replace Abstract: Current language model evaluations measure what models know under ideal conditions but not how robustly they

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers

arXiv:2601.06338v2 Announce Type: replace Abstract: Diffusion Transformers (DiTs) have greatly advanced text-to-image generation, but models still struggle to g

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors

arXiv:2601.08950v2 Announce Type: replace Abstract: Despite their growing adoption in education, LLMs remain misaligned with the core principle of effective tut

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

The Paradox of Robustness: Decoupling Rule-Based Logic from Affective Noise in High-Stakes Decision-Making

arXiv:2601.21439v2 Announce Type: replace Abstract: While Large Language Models (LLMs) are widely documented to be sensitive to minor prompt perturbations and p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

TSPO: Breaking the Double Homogenization Dilemma in Multi-turn Search Policy Optimization

arXiv:2601.22776v2 Announce Type: replace Abstract: Multi-turn tool-integrated reasoning enables Large Language Models (LLMs) to solve complex tasks through ite

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Enhancing Foundation VLM Robustness to Missing Modality: Scalable Diffusion for Bi-directional Feature Restoration

arXiv:2602.03151v2 Announce Type: replace Abstract: Vision Language Model (VLM) typically assume complete modality input during inference. However, their effect

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery

arXiv:2602.07943v2 Announce Type: replace Abstract: In the presence of confounding between an endogenous variable and the outcome, instrumental variables (IVs)

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning

arXiv:2602.13218v2 Announce Type: replace Abstract: Reinforcement Learning from Verifiable Rewards (RLVR) is bottlenecked by data: existing synthesis pipelines

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

KLong: Training LLM Agent for Extremely Long-horizon Tasks

arXiv:2602.17547v2 Announce Type: replace Abstract: This paper introduces KLong, an open-source LLM agent trained to solve extremely long-horizon tasks. The pri

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality

arXiv:2603.05912v2 Announce Type: replace Abstract: Search-augmented LLM agents can produce deep research reports (DRRs), but verifying claim-level factuality r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation

arXiv:2603.08388v4 Announce Type: replace Abstract: We propose a Hierarchical Error-Corrective Graph FrameworkforAutonomousAgentswithLLM-BasedActionGeneration(H

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Collective AI can amplify tiny perturbations into divergent decisions

arXiv:2603.09127v2 Announce Type: replace Abstract: Large language models are increasingly deployed not as single assistants but as committees whose members del

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

An Onto-Relational-Sophic Framework for Governing Synthetic Minds

arXiv:2603.18633v2 Announce Type: replace Abstract: The rapid evolution of artificial intelligence, from task-specific systems to foundation models exhibiting b

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 2w ago

ClawSafety: "Safe" LLMs, Unsafe Agents

arXiv:2604.01438v2 Announce Type: replace Abstract: Personal AI agents like OpenClaw run with elevated privileges on users' local machines, where a single succe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Domain-constrained knowledge representation: A modal framework

arXiv:2604.01770v2 Announce Type: replace Abstract: Knowledge graphs store large numbers of relations efficiently, but they remain weak at representing a quiete

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model

arXiv:2406.14194v3 Announce Type: replace-cross Abstract: The emergence of Large Vision-Language Models (LVLMs) marks significant strides towards achieving gene

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models

arXiv:2408.11871v3 Announce Type: replace-cross Abstract: Fake news significantly influences decision-making processes by misleading individuals, organizations,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

SPRIG: Improving Large Language Model Performance by System Prompt Optimization

arXiv:2410.14826v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have shown impressive capabilities in many scenarios, but their performan

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

arXiv:2410.21169v5 Announce Type: replace-cross Abstract: Document parsing (DP) transforms unstructured or semi-structured documents into structured, machine-re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Implicit Bias-Like Patterns in Reasoning Models

arXiv:2503.11572v4 Announce Type: replace-cross Abstract: Implicit biases refer to automatic mental processes that shape perceptions, judgments, and behaviors.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

BalancedDPO: Adaptive Multi-Metric Alignment

arXiv:2503.12575v2 Announce Type: replace-cross Abstract: Diffusion models have achieved remarkable progress in text-to-image generation, yet aligning them with

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LLMs Judging LLMs: A Simplex Perspective

arXiv:2505.21972v3 Announce Type: replace-cross Abstract: Given the challenge of automatically evaluating free-form outputs from large language models (LLMs), a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Beyond Linear Steering: Unified Multi-Attribute Control for Language Models

arXiv:2505.24535v3 Announce Type: replace-cross Abstract: Controlling multiple behavioral attributes in large language models (LLMs) at inference time is a chal

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Large Language Models for Combinatorial Optimization of Design Structure Matrix

arXiv:2506.09749v3 Announce Type: replace-cross Abstract: In complex engineering systems, the dependencies among components or development activities are often

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

ZINA: Multimodal Fine-grained Hallucination Detection and Editing

arXiv:2506.13130v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) often generate hallucinations, where the output deviates from

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Making Prompts First-Class Citizens for Adaptive LLM Pipelines

arXiv:2508.05012v2 Announce Type: replace-cross Abstract: Modern LLM pipelines increasingly resemble complex data-centric applications: they retrieve data, corr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference

arXiv:2508.16703v2 Announce Type: replace-cross Abstract: On-device running Large Language Models (LLMs) is nowadays a critical enabler towards preserving user

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Measuring Competency, Not Performance: Item-Aware Evaluation Across Medical Benchmarks

arXiv:2509.24186v2 Announce Type: replace-cross Abstract: Accuracy-based evaluation of Large Language Models (LLMs) measures benchmark-specific performance rath

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

ACT: Agentic Classification Tree

arXiv:2509.26433v4 Announce Type: replace-cross Abstract: When used in high-stakes settings, AI systems are expected to produce decisions that are transparent,