Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,087 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Cog-DRIFT: Exploration on Adaptively Reformulated Instances Enables Learning from Hard Reasoning Problems
arXiv:2604.04767v1 Announce Type: cross Abstract: Reinforcement learning from verifiable rewards (RLVR) has improved the reasoning abilities of LLMs, yet a fund
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
SkillX: Automatically Constructing Skill Knowledge Bases for Agents
arXiv:2604.04804v1 Announce Type: cross Abstract: Learning from experience is critical for building capable large language model (LLM) agents, yet prevailing se
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection
arXiv:2604.04815v1 Announce Type: cross Abstract: The rapid development of Large Language Models (LLMs) has transformed fake news detection and fact-checking ta
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not
arXiv:2604.04825v1 Announce Type: cross Abstract: Large language models achieve strong performance on many language tasks, yet it remains unclear whether they i
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
InfBaGel: Human-Object-Scene Interaction Generation with Dynamic Perception and Iterative Refinement
arXiv:2604.04843v1 Announce Type: cross Abstract: Human-object-scene interactions (HOSI) generation has broad applications in embodied AI, simulation, and anima
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Prompt Framework
arXiv:2604.04852v1 Announce Type: cross Abstract: Chain-of-Thought (CoT) prompting has been used to enhance the reasoning capability of LLMs. However, its relia
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Noise Immunity in In-Context Tabular Learning: An Empirical Robustness Analysis of TabPFN's Attention Mechanisms
arXiv:2604.04868v1 Announce Type: cross Abstract: Tabular foundation models (TFMs) such as TabPFN (Tabular Prior-Data Fitted Network) are designed to generalize
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Entropy Modulation
arXiv:2604.04894v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has significantly advanced the reasoning capabilities of
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Agentic Federated Learning: The Future of Distributed Training Orchestration
arXiv:2604.04895v1 Announce Type: cross Abstract: Although Federated Learning (FL) promises privacy and distributed collaboration, its effectiveness in real-wor
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Vero: An Open RL Recipe for General Visual Reasoning
arXiv:2604.04917v1 Announce Type: cross Abstract: What does it take to build a visual reasoner that works across charts, science, spatial understanding, and ope
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Your Pre-trained Diffusion Model Secretly Knows Restoration
arXiv:2604.04924v1 Announce Type: cross Abstract: Pre-trained diffusion models have enabled significant advancements in All-in-One Restoration (AiOR), offering
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Early Stopping for Large Reasoning Models via Confidence Dynamics
arXiv:2604.04930v1 Announce Type: cross Abstract: Large reasoning models rely on long chain-of-thought generation to solve complex problems, but extended reason
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning
arXiv:2302.00797v4 Announce Type: replace Abstract: Opponent modeling methods typically involve two crucial steps: building a belief distribution over opponents
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible
arXiv:2411.06498v2 Announce Type: replace Abstract: A recent paper (van Rooij et al. 2024) claims to have proved that achieving human-like intelligence using le
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Reflection of Episodes: Learning to Play Game from Expert and Self Experiences
arXiv:2502.13388v3 Announce Type: replace Abstract: StarCraft II is a complex and dynamic real-time strategy (RTS) game environment, which is very suitable for
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models
arXiv:2506.17585v3 Announce Type: replace Abstract: Trustworthy language models should provide both correct and verifiable answers. However, citations generated
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Similarity Field Theory: A Mathematical Framework for Intelligence
arXiv:2509.18218v5 Announce Type: replace Abstract: We posit that transforming similarity relations form the structural basis of comprehensible dynamic systems.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics
arXiv:2510.09901v2 Announce Type: replace Abstract: Computing has long served as a cornerstone of scientific discovery. Recently, a paradigm shift has emerged w
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval
arXiv:2511.14130v2 Announce Type: replace Abstract: With the rapid progress of large language models (LLMs), financial information retrieval has become a critic
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
The Drill-Down and Fabricate Test (DDFT): A Protocol for Measuring Epistemic Robustness in Language Models
arXiv:2512.23850v2 Announce Type: replace Abstract: Current language model evaluations measure what models know under ideal conditions but not how robustly they
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers
arXiv:2601.06338v2 Announce Type: replace Abstract: Diffusion Transformers (DiTs) have greatly advanced text-to-image generation, but models still struggle to g
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors
arXiv:2601.08950v2 Announce Type: replace Abstract: Despite their growing adoption in education, LLMs remain misaligned with the core principle of effective tut
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
The Paradox of Robustness: Decoupling Rule-Based Logic from Affective Noise in High-Stakes Decision-Making
arXiv:2601.21439v2 Announce Type: replace Abstract: While Large Language Models (LLMs) are widely documented to be sensitive to minor prompt perturbations and p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
TSPO: Breaking the Double Homogenization Dilemma in Multi-turn Search Policy Optimization
arXiv:2601.22776v2 Announce Type: replace Abstract: Multi-turn tool-integrated reasoning enables Large Language Models (LLMs) to solve complex tasks through ite
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Enhancing Foundation VLM Robustness to Missing Modality: Scalable Diffusion for Bi-directional Feature Restoration
arXiv:2602.03151v2 Announce Type: replace Abstract: Vision Language Model (VLM) typically assume complete modality input during inference. However, their effect
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery
arXiv:2602.07943v2 Announce Type: replace Abstract: In the presence of confounding between an endogenous variable and the outcome, instrumental variables (IVs)
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning
arXiv:2602.13218v2 Announce Type: replace Abstract: Reinforcement Learning from Verifiable Rewards (RLVR) is bottlenecked by data: existing synthesis pipelines
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
KLong: Training LLM Agent for Extremely Long-horizon Tasks
arXiv:2602.17547v2 Announce Type: replace Abstract: This paper introduces KLong, an open-source LLM agent trained to solve extremely long-horizon tasks. The pri
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality
arXiv:2603.05912v2 Announce Type: replace Abstract: Search-augmented LLM agents can produce deep research reports (DRRs), but verifying claim-level factuality r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation
arXiv:2603.08388v4 Announce Type: replace Abstract: We propose a Hierarchical Error-Corrective Graph FrameworkforAutonomousAgentswithLLM-BasedActionGeneration(H
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Collective AI can amplify tiny perturbations into divergent decisions
arXiv:2603.09127v2 Announce Type: replace Abstract: Large language models are increasingly deployed not as single assistants but as committees whose members del
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
An Onto-Relational-Sophic Framework for Governing Synthetic Minds
arXiv:2603.18633v2 Announce Type: replace Abstract: The rapid evolution of artificial intelligence, from task-specific systems to foundation models exhibiting b
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
2w ago
ClawSafety: "Safe" LLMs, Unsafe Agents
arXiv:2604.01438v2 Announce Type: replace Abstract: Personal AI agents like OpenClaw run with elevated privileges on users' local machines, where a single succe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Domain-constrained knowledge representation: A modal framework
arXiv:2604.01770v2 Announce Type: replace Abstract: Knowledge graphs store large numbers of relations efficiently, but they remain weak at representing a quiete
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model
arXiv:2406.14194v3 Announce Type: replace-cross Abstract: The emergence of Large Vision-Language Models (LVLMs) marks significant strides towards achieving gene
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models
arXiv:2408.11871v3 Announce Type: replace-cross Abstract: Fake news significantly influences decision-making processes by misleading individuals, organizations,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
SPRIG: Improving Large Language Model Performance by System Prompt Optimization
arXiv:2410.14826v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have shown impressive capabilities in many scenarios, but their performan
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction
arXiv:2410.21169v5 Announce Type: replace-cross Abstract: Document parsing (DP) transforms unstructured or semi-structured documents into structured, machine-re
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Implicit Bias-Like Patterns in Reasoning Models
arXiv:2503.11572v4 Announce Type: replace-cross Abstract: Implicit biases refer to automatic mental processes that shape perceptions, judgments, and behaviors.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
BalancedDPO: Adaptive Multi-Metric Alignment
arXiv:2503.12575v2 Announce Type: replace-cross Abstract: Diffusion models have achieved remarkable progress in text-to-image generation, yet aligning them with
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLMs Judging LLMs: A Simplex Perspective
arXiv:2505.21972v3 Announce Type: replace-cross Abstract: Given the challenge of automatically evaluating free-form outputs from large language models (LLMs), a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Beyond Linear Steering: Unified Multi-Attribute Control for Language Models
arXiv:2505.24535v3 Announce Type: replace-cross Abstract: Controlling multiple behavioral attributes in large language models (LLMs) at inference time is a chal
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Large Language Models for Combinatorial Optimization of Design Structure Matrix
arXiv:2506.09749v3 Announce Type: replace-cross Abstract: In complex engineering systems, the dependencies among components or development activities are often
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ZINA: Multimodal Fine-grained Hallucination Detection and Editing
arXiv:2506.13130v2 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) often generate hallucinations, where the output deviates from
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Making Prompts First-Class Citizens for Adaptive LLM Pipelines
arXiv:2508.05012v2 Announce Type: replace-cross Abstract: Modern LLM pipelines increasingly resemble complex data-centric applications: they retrieve data, corr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference
arXiv:2508.16703v2 Announce Type: replace-cross Abstract: On-device running Large Language Models (LLMs) is nowadays a critical enabler towards preserving user
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Measuring Competency, Not Performance: Item-Aware Evaluation Across Medical Benchmarks
arXiv:2509.24186v2 Announce Type: replace-cross Abstract: Accuracy-based evaluation of Large Language Models (LLMs) measures benchmark-specific performance rath
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ACT: Agentic Classification Tree
arXiv:2509.26433v4 Announce Type: replace-cross Abstract: When used in high-stakes settings, AI systems are expected to produce decisions that are transparent,
DeepCamp AI