Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

51,329

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 21,479 Reads 29,850

All Reads (29,850) Articles (12694)Blog Posts (5648)Tutorials (2402)Research Papers (8232)News (874)

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Estimating Grammatical Gender Directions in Contextual Embeddings under Controlled and Natural Contexts

arXiv:2606.30152v1 Announce Type: cross Abstract: Contextual language models conflate grammatical gender and social semantic bias in gendered languages such as

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Few-Shot Domain Incremental Learning via Continual Vision-Language Consolidation

arXiv:2606.30190v1 Announce Type: cross Abstract: Existing domain-incremental learning (DIL) strategies call for massive amounts of data to adapt to new domains

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Forewarned is Forearmed: When Non-Sequential Embedding Turns Into an Anomaly Detector

arXiv:2606.30196v1 Announce Type: cross Abstract: This paper offers an in-depth analysis of non-sequential multimodal sentence-level embeddings, with a particul

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

KnowsTFM: Knowledge-Informed Fine-Tuning of Small Tabular Foundation Models

arXiv:2606.30258v1 Announce Type: cross Abstract: Tabular foundation models have advanced deep learning for tabular data by delivering strong default performanc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Research Entity Extraction and Topic Detection from UKRI Grant Proposals

arXiv:2606.30304v1 Announce Type: cross Abstract: This paper presents preliminary findings from a UKRI-funded Metascience project comparing three LLM-based appr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Always-OnAgents:A Survey of Persistent Memory, State, and Governance in LLMAgents

arXiv:2606.30306v1 Announce Type: cross Abstract: Always-on agents are systems whose future behavior depends on durable state accumulated across earlier interac

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

MCP Server Architecture Patterns for LLM-Integrated Applications

arXiv:2606.30317v1 Announce Type: cross Abstract: The Model Context Protocol (MCP), introduced by Anthropic in November 2024, defines a standardized interface f

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

DRIFT: Difficulty Routing Self-DIstillation with Rhythm-Gated Exploration and Success BuFfer Training

arXiv:2606.30345v1 Announce Type: cross Abstract: Enabling large language models to achieve stable self-improvement without external expert supervision remains

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

A Stochastic--Geometric Theory of Scaling Laws in Grokking

arXiv:2606.30388v1 Announce Type: cross Abstract: Delayed generalization (\ie~grokking) refers to the phenomenon in which a neural network fits its training dat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Can LLMs Rank? A Tale of Triads and Triage

arXiv:2606.30412v1 Announce Type: cross Abstract: From housing allocation for households experiencing homelessness to triage in emergency departments, LLMs are

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Beyond Point Estimates for Glaucoma Visual Field Forecasting with Diffusion Models

arXiv:2606.30417v1 Announce Type: cross Abstract: Forecasting visual fields (VFs) is critical for personalized monitoring and treatment planning in glaucoma. Th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Transformer Architectures as Complete Bayes Processes: A Formal Proof in the Measure-Theoretic Kernel Framework

arXiv:2606.30440v1 Announce Type: cross Abstract: We present a complete formal proof that transformer architectures, when their internal update mechanisms satis

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Translating Natural Language to Strategic Temporal Specifications via LLMs

arXiv:2606.30441v1 Announce Type: cross Abstract: A rigorous formalization of system requirements is a fundamental prerequisite for the verification of Multi-Ag

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Collective cooperation without individual fidelity in LLM agents

arXiv:2606.30454v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly used as agents in simulations of social systems, yet it remains

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Field Order Should Not Matter: Permutation-Invariant Embedding Model Fine-Tuning for Structured Metadata Retrieval

arXiv:2606.30473v1 Announce Type: cross Abstract: We study retrieval over catalogs of structured metadata, where each record is a small schema whose fields answ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

On the Faithfulness of Post-Hoc Concept Bottleneck Models

arXiv:2606.30498v1 Announce Type: cross Abstract: Human decision-making interprets the world through high-level concepts, such as recognizing a bird by its bell

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Informational Frustration in Neural Manifolds: Shannon Bottlenecks and the Limits of Learnability

arXiv:2606.30512v1 Announce Type: cross Abstract: Why overparameterised deep networks generalise so remarkably well remains one of the most stubborn open questi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

TRACE: Temporal Relationship-Aware Conversational Entrainment Detection in Dyadic Speech

arXiv:2606.30543v1 Announce Type: cross Abstract: With the proliferation of speech AI agents, understanding emotional entrainment in conversational interaction

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

TraceLab: Characterizing Coding Agent Workloads for LLM Serving

arXiv:2606.30560v2 Announce Type: cross Abstract: Coding agents are rapidly becoming a major application of agentic LLMs, but serving them efficiently remains c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Words Speak Louder Than Code: Investigating Cognitive Heuristics in LLM-Based Code Vulnerability Detection

arXiv:2606.30587v1 Announce Type: cross Abstract: Researchers and practitioners increasingly apply Large Language Models (LLMs) for automated vulnerability dete

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

C$^{2}$R: Cross-sample Consistency Regularization Mitigates Feature Splitting and Absorption in Sparse Autoencoders

arXiv:2606.30609v1 Announce Type: cross Abstract: Sparse Autoencoders (SAEs) are widely used to interpret large language models by decomposing activations into

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Optimization Dynamics Imprint Semantic Specificity in Contrastive Embedding Norms

arXiv:2606.30625v1 Announce Type: cross Abstract: Contrastive embedding models trained with scale-invariant losses are typically paired with distance metrics li

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Pessimism's Paradox: Conservative Offline Training Amplifies Reward Hacking During Online Adaptation in Reasoning Models

arXiv:2606.30627v1 Announce Type: cross Abstract: Conservative offline training is widely advocated as a safe foundation for subsequent online adaptation: if a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

LeVo 2: Stable and Melodious Song Generation via Hierarchical Representation Modeling and Progressive Post-Training

arXiv:2606.30642v1 Announce Type: cross Abstract: Full-length song generation must preserve coherence and musicality, render detailed vocal and accompaniment ac

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

CLMASP: Coupling Large Language Models with Answer Set Programming for Robotic Task Planning

arXiv:2406.03367v2 Announce Type: replace Abstract: Large Language Models (LLMs) possess extensive foundational knowledge and moderate reasoning abilities, maki

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale

arXiv:2407.19633v4 Announce Type: replace Abstract: Optimization problems are pervasive in sectors from manufacturing and distribution to healthcare. However, m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

MARS: A neurosymbolic approach for interpretable drug discovery

arXiv:2410.05289v4 Announce Type: replace Abstract: Background: Neurosymbolic (NeSy) artificial intelligence describes the combination of logic or rule-based te

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

StarDojo: Benchmarking Open-Ended Behaviors of Agentic Multimodal LLMs in Production-Living Simulations with Stardew Valley

arXiv:2507.07445v3 Announce Type: replace Abstract: Autonomous agents navigating human society must master both production activities and social interactions, y

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning

arXiv:2508.02178v3 Announce Type: replace Abstract: Large reasoning models (LRMs) often exhibit overthinking, producing verbose Chain-of-Thought (CoT) traces th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Learning How to Use Tools, Not Just When: Pattern-Aware Tool-Integrated Reasoning

arXiv:2509.23292v4 Announce Type: replace Abstract: Tool-integrated reasoning (TIR) has become a key approach for improving large reasoning models (LRMs) on com

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Echoes of Human Malice in Agents: Benchmarking LLMs for Multi-Turn Online Harassment Attacks

arXiv:2510.14207v3 Announce Type: replace Abstract: Large Language Model (LLM) agents are powering a growing share of interactive web applications, yet remain v

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

CaveAgent: Transforming LLMs into Stateful Runtime Operators

arXiv:2601.01569v4 Announce Type: replace Abstract: LLM-based agents are increasingly capable of complex task execution, yet current agentic systems remain cons

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Knowing Bias, Doing Better: Mitigating Social Bias in LLMs via Know-Bias Neuron Enhancement

arXiv:2601.21864v2 Announce Type: replace Abstract: Large language models (LLMs) exhibit social biases that reinforce harmful stereotypes, limiting their safe d

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Pushing Forward Pareto Frontiers of Proactive Agents with Behavioral Agentic Optimization

arXiv:2602.11351v2 Announce Type: replace Abstract: Proactive large language model (LLM) agents aim to actively plan, query, and interact over multiple turns, e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

StackingNet: Collective Inference Across Independent AI Foundation Models

arXiv:2602.13792v2 Announce Type: replace Abstract: Artificial intelligence built on large foundation models has transformed language understanding, computer vi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

ProSpec RL: Plan Ahead, then Execute

arXiv:2407.21359v2 Announce Type: replace-cross Abstract: Imagining potential outcomes of actions before execution helps agents make more informed decisions, a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

XRAG: eXamining the Core -- Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation

arXiv:2412.15529v4 Announce Type: replace-cross Abstract: Retrieval-augmented generation (RAG) synergizes the retrieval of pertinent data with the generative ca

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

CASE-Bench: Context-Aware SafEty Benchmark for Large Language Models

arXiv:2501.14940v4 Announce Type: replace-cross Abstract: Aligning large language models (LLMs) with human values is essential for their safe deployment and wid

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Ontology-Guided Reverse Thinking Makes Large Language Models Stronger on Knowledge Graph Question Answering

arXiv:2502.11491v3 Announce Type: replace-cross Abstract: Large language models (LLMs) have shown remarkable capabilities in natural language processing. Howeve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Distributionally Robust Reinforcement Learning with Human Feedback

arXiv:2503.00539v2 Announce Type: replace-cross Abstract: Reinforcement learning from human feedback (RLHF) has evolved to be one of the main methods for fine-t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Towards Harnessing the Collaborative Power of Large and Small Models for Domain Tasks

arXiv:2504.17421v2 Announce Type: replace-cross Abstract: Large language models (LMs) offer broad generalization capabilities but require vast amounts of data a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Scaling Textual Gradients via Sampling-Based Momentum

arXiv:2506.00400v4 Announce Type: replace-cross Abstract: LLM-based prompt optimization, which uses LLM-provided ``textual gradients'' (feedback) to refine prom

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Modeling Earth-Scale Human-Like Societies with One Billion Agents

arXiv:2506.12078v2 Announce Type: replace-cross Abstract: Understanding the dynamic evolution of complex social phenomena requires both high-fidelity modeling o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Code Reasoning for Software Engineering Tasks: A Survey and A Call to Action

arXiv:2506.13932v3 Announce Type: replace-cross Abstract: The rise of large language models (LLMs) has led to dramatic improvements across a wide range of natur

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

arXiv:2507.05257v4 Announce Type: replace-cross Abstract: Recent benchmarks for Large Language Model (LLM) agents primarily focus on evaluating reasoning, plann

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

LLM Serving Optimization with Variable Prefill and Decode Lengths

arXiv:2508.06133v4 Announce Type: replace-cross Abstract: We study offline scheduling for large language model (LLM) serving under a fixed KV-cache memory budge

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

Post-training for Efficient Communication via Convention Formation

arXiv:2508.06482v2 Announce Type: replace-cross Abstract: Humans communicate with increasing efficiency in multi-turn interactions, by adapting their language a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago

TAR: Temporal Anchor-Constrained Reasoning for Video Temporal Grounding

arXiv:2508.07683v2 Announce Type: replace-cross Abstract: Video Temporal Grounding (VTG) aims to localize specific video segments corresponding to natural langu