Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

50,993

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 21,464 Reads 29,529

All Reads (29,529) Articles (12561)Blog Posts (5580)Tutorials (2294)Research Papers (8224)News (870)

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning

arXiv:2606.31742v1 Announce Type: cross Abstract: Explainable AI (XAI) methods have demonstrated significant success in recent years at identifying relevant fea

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

CHERRY: Compressed Hierarchical Experts with Recurrent Representational Yield

arXiv:2606.31796v1 Announce Type: cross Abstract: We study three complementary techniques for training compute-efficient language models. (1) Selective supervis

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Geometry-Preserving Orthonormal Initialization for Low-Rank Adaptation in RLVR

arXiv:2606.31813v1 Announce Type: cross Abstract: Low-rank adaptation (LoRA) and its variants enable parameter-efficient fine-tuning of large language models un

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning

arXiv:2606.31825v1 Announce Type: cross Abstract: Recent multimodal large language models have shown great promise in clinical image reasoning, but existing pos

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Real-Time Source-Free Object Detection

arXiv:2606.31834v1 Announce Type: cross Abstract: Real-world detectors for autonomous driving, surveillance, and robotics must handle domain-shifts under strict

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Z-1: Efficient Reinforcement Learning for Vision-Language-Action Models

arXiv:2606.31846v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models offer a promising framework for robotic manipulation by connecting languag

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Attend, Transform, or Silence: Operator-Level Visual Skipping for Efficient Multimodal LLM Inference

arXiv:2606.31903v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) increasingly process long visual-token sequences, increasing the over

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

GR2 Technical Report

arXiv:2606.31984v1 Announce Type: cross Abstract: Industrial recommendation systems serve billions of users through a multi-stage funnel -- retrieval, early-sta

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Amplifying Membership Signal Through Chained Regeneration

arXiv:2606.31991v1 Announce Type: cross Abstract: The tendency of large generative models to memorize training data makes sample verification critical for priva

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

AdaJEPA: An Adaptive Latent World Model

arXiv:2606.32026v1 Announce Type: cross Abstract: Latent world models enable planning from high-dimensional observations by predicting future states in a compac

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

When LLMs Read Tables Carelessly: Measuring and Reducing Data Referencing Errors

arXiv:2606.32029v1 Announce Type: cross Abstract: While large language models (LLMs) perform well on table tasks, they still make data referencing errors (DREs)

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs

arXiv:2606.32032v1 Announce Type: cross Abstract: Metacognition is a critical component of intelligence that describes the ability to monitor and regulate one's

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

QVal: Cheaply Evaluating Dense Supervision Signals for Long-Horizon LLM Agents

arXiv:2606.32034v1 Announce Type: cross Abstract: LLM agents increasingly act over long horizons, where a single trajectory can contain hundreds or thousands of

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Introspective Coupling: Self-Explanation Training Tracks Behavioral Change Despite Fixed Supervision

arXiv:2606.32038v1 Announce Type: cross Abstract: When does training language models (LMs) to generate explanations of their predictions yield faithful introspe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Disentangling Reasoning Logic to Resolve Explicit Knowledge Conflicts

arXiv:2508.01273v3 Announce Type: replace Abstract: Explicit knowledge conflicts, occurring when retrieved contexts contain contradictory information, pose a fu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Deductive Logic in Language Models: Horizontal vs Vertical Reasoning

arXiv:2510.09340v2 Announce Type: replace Abstract: Recent language models exhibit significant logical reasoning abilities, yet the mechanisms supporting deduct

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

LLM-Empowered Agentic MAC Protocols: A Dynamic Stackelberg Game Approach

arXiv:2510.10895v2 Announce Type: replace Abstract: Medium Access Control (MAC) protocols, essential for wireless networks, are typically manually configured. W

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Improving LLM Reasoning with Homophily-aware Structural and Semantic Text-Attributed Graph Compression

arXiv:2601.08187v3 Announce Type: replace Abstract: Large language models (LLMs) have demonstrated promising capabilities in Text-Attributed Graph (TAG) underst

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Learning by Surprise: Adaptive Mitigation of Model Collapse in Large Language Models

arXiv:2410.12341v4 Announce Type: replace-cross Abstract: As AI-generated content increasingly populates the web, generative AI models are at growing risk of be

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Verify when Uncertain: Beyond Self-Consistency in Black Box Hallucination Detection

arXiv:2502.15845v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) often hallucinate, limiting their reliability in sensitive applications.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

SAGE: A Search-AuGmented Evaluation of Large Language Models on Free-Form QA

arXiv:2504.07385v3 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) become increasingly used for question-answering (QA), relying on stati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

TraCeS: Learning Per-Timestep Constraint-Violation Credit from Sparse Trajectory-Level Labels

arXiv:2504.12557v3 Announce Type: replace-cross Abstract: Ensuring safe behavior in reinforcement learning (RL) is challenging when safety constraints are impli

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

A Reproducible Benchmark of Lightweight CNNs: Accuracy, Efficiency, and the Impact of Pretrained Initialization

arXiv:2505.03303v3 Announce Type: replace-cross Abstract: Lightweight convolutional neural networks are often compared using results obtained with different tra

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Dataset Construction for Training LLM to Learn Analog Circuit Knowledge

arXiv:2508.10409v3 Announce Type: replace-cross Abstract: This paper constructs a textual dataset for training large language models (LLMs) to learn analog circ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Optimal Self-Consistency for Efficient Reasoning with Large Language Models

arXiv:2511.12309v2 Announce Type: replace-cross Abstract: Self-consistency (SC) is a widely used test-time inference technique for improving performance in chai

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Revisiting Audio-language Pretraining for Learning General-purpose Audio Representation

arXiv:2511.16757v2 Announce Type: replace-cross Abstract: Audio-language pretraining (ALP) holds promise for learning general-purpose audio representation, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation

arXiv:2512.21002v3 Announce Type: replace-cross Abstract: Distilling the capabilities from a large reasoning model (LRM) to a smaller student model often involv

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

arXiv:2601.04126v3 Announce Type: replace-cross Abstract: GUI agents that interact with graphical interfaces on behalf of users represent a promising direction

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

From Similarity to Vulnerability: Key Collision Attack on LLM Semantic Caching

arXiv:2601.23088v2 Announce Type: replace-cross Abstract: Semantic caching has emerged as a pivotal technique for scaling LLM applications, widely adopted by ma

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

DeXposure-FM: A Time-series, Graph Foundation Model for Credit Exposures and Stability on Decentralized Financial Networks

arXiv:2602.03981v2 Announce Type: replace-cross Abstract: Credit exposure in Decentralized Finance (DeFi) is often implicit and token-mediated, creating a dense

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models

arXiv:2603.12893v2 Announce Type: replace-cross Abstract: Reinforcement learning (RL) has become a standard technique for post-training diffusion-based image sy

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago

Visual Prompt Discovery via Semantic Exploration

arXiv:2603.16250v2 Announce Type: replace-cross Abstract: LVLMs encounter significant challenges in image understanding and visual reasoning, leading to critica

Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5 [03:59:05]

Dev.to · anon1 anon1 🧠 Large Language Models ⚡ AI Lesson 1d ago

Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5 [03:59:05]

Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5 ...

Building an Agentic AI Data Science Team with LLMs: A Multi-Agent Machine Learning Workflow

Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1d ago

Building an Agentic AI Data Science Team with LLMs: A Multi-Agent Machine Learning Workflow

How I designed a multi-agent system that frames machine learning problems, engineers features, trains and evaluates models, performs… Continue reading on Medium

Building an Agentic AI Data Science Team with LLMs: A Multi-Agent Machine Learning Workflow

Medium · Data Science 🧠 Large Language Models ⚡ AI Lesson 1d ago

Building an Agentic AI Data Science Team with LLMs: A Multi-Agent Machine Learning Workflow

How I designed a multi-agent system that frames machine learning problems, engineers features, trains and evaluates models, performs… Continue reading on Medium

I Tried ChatGPT Alternatives — Here’s the Truth

Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 1d ago

I Tried ChatGPT Alternatives — Here’s the Truth

Not the polished review kind. The confused-at-2AM, slightly disappointed, but honestly curious kind. Continue reading on Medium »

Why ChatGPT Makes Smart People Sound Surprisingly Average

Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 1d ago

Why ChatGPT Makes Smart People Sound Surprisingly Average

The biggest risk of AI isn’t that it writes badly. It’s that it makes average thinking sound complete. Continue reading on Medium »

Streaming vs Batching LLM Responses: A Cost and Latency Analysis

Dev.to · kapil Maheshwari 🧠 Large Language Models ⚡ AI Lesson 1d ago

Streaming vs Batching LLM Responses: A Cost and Latency Analysis

Explore the trade-offs between streaming and batching LLM responses to optimize costs and latency for your startup.

What Is RAG? The Story Behind Retrieval-Augmented Generation and Why It Changed AI Forever

Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1d ago

What Is RAG? The Story Behind Retrieval-Augmented Generation and Why It Changed AI Forever

Part 1 of the “Complete Guide to Retrieval-Augmented Generation (RAG)” series Continue reading on Artificial Intelligence in Plain English »

What Is RAG? The Story Behind Retrieval-Augmented Generation and Why It Changed AI Forever

Medium · NLP 🧠 Large Language Models ⚡ AI Lesson 1d ago

What Is RAG? The Story Behind Retrieval-Augmented Generation and Why It Changed AI Forever

Part 1 of the “Complete Guide to Retrieval-Augmented Generation (RAG)” series Continue reading on Artificial Intelligence in Plain English »

How We Translate 300-Page Books Using Claude Without Hitting Token Limits

Dev.to · 龚旭东 🧠 Large Language Models ⚡ AI Lesson 1d ago

How We Translate 300-Page Books Using Claude Without Hitting Token Limits

Breaking long documents into overlapping chunks, preserving context, and reassembling with...

Building HITL Feedback RAG: Embeddings, Retrieval, and Reranking

Medium · AI 🧠 Large Language Models ⚡ AI Lesson 1d ago

Building HITL Feedback RAG: Embeddings, Retrieval, and Reranking

This is the hands-on companion to Part 1: Your LLM Isn’t Dumb — It Just Lacks Your Context. There, we covered the idea: LLMs fail on your… Continue reading on M

Building HITL Feedback RAG: Embeddings, Retrieval, and Reranking

Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 1d ago

Building HITL Feedback RAG: Embeddings, Retrieval, and Reranking

This is the hands-on companion to Part 1: Your LLM Isn’t Dumb — It Just Lacks Your Context. There, we covered the idea: LLMs fail on your… Continue reading on T

A simple way to test model fallbacks with RouterBase

Dev.to · routerbasecom 🧠 Large Language Models ⚡ AI Lesson 1d ago

A simple way to test model fallbacks with RouterBase

Fallback logic is easier to reason about when the application has one request shape and the model...

Why I Stopped Asking AI “What Should I Do?”

Medium · Programming 🧠 Large Language Models ⚡ AI Lesson 1d ago

Why I Stopped Asking AI “What Should I Do?”

A subtle prompting mistake that was holding me back Continue reading on Medium »

Learning at the Learning Conference: A Brief from ICLR 2026

Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 1d ago

Learning at the Learning Conference: A Brief from ICLR 2026

Highlights from the TELUS Digital Research Hub for teams building with — and around — LLMs and agents. Continue reading on TELUS Digital Research Hub Briefs »

AI Update — July 1, 2026: 5 Things That Just Dropped

Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 1d ago

AI Update — July 1, 2026: 5 Things That Just Dropped

Astra glasses ship, Codex agents code for you, Qwen 3 Max goes open, FSD goes unsupervised, and AI voices just got legal. Continue reading on Adi Insights & Inn

TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1d ago

Trump drops restrictions on Anthropic’s Mythos and Fable models

Anthropic said it would begin restoring access to the Fable on July 1.