Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

40,053

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 21,463 Reads 18,590

All Reads (18,590) Articles (9041)Blog Posts (3394)Tutorials (2104)Research Papers (3833)News (218)

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 16h ago

A Reproducible Benchmark of Lightweight CNNs: Accuracy, Efficiency, and the Impact of Pretrained Initialization

arXiv:2505.03303v3 Announce Type: replace-cross Abstract: Lightweight convolutional neural networks are often compared using results obtained with different tra

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 16h ago

Dataset Construction for Training LLM to Learn Analog Circuit Knowledge

arXiv:2508.10409v3 Announce Type: replace-cross Abstract: This paper constructs a textual dataset for training large language models (LLMs) to learn analog circ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 16h ago

Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation

arXiv:2512.21002v3 Announce Type: replace-cross Abstract: Distilling the capabilities from a large reasoning model (LRM) to a smaller student model often involv

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 16h ago

InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

arXiv:2601.04126v3 Announce Type: replace-cross Abstract: GUI agents that interact with graphical interfaces on behalf of users represent a promising direction

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 16h ago

From Similarity to Vulnerability: Key Collision Attack on LLM Semantic Caching

arXiv:2601.23088v2 Announce Type: replace-cross Abstract: Semantic caching has emerged as a pivotal technique for scaling LLM applications, widely adopted by ma

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 16h ago

DeXposure-FM: A Time-series, Graph Foundation Model for Credit Exposures and Stability on Decentralized Financial Networks

arXiv:2602.03981v2 Announce Type: replace-cross Abstract: Credit exposure in Decentralized Finance (DeFi) is often implicit and token-mediated, creating a dense

Building an Agentic AI Data Science Team with LLMs: A Multi-Agent Machine Learning Workflow

Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 16h ago

Building an Agentic AI Data Science Team with LLMs: A Multi-Agent Machine Learning Workflow

How I designed a multi-agent system that frames machine learning problems, engineers features, trains and evaluates models, performs… Continue reading on Medium

Building an Agentic AI Data Science Team with LLMs: A Multi-Agent Machine Learning Workflow

Medium · Data Science 🧠 Large Language Models ⚡ AI Lesson 16h ago

Building an Agentic AI Data Science Team with LLMs: A Multi-Agent Machine Learning Workflow

How I designed a multi-agent system that frames machine learning problems, engineers features, trains and evaluates models, performs… Continue reading on Medium

I Tried ChatGPT Alternatives — Here’s the Truth

Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 16h ago

I Tried ChatGPT Alternatives — Here’s the Truth

Not the polished review kind. The confused-at-2AM, slightly disappointed, but honestly curious kind. Continue reading on Medium »

Why ChatGPT Makes Smart People Sound Surprisingly Average

Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 17h ago

Why ChatGPT Makes Smart People Sound Surprisingly Average

The biggest risk of AI isn’t that it writes badly. It’s that it makes average thinking sound complete. Continue reading on Medium »

Streaming vs Batching LLM Responses: A Cost and Latency Analysis

Dev.to · kapil Maheshwari 🧠 Large Language Models ⚡ AI Lesson 17h ago

Streaming vs Batching LLM Responses: A Cost and Latency Analysis

Explore the trade-offs between streaming and batching LLM responses to optimize costs and latency for your startup.

What Is RAG? The Story Behind Retrieval-Augmented Generation and Why It Changed AI Forever

Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 17h ago

What Is RAG? The Story Behind Retrieval-Augmented Generation and Why It Changed AI Forever

Part 1 of the “Complete Guide to Retrieval-Augmented Generation (RAG)” series Continue reading on Artificial Intelligence in Plain English »

What Is RAG? The Story Behind Retrieval-Augmented Generation and Why It Changed AI Forever

Medium · NLP 🧠 Large Language Models ⚡ AI Lesson 17h ago

What Is RAG? The Story Behind Retrieval-Augmented Generation and Why It Changed AI Forever

Part 1 of the “Complete Guide to Retrieval-Augmented Generation (RAG)” series Continue reading on Artificial Intelligence in Plain English »

How We Translate 300-Page Books Using Claude Without Hitting Token Limits

Dev.to · 龚旭东 🧠 Large Language Models ⚡ AI Lesson 17h ago

How We Translate 300-Page Books Using Claude Without Hitting Token Limits

Breaking long documents into overlapping chunks, preserving context, and reassembling with...

Building HITL Feedback RAG: Embeddings, Retrieval, and Reranking

Medium · AI 🧠 Large Language Models ⚡ AI Lesson 17h ago

Building HITL Feedback RAG: Embeddings, Retrieval, and Reranking

This is the hands-on companion to Part 1: Your LLM Isn’t Dumb — It Just Lacks Your Context. There, we covered the idea: LLMs fail on your… Continue reading on M

Building HITL Feedback RAG: Embeddings, Retrieval, and Reranking

Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 17h ago

Building HITL Feedback RAG: Embeddings, Retrieval, and Reranking

This is the hands-on companion to Part 1: Your LLM Isn’t Dumb — It Just Lacks Your Context. There, we covered the idea: LLMs fail on your… Continue reading on T

A simple way to test model fallbacks with RouterBase

Dev.to · routerbasecom 🧠 Large Language Models ⚡ AI Lesson 17h ago

A simple way to test model fallbacks with RouterBase

Fallback logic is easier to reason about when the application has one request shape and the model...

Why I Stopped Asking AI “What Should I Do?”

Medium · Programming 🧠 Large Language Models ⚡ AI Lesson 18h ago

Why I Stopped Asking AI “What Should I Do?”

A subtle prompting mistake that was holding me back Continue reading on Medium »

Learning at the Learning Conference: A Brief from ICLR 2026

Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 18h ago

Learning at the Learning Conference: A Brief from ICLR 2026

Highlights from the TELUS Digital Research Hub for teams building with — and around — LLMs and agents. Continue reading on TELUS Digital Research Hub Briefs »

AI Update — July 1, 2026: 5 Things That Just Dropped

Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 18h ago

AI Update — July 1, 2026: 5 Things That Just Dropped

Astra glasses ship, Codex agents code for you, Qwen 3 Max goes open, FSD goes unsupervised, and AI voices just got legal. Continue reading on Adi Insights & Inn

TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 18h ago

Trump drops restrictions on Anthropic’s Mythos and Fable models

Anthropic said it would begin restoring access to the Fable on July 1.

plz no

Reddit r/ChatGPT 🧠 Large Language Models ⚡ AI Lesson 18h ago

prompt: An authentic, completely ordinary iPhone photo taken by an employee at work. Somewhere in the scene is a professionally designed warning sign telling pe

The Journey of a Prompt Inside ChatGPT

Medium · JavaScript 🧠 Large Language Models ⚡ AI Lesson 18h ago

The Journey of a Prompt Inside ChatGPT

Every day, millions of people ask ChatGPT millions of questions. Whether it’s writing code, debugging applications, learning a new concept… Continue reading on

The Journey of a Prompt Inside ChatGPT

Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 18h ago

The Journey of a Prompt Inside ChatGPT

Every day, millions of people ask ChatGPT millions of questions. Whether it’s writing code, debugging applications, learning a new concept… Continue reading on

Prompt Engineering: Getting the Words Right, and the Hole Underneath

Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 18h ago

Prompt Engineering: Getting the Words Right, and the Hole Underneath

Continuing the series where I share what I’m learning about AI engineering each week — including the bits that genuinely surprised me. Continue reading on Mediu

Reddit r/LocalLLaMA 🧠 Large Language Models ⚡ AI Lesson 18h ago

Biggest, baddest model to fill 144GB VRAM + 120GB RAM to the brim, regardless of speed

I'm trying to round out my quiver of daily driver models for my personal harness. Right now I drive qwen3.6 27b for balanced code and gemma4 31b for human inter

Model Context Protocol (MCP) Explained: The Complete Guide Every AI Engineer Should Read

Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 18h ago

Model Context Protocol (MCP) Explained: The Complete Guide Every AI Engineer Should Read

From the limitations of early LLMs to the rise of AI agents — understand why the Model Context Protocol (MCP) is becoming the standard for… Continue reading on

Model Context Protocol (MCP) Explained: The Complete Guide Every AI Engineer Should Read

Medium · Programming 🧠 Large Language Models ⚡ AI Lesson 18h ago

Model Context Protocol (MCP) Explained: The Complete Guide Every AI Engineer Should Read

From the limitations of early LLMs to the rise of AI agents — understand why the Model Context Protocol (MCP) is becoming the standard for… Continue reading on

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 18h ago

The 2026 AI Model Release Race: Every Major LLM Launch You Need to Know

Key Takeaways Claude Sonnet 5 landed June 30, scoring 63.2% on SWE-bench Pro at $2/$10 per million tokens — close to Opus 4.8 at 40% of its standard price. It's

Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 19h ago

Call GPT, Claude, and Gemini from one API key — a 3-step setup

If you want to try GPT, Claude, and Gemini without signing up for three separate platforms and juggling three billing dashboards, here's a 3-step setup using an

Open-Source LLM APIs Beat Self-Hosting. Here's the Math.

Dev.to · RileyKim 🧠 Large Language Models ⚡ AI Lesson 19h ago

Open-Source LLM APIs Beat Self-Hosting. Here's the Math.

So here's what happened: open-Source LLM APIs Beat Self-Hosting. Here's the Math. Last quarter I sat...

Explaining attention mechanisms without math

Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 19h ago

Explaining attention mechanisms without math

Modern Language models like Claude, Google Translate, and other AI assistants can understand and generate responses to questions with… Continue reading on Mediu

The Day I Stopped Treating ChatGPT Like a Search Engine and Started Using It Like a Teammate

Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 19h ago

The Day I Stopped Treating ChatGPT Like a Search Engine and Started Using It Like a Teammate

Stop using ChatGPT like a search engine—start using it like your smartest teammate. Continue reading on Medium »

Reddit r/LocalLLaMA 🧠 Large Language Models ⚡ AI Lesson 19h ago

[audio.cpp] VibeVoice 1.5B released — 90-min podcast in 22.95 min, 4.08x real-time, 2.86x faster than Python without quantization. Native C++/ggml

I’m the author of audio.cpp, a C++/ggml runtime for local audio models. I just added VibeVoice 1.5B support and wanted to share the benchmark because long-form

Your LLM Doesn’t Pick Stocks — It Remembers Them

Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 19h ago

Your LLM Doesn’t Pick Stocks — It Remembers Them

The dirty secret of AI stock picking lives inside the model’s weights. Full write-up, code, and benchmarks on jiripik.com. Continue reading on Medium »

Word Representation

Medium · NLP 🧠 Large Language Models ⚡ AI Lesson 19h ago

Word Representation

This article is a reader companion to the Word Representation chapter of the Oxford Handbook of Computational Linguistics. It can be read… Continue reading on M

When Cosine Similarity Approaching Singularity in Google Search AI Mode

Medium · AI 🧠 Large Language Models ⚡ AI Lesson 20h ago

When Cosine Similarity Approaching Singularity in Google Search AI Mode

##Ontological Infiltration, Vector Collapse, and the Strategic Dilemma of Unified Knowledge Graphs Continue reading on Medium »

When Cosine Similarity Approaching Singularity in Google Search AI Mode

Medium · Data Science 🧠 Large Language Models ⚡ AI Lesson 20h ago

When Cosine Similarity Approaching Singularity in Google Search AI Mode

##Ontological Infiltration, Vector Collapse, and the Strategic Dilemma of Unified Knowledge Graphs Continue reading on Medium »

When Cosine Similarity Approaching Singularity in Google Search AI Mode

Medium · Deep Learning 🧠 Large Language Models ⚡ AI Lesson 20h ago

When Cosine Similarity Approaching Singularity in Google Search AI Mode

##Ontological Infiltration, Vector Collapse, and the Strategic Dilemma of Unified Knowledge Graphs Continue reading on Medium »

When Cosine Similarity Approaching Singularity in Google Search AI Mode

Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 20h ago

When Cosine Similarity Approaching Singularity in Google Search AI Mode

##Ontological Infiltration, Vector Collapse, and the Strategic Dilemma of Unified Knowledge Graphs Continue reading on Medium »

Building a Production RAG Pipeline with Hybrid Retrieval and LangChain

Dev.to · Hector Hernandez Cruz 🧠 Large Language Models ⚡ AI Lesson 20h ago

Building a Production RAG Pipeline with Hybrid Retrieval and LangChain

Most RAG tutorials get you 70% of the way there. This is about the other 30% that actually matters in...

Mamba3 in Three Animations

Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 20h ago

Mamba3 in Three Animations

The three changes that turn Mamba-2 into Mamba-3 — each one watched, not just described. Two of the three animations are driven by real… Continue reading on Tow

Anthropic’s War on Open-Source AI, or Is It Just Afraid?

Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 20h ago

Anthropic’s War on Open-Source AI, or Is It Just Afraid?

It trained on the world, then paid $1.5 billion to settle the pirated books. Its terms say you cannot use Claude to build a competitor… Continue reading on Towa

Anthropic’s War on Open-Source AI, or Is It Just Afraid?

Medium · Programming 🧠 Large Language Models ⚡ AI Lesson 20h ago

Anthropic’s War on Open-Source AI, or Is It Just Afraid?

It trained on the world, then paid $1.5 billion to settle the pirated books. Its terms say you cannot use Claude to build a competitor… Continue reading on Towa

Simon Willison's Blog 🧠 Large Language Models ⚡ AI Lesson 20h ago

Quoting Anthropic

We’ve received notice that the Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5. We'll begin restoring access tomorrow, and will

Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 20h ago

Work across research, engineering, data, evals, and product to make models better at acting in real…

The Research Pillar: Reinforcement Learning, Reasoning, and Environments Continue reading on Medium »

You Use ChatGPT Every Day… But Do You Actually Know How It Works?

Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 20h ago

You Use ChatGPT Every Day… But Do You Actually Know How It Works?

From tokens and embeddings to transformers, temperature, and inference, here's the complete picture in simple words with diagrams and code. Continue reading on

[AI] Practical QLoRA Fine-tuning: Axolotl & Unsloth | SLM Playbook

Dev.to · Tuấn Anh 🧠 Large Language Models ⚡ AI Lesson 21h ago

[AI] Practical QLoRA Fine-tuning: Axolotl & Unsloth | SLM Playbook

← Series hub ← Previous | Next → Full-parameter fine-tuning of a large language model is a luxury....