Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

51,021
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
All Reads (29,557) Articles (12569)Blog Posts (5585)Tutorials (2309)Research Papers (8224)News (870)
Mamba3 in Three Animations
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 1d ago
Mamba3 in Three Animations
The three changes that turn Mamba-2 into Mamba-3 — each one watched, not just described. Two of the three animations are driven by real… Continue reading on Tow
Anthropic’s War on Open-Source AI, or Is It Just Afraid?
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1d ago
Anthropic’s War on Open-Source AI, or Is It Just Afraid?
It trained on the world, then paid $1.5 billion to settle the pirated books. Its terms say you cannot use Claude to build a competitor… Continue reading on Towa
Anthropic’s War on Open-Source AI, or Is It Just Afraid?
Medium · Programming 🧠 Large Language Models ⚡ AI Lesson 1d ago
Anthropic’s War on Open-Source AI, or Is It Just Afraid?
It trained on the world, then paid $1.5 billion to settle the pirated books. Its terms say you cannot use Claude to build a competitor… Continue reading on Towa
Simon Willison's Blog 🧠 Large Language Models ⚡ AI Lesson 1d ago
Quoting Anthropic
We’ve received notice that the Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5. We'll begin restoring access tomorrow, and will
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 1d ago
Work across research, engineering, data, evals, and product to make models better at acting in real…
The Research Pillar: Reinforcement Learning, Reasoning, and Environments Continue reading on Medium »
You Use ChatGPT Every Day… But Do You Actually Know How It Works?
Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 1d ago
You Use ChatGPT Every Day… But Do You Actually Know How It Works?
From tokens and embeddings to transformers, temperature, and inference, here's the complete picture in simple words with diagrams and code. Continue reading on
[AI] Practical QLoRA Fine-tuning: Axolotl & Unsloth | SLM Playbook
Dev.to · Tuấn Anh 🧠 Large Language Models ⚡ AI Lesson 1d ago
[AI] Practical QLoRA Fine-tuning: Axolotl & Unsloth | SLM Playbook
← Series hub ← Previous | Next → Full-parameter fine-tuning of a large language model is a luxury....
Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 1d ago
The Best Prompt Is the One You Never Have to Rewrite
For a long time, I thought getting better results from AI meant getting better at writing prompts, so I did what most people do. I’d write… Continue reading on
Encoder VS Decoder Bert VS GPT
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1d ago
Encoder VS Decoder Bert VS GPT
They share the same architecture. The only real difference is which tokens each one is allowed to look at. Continue reading on Towards AI »
Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 1d ago
10 ChatGPT Prompts That Will Save You Hours Every Day
Artificial intelligence is only as useful as the instructions you give it. Continue reading on Medium »
Embeddings Simplified
Medium · RAG 🧠 Large Language Models ⚡ AI Lesson 2d ago
Embeddings Simplified
1 — What is embeddings? Continue reading on Medium »
I built a tool that cuts Claude/ChatGPT token usage by 97% — here's how it works
Dev.to · Rohith Matam 🧠 Large Language Models ⚡ AI Lesson 2d ago
I built a tool that cuts Claude/ChatGPT token usage by 97% — here's how it works
The Problem You're debugging a bug. You open Claude. You paste 10 files. You hit the context...
Serverless AI in a Browser Tab: Java WebAssembly + Local WebGPU LLMs
Dev.to · vishalmysore 🧠 Large Language Models ⚡ AI Lesson 2d ago
Serverless AI in a Browser Tab: Java WebAssembly + Local WebGPU LLMs
A deep technical whitepaper on building a zero-infrastructure RAG architecture where the...
BLEU: The Metric That Taught Machines to Translate
Medium · Deep Learning 🧠 Large Language Models ⚡ AI Lesson 2d ago
BLEU: The Metric That Taught Machines to Translate
Before deep learning, before transformers, before ChatGPT — there was BLEU. Here’s why it still matters, how it works, and where it falls… Continue reading on M
BLEU: The Metric That Taught Machines to Translate
Medium · NLP 🧠 Large Language Models ⚡ AI Lesson 2d ago
BLEU: The Metric That Taught Machines to Translate
Before deep learning, before transformers, before ChatGPT — there was BLEU. Here’s why it still matters, how it works, and where it falls… Continue reading on M
Evaluating Foundation Models: A Deep Dive into ROUGE
Medium · Data Science 🧠 Large Language Models ⚡ AI Lesson 2d ago
Evaluating Foundation Models: A Deep Dive into ROUGE
How do you measure whether an AI actually understands language? One classic answer is ROUGE and it’s more nuanced than it looks. Continue reading on Medium »
Evaluating Foundation Models: A Deep Dive into ROUGE
Medium · NLP 🧠 Large Language Models ⚡ AI Lesson 2d ago
Evaluating Foundation Models: A Deep Dive into ROUGE
How do you measure whether an AI actually understands language? One classic answer is ROUGE and it’s more nuanced than it looks. Continue reading on Medium »
Simon Willison's Blog 🧠 Large Language Models ⚡ AI Lesson 2d ago
What's new in Claude Sonnet 5
What's new in Claude Sonnet 5 Claude Sonnet 5 came out this morning . I always head straight for the "what's new" developer docs because they tend to have more
Semantic Deduplication with OpenAI Embeddings and pgvector
Medium · Programming 🧠 Large Language Models ⚡ AI Lesson 2d ago
Semantic Deduplication with OpenAI Embeddings and pgvector
Applications that process large amounts of text often run into duplicate content. While exact duplicates are easy to detect using hashes… Continue reading on Me
Semantic Deduplication with OpenAI Embeddings and pgvector
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 2d ago
Semantic Deduplication with OpenAI Embeddings and pgvector
Applications that process large amounts of text often run into duplicate content. While exact duplicates are easy to detect using hashes… Continue reading on Me
Does AI Dream? A Mechanistic Hypothesis for Hallucination
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 2d ago
Does AI Dream? A Mechanistic Hypothesis for Hallucination
Why language models might be doing something close to dreaming — and what that means for how we train, evaluate, and one day secure them Continue reading on Med
Does AI Dream? A Mechanistic Hypothesis for Hallucination
Medium · Cybersecurity 🧠 Large Language Models ⚡ AI Lesson 2d ago
Does AI Dream? A Mechanistic Hypothesis for Hallucination
Why language models might be doing something close to dreaming — and what that means for how we train, evaluate, and one day secure them Continue reading on Med
Don't Let Your LLM Wing It: Building a Knowledge Base That Actually Knows Things
Dev.to · hugolesta 🧠 Large Language Models ⚡ AI Lesson 2d ago
Don't Let Your LLM Wing It: Building a Knowledge Base That Actually Knows Things
A field guide to provisioning a Bedrock Knowledge Base on Aurora pgvector with Terraform, then keeping it in sync with a GitHub Actions workflow that ships docs
An LLM Doesn’t Know Your Data. RAG Gives It the Right Page
Medium · RAG 🧠 Large Language Models ⚡ AI Lesson 2d ago
An LLM Doesn’t Know Your Data. RAG Gives It the Right Page
An LLM only knows what it was trained on. That’s a snapshot of the past, and it stops there. It doesn’t know today’s news, and more… Continue reading on Medium
Building LSTMs with PyTorch and Lightning AI Part 7: Resuming Training with Checkpoints
Dev.to · Rijul Rajesh 🧠 Large Language Models ⚡ AI Lesson 2d ago
Building LSTMs with PyTorch and Lightning AI Part 7: Resuming Training with Checkpoints
In the previous article, we used TensorBoard to analyze the training process. Based on the graphs, we...
How AI Learns with Less Labeled Data
Medium · AI 🧠 Large Language Models ⚡ AI Lesson 2d ago
How AI Learns with Less Labeled Data
Most people think machine learning is mainly about choosing the best model. Continue reading on Medium »
Comparing Sarvam-30B and Qwen2.5–14B on Spider Text-to-SQL: An Active-Parameter Perspective
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 2d ago
Comparing Sarvam-30B and Qwen2.5–14B on Spider Text-to-SQL: An Active-Parameter Perspective
A controlled comparison on the Spider benchmark, scored by execution accuracy. Active-parameter count tells you more than the number in a… Continue reading on M
Claude Sonnet 5 closes the gap to Opus without the Opus bill
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 2d ago
Claude Sonnet 5 closes the gap to Opus without the Opus bill
Claude Sonnet 5: The New Default Worker Tier Continue reading on Medium »
The Same Architecture Quietly Powers Claude Code, Manus, OpenAI Deep Research — And LangChain Just…
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 2d ago
The Same Architecture Quietly Powers Claude Code, Manus, OpenAI Deep Research — And LangChain Just…
Four teams, four products, zero coordination — and the same four ingredients show up in every one. Continue reading on Towards AI »
The Same Architecture Quietly Powers Claude Code, Manus, OpenAI Deep Research — And LangChain Just…
Medium · Deep Learning 🧠 Large Language Models ⚡ AI Lesson 2d ago
The Same Architecture Quietly Powers Claude Code, Manus, OpenAI Deep Research — And LangChain Just…
Four teams, four products, zero coordination — and the same four ingredients show up in every one. Continue reading on Towards AI »
TechRepublic 🧠 Large Language Models ⚡ AI Lesson 2d ago
Claude Sonnet 5: Everything to Know About Anthropic’s New Model
Claude Sonnet 5 brings stronger agentic AI features, lower pricing, and updated safety protections. Here's what IT leaders need to know. The post Claude Sonnet
Does QLoRA’s NF4 Advantage Still Hold on LLaMA-3? I Put NF4 and FP4 Head to Head.
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 2d ago
Does QLoRA’s NF4 Advantage Still Hold on LLaMA-3? I Put NF4 and FP4 Head to Head.
The QLoRA paper demonstrated a 1–2 point accuracy advantage for the NF4 data type over FP4 on LLaMA-1 and LLaMA-2. Continue reading on Medium »
Debugging Benchmark: DeepSeek V4 Pro vs MiMo V2.5 Pro
Dev.to · Stanislav 🧠 Large Language Models ⚡ AI Lesson 2d ago
Debugging Benchmark: DeepSeek V4 Pro vs MiMo V2.5 Pro
A real-world comparison of two LLMs on a genuine race condition bug from GitHub ...
Reddit r/LocalLLaMA 🧠 Large Language Models ⚡ AI Lesson 2d ago
Devs - you have 64gb of VRAM - which model do you use for coding?
I've currently settled on an unsloth version of Qwen 3.5 122b-a10b model (UD-IQ4_NL). With 100k bf16 context window, I only had to load a few layers into CPU/RA
How ChatGPT Understands Your Questions?
Medium · JavaScript 🧠 Large Language Models ⚡ AI Lesson 2d ago
How ChatGPT Understands Your Questions?
If I ask you something you might have never heard before, can you answer it? Obviously, you would say, “Sorry, I can’t answer this… Continue reading on Medium »
How ChatGPT Understands Your Questions?
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 2d ago
How ChatGPT Understands Your Questions?
If I ask you something you might have never heard before, can you answer it? Obviously, you would say, “Sorry, I can’t answer this… Continue reading on Medium »
How I'm re-discovering computer science with LLM revolution
Dev.to · popiol 🧠 Large Language Models ⚡ AI Lesson 2d ago
How I'm re-discovering computer science with LLM revolution
I've been in this business for years and the best (or worst?) thing is that once I gain enough...
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 2d ago
How LLM Works Under the hood?
Have you ever asked ChatGPT to write code, solve a math problem, or explain a difficult concept and wondered, “How does it know all of… Continue reading on Medi
Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 2d ago
How LLM Works Under the hood?
Have you ever asked ChatGPT to write code, solve a math problem, or explain a difficult concept and wondered, “How does it know all of… Continue reading on Medi
Knowledge Leakage Is the New Data Leakage in LLMs.
Medium · Python 🧠 Large Language Models ⚡ AI Lesson 2d ago
Knowledge Leakage Is the New Data Leakage in LLMs.
Your LLM may know more than your data. Learn how knowledge leakage can silently influence your results. Continue reading on Data Science Collective »
OCR vs. Vision LLMs: Choosing the Right Tool for Intelligent Document Processing
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 2d ago
OCR vs. Vision LLMs: Choosing the Right Tool for Intelligent Document Processing
Vision-capable LLMs have dramatically simplified intelligent document processing by replacing rigid template-based OCR workflows with semantic document understa
Running Chinese LLMs at Scale: DeepSeek, Qwen, Kimi & GLM
Dev.to · fiercedash 🧠 Large Language Models ⚡ AI Lesson 2d ago
Running Chinese LLMs at Scale: DeepSeek, Qwen, Kimi & GLM
Running Chinese LLMs at Scale: DeepSeek, Qwen, Kimi & GLM I've been running production workloads...
Search Engine Journal 🧠 Large Language Models ⚡ AI Lesson 2d ago
ChatGPT Users Are Now Mostly Non-English via @sejournal, @MattGSouthern
OpenAI data show that active ChatGPT users on consumer plans now mostly use non-English languages, with growth fastest in Africa and Asia. The post ChatGPT User
I Asked ChatGPT to Fix My Life. It Couldn’t — Until I Changed One Thing
Medium · AI 🧠 Large Language Models ⚡ AI Lesson 2d ago
I Asked ChatGPT to Fix My Life. It Couldn’t — Until I Changed One Thing
Everyone says AI can solve almost anything. Continue reading on Readers Club »
I Asked ChatGPT to Fix My Life. It Couldn’t — Until I Changed One Thing
Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 2d ago
I Asked ChatGPT to Fix My Life. It Couldn’t — Until I Changed One Thing
Everyone says AI can solve almost anything. Continue reading on Readers Club »
Claude Sonnet 5 Is Here: Why It Might Replace Your Opus Subscription
Medium · Programming 🧠 Large Language Models ⚡ AI Lesson 2d ago
Claude Sonnet 5 Is Here: Why It Might Replace Your Opus Subscription
Anthropic released Claude Sonnet 5 today. It gets close to flagship-level performance at roughly a third of the price, and the story… Continue reading on AI Eng
AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 2d ago
Introducing Claude Sonnet 5 on AWS: Anthropic’s most capable Sonnet model
Today, we’re excited to announce the availability of Anthropic’s most advanced Sonnet model, Claude Sonnet 5, on Amazon Bedrock and Claude Platform on AWS. Clau
Claude Sonnet 5 just closed the gap with Opus
Medium · Deep Learning 🧠 Large Language Models ⚡ AI Lesson 2d ago
Claude Sonnet 5 just closed the gap with Opus
Anthropic shipped Claude Sonnet 5 today. The short version: it’s the most agentic Sonnet model Anthropic has built, it scores close to… Continue reading on Boot