Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

51,207
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
All Reads (29,734) Articles (12645)Blog Posts (5613)Tutorials (2371)Research Papers (8231)News (874)
Evaluating Foundation Models: A Deep Dive into ROUGE
Medium · Data Science 🧠 Large Language Models ⚡ AI Lesson 4d ago
Evaluating Foundation Models: A Deep Dive into ROUGE
How do you measure whether an AI actually understands language? One classic answer is ROUGE and it’s more nuanced than it looks. Continue reading on Medium »
Evaluating Foundation Models: A Deep Dive into ROUGE
Medium · NLP 🧠 Large Language Models ⚡ AI Lesson 4d ago
Evaluating Foundation Models: A Deep Dive into ROUGE
How do you measure whether an AI actually understands language? One classic answer is ROUGE and it’s more nuanced than it looks. Continue reading on Medium »
Simon Willison's Blog 🧠 Large Language Models ⚡ AI Lesson 4d ago
What's new in Claude Sonnet 5
What's new in Claude Sonnet 5 Claude Sonnet 5 came out this morning . I always head straight for the "what's new" developer docs because they tend to have more
Semantic Deduplication with OpenAI Embeddings and pgvector
Medium · Programming 🧠 Large Language Models ⚡ AI Lesson 4d ago
Semantic Deduplication with OpenAI Embeddings and pgvector
Applications that process large amounts of text often run into duplicate content. While exact duplicates are easy to detect using hashes… Continue reading on Me
Semantic Deduplication with OpenAI Embeddings and pgvector
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 4d ago
Semantic Deduplication with OpenAI Embeddings and pgvector
Applications that process large amounts of text often run into duplicate content. While exact duplicates are easy to detect using hashes… Continue reading on Me
Does AI Dream? A Mechanistic Hypothesis for Hallucination
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 4d ago
Does AI Dream? A Mechanistic Hypothesis for Hallucination
Why language models might be doing something close to dreaming — and what that means for how we train, evaluate, and one day secure them Continue reading on Med
Does AI Dream? A Mechanistic Hypothesis for Hallucination
Medium · Cybersecurity 🧠 Large Language Models ⚡ AI Lesson 4d ago
Does AI Dream? A Mechanistic Hypothesis for Hallucination
Why language models might be doing something close to dreaming — and what that means for how we train, evaluate, and one day secure them Continue reading on Med
Don't Let Your LLM Wing It: Building a Knowledge Base That Actually Knows Things
Dev.to · hugolesta 🧠 Large Language Models ⚡ AI Lesson 4d ago
Don't Let Your LLM Wing It: Building a Knowledge Base That Actually Knows Things
A field guide to provisioning a Bedrock Knowledge Base on Aurora pgvector with Terraform, then keeping it in sync with a GitHub Actions workflow that ships docs
An LLM Doesn’t Know Your Data. RAG Gives It the Right Page
Medium · RAG 🧠 Large Language Models ⚡ AI Lesson 4d ago
An LLM Doesn’t Know Your Data. RAG Gives It the Right Page
An LLM only knows what it was trained on. That’s a snapshot of the past, and it stops there. It doesn’t know today’s news, and more… Continue reading on Medium
Building LSTMs with PyTorch and Lightning AI Part 7: Resuming Training with Checkpoints
Dev.to · Rijul Rajesh 🧠 Large Language Models ⚡ AI Lesson 4d ago
Building LSTMs with PyTorch and Lightning AI Part 7: Resuming Training with Checkpoints
In the previous article, we used TensorBoard to analyze the training process. Based on the graphs, we...
How AI Learns with Less Labeled Data
Medium · AI 🧠 Large Language Models ⚡ AI Lesson 4d ago
How AI Learns with Less Labeled Data
Most people think machine learning is mainly about choosing the best model. Continue reading on Medium »
Comparing Sarvam-30B and Qwen2.5–14B on Spider Text-to-SQL: An Active-Parameter Perspective
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 4d ago
Comparing Sarvam-30B and Qwen2.5–14B on Spider Text-to-SQL: An Active-Parameter Perspective
A controlled comparison on the Spider benchmark, scored by execution accuracy. Active-parameter count tells you more than the number in a… Continue reading on M
Claude Sonnet 5 closes the gap to Opus without the Opus bill
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 4d ago
Claude Sonnet 5 closes the gap to Opus without the Opus bill
Claude Sonnet 5: The New Default Worker Tier Continue reading on Medium »
The Same Architecture Quietly Powers Claude Code, Manus, OpenAI Deep Research — And LangChain Just…
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 4d ago
The Same Architecture Quietly Powers Claude Code, Manus, OpenAI Deep Research — And LangChain Just…
Four teams, four products, zero coordination — and the same four ingredients show up in every one. Continue reading on Towards AI »
The Same Architecture Quietly Powers Claude Code, Manus, OpenAI Deep Research — And LangChain Just…
Medium · Deep Learning 🧠 Large Language Models ⚡ AI Lesson 4d ago
The Same Architecture Quietly Powers Claude Code, Manus, OpenAI Deep Research — And LangChain Just…
Four teams, four products, zero coordination — and the same four ingredients show up in every one. Continue reading on Towards AI »
TechRepublic 🧠 Large Language Models ⚡ AI Lesson 4d ago
Claude Sonnet 5: Everything to Know About Anthropic’s New Model
Claude Sonnet 5 brings stronger agentic AI features, lower pricing, and updated safety protections. Here's what IT leaders need to know. The post Claude Sonnet
Does QLoRA’s NF4 Advantage Still Hold on LLaMA-3? I Put NF4 and FP4 Head to Head.
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 4d ago
Does QLoRA’s NF4 Advantage Still Hold on LLaMA-3? I Put NF4 and FP4 Head to Head.
The QLoRA paper demonstrated a 1–2 point accuracy advantage for the NF4 data type over FP4 on LLaMA-1 and LLaMA-2. Continue reading on Medium »
Debugging Benchmark: DeepSeek V4 Pro vs MiMo V2.5 Pro
Dev.to · Stanislav 🧠 Large Language Models ⚡ AI Lesson 4d ago
Debugging Benchmark: DeepSeek V4 Pro vs MiMo V2.5 Pro
A real-world comparison of two LLMs on a genuine race condition bug from GitHub ...
Reddit r/LocalLLaMA 🧠 Large Language Models ⚡ AI Lesson 4d ago
Devs - you have 64gb of VRAM - which model do you use for coding?
I've currently settled on an unsloth version of Qwen 3.5 122b-a10b model (UD-IQ4_NL). With 100k bf16 context window, I only had to load a few layers into CPU/RA
How ChatGPT Understands Your Questions?
Medium · JavaScript 🧠 Large Language Models ⚡ AI Lesson 4d ago
How ChatGPT Understands Your Questions?
If I ask you something you might have never heard before, can you answer it? Obviously, you would say, “Sorry, I can’t answer this… Continue reading on Medium »
How ChatGPT Understands Your Questions?
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 4d ago
How ChatGPT Understands Your Questions?
If I ask you something you might have never heard before, can you answer it? Obviously, you would say, “Sorry, I can’t answer this… Continue reading on Medium »
How I'm re-discovering computer science with LLM revolution
Dev.to · popiol 🧠 Large Language Models ⚡ AI Lesson 4d ago
How I'm re-discovering computer science with LLM revolution
I've been in this business for years and the best (or worst?) thing is that once I gain enough...
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 4d ago
How LLM Works Under the hood?
Have you ever asked ChatGPT to write code, solve a math problem, or explain a difficult concept and wondered, “How does it know all of… Continue reading on Medi
Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 4d ago
How LLM Works Under the hood?
Have you ever asked ChatGPT to write code, solve a math problem, or explain a difficult concept and wondered, “How does it know all of… Continue reading on Medi
Knowledge Leakage Is the New Data Leakage in LLMs.
Medium · Python 🧠 Large Language Models ⚡ AI Lesson 4d ago
Knowledge Leakage Is the New Data Leakage in LLMs.
Your LLM may know more than your data. Learn how knowledge leakage can silently influence your results. Continue reading on Data Science Collective »
OCR vs. Vision LLMs: Choosing the Right Tool for Intelligent Document Processing
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 4d ago
OCR vs. Vision LLMs: Choosing the Right Tool for Intelligent Document Processing
Vision-capable LLMs have dramatically simplified intelligent document processing by replacing rigid template-based OCR workflows with semantic document understa
Running Chinese LLMs at Scale: DeepSeek, Qwen, Kimi & GLM
Dev.to · fiercedash 🧠 Large Language Models ⚡ AI Lesson 4d ago
Running Chinese LLMs at Scale: DeepSeek, Qwen, Kimi & GLM
Running Chinese LLMs at Scale: DeepSeek, Qwen, Kimi & GLM I've been running production workloads...
Search Engine Journal 🧠 Large Language Models ⚡ AI Lesson 4d ago
ChatGPT Users Are Now Mostly Non-English via @sejournal, @MattGSouthern
OpenAI data show that active ChatGPT users on consumer plans now mostly use non-English languages, with growth fastest in Africa and Asia. The post ChatGPT User
I Asked ChatGPT to Fix My Life. It Couldn’t — Until I Changed One Thing
Medium · AI 🧠 Large Language Models ⚡ AI Lesson 4d ago
I Asked ChatGPT to Fix My Life. It Couldn’t — Until I Changed One Thing
Everyone says AI can solve almost anything. Continue reading on Readers Club »
I Asked ChatGPT to Fix My Life. It Couldn’t — Until I Changed One Thing
Medium · ChatGPT 🧠 Large Language Models ⚡ AI Lesson 4d ago
I Asked ChatGPT to Fix My Life. It Couldn’t — Until I Changed One Thing
Everyone says AI can solve almost anything. Continue reading on Readers Club »
Claude Sonnet 5 Is Here: Why It Might Replace Your Opus Subscription
Medium · Programming 🧠 Large Language Models ⚡ AI Lesson 4d ago
Claude Sonnet 5 Is Here: Why It Might Replace Your Opus Subscription
Anthropic released Claude Sonnet 5 today. It gets close to flagship-level performance at roughly a third of the price, and the story… Continue reading on AI Eng
AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 4d ago
Introducing Claude Sonnet 5 on AWS: Anthropic’s most capable Sonnet model
Today, we’re excited to announce the availability of Anthropic’s most advanced Sonnet model, Claude Sonnet 5, on Amazon Bedrock and Claude Platform on AWS. Clau
Claude Sonnet 5 just closed the gap with Opus
Medium · Deep Learning 🧠 Large Language Models ⚡ AI Lesson 4d ago
Claude Sonnet 5 just closed the gap with Opus
Anthropic shipped Claude Sonnet 5 today. The short version: it’s the most agentic Sonnet model Anthropic has built, it scores close to… Continue reading on Boot
Claude Sonnet 5 just closed the gap with Opus
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 4d ago
Claude Sonnet 5 just closed the gap with Opus
Anthropic shipped Claude Sonnet 5 today. The short version: it’s the most agentic Sonnet model Anthropic has built, it scores close to… Continue reading on Boot
Beyond the Raw LLM: The Architecture Behind Reliable AI Agents
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 4d ago
Beyond the Raw LLM: The Architecture Behind Reliable AI Agents
Why a powerful model isn’t enough — and what actually turns an LLM into a dependable software system. Continue reading on Medium »
mcpgen: Turn any OpenAPI spec into an MCP server in seconds
Dev.to · 56 Prince Pal 🧠 Large Language Models ⚡ AI Lesson 4d ago
mcpgen: Turn any OpenAPI spec into an MCP server in seconds
I got tired of manually writing MCP tools for every REST endpoint I wanted to expose to an LLM. So I...
Prompting Is Dead. Long Live the Loop.
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 4d ago
Prompting Is Dead. Long Live the Loop.
Single-shot prompts got us here. Autonomous agent loops are taking AI the rest of the way. Continue reading on Medium »
Reddit r/LocalLLaMA 🧠 Large Language Models ⚡ AI Lesson 4d ago
The harness matters more than the model. A 27B behind good critics changed my mind.
I saw someone test Qwen3.6-27B with a 3-critic harness. The harness included code review, test review and Playwright e2e. Each critic had context. The result wa
LLMs didn't kill feature engineering. Engineers did.
Dev.to · Jignesh Maheshwari 🧠 Large Language Models ⚡ AI Lesson 4d ago
LLMs didn't kill feature engineering. Engineers did.
Somewhere around when LLMs started eating every roadmap, a quiet belief took over a lot of teams. If...
Claude Sonnet 5 Benchmarks
Reddit r/singularity 🧠 Large Language Models ⚡ AI Lesson 4d ago
Claude Sonnet 5 Benchmarks
submitted by /u/WhyLifeIs4 <a href="https://www.reddit.com/gallery/1ujwmnr"
The AI Industry Spent Billions on Memory Before a Surprisingly Simple Idea Inspired Google’s Open…
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 4d ago
The AI Industry Spent Billions on Memory Before a Surprisingly Simple Idea Inspired Google’s Open…
The next edge in enterprise AI won’t come from a bigger model. It will come from better-organized knowledge. Continue reading on Towards Dev »
The AI Industry Spent Billions on Memory Before a Surprisingly Simple Idea Inspired Google’s Open…
Medium · RAG 🧠 Large Language Models ⚡ AI Lesson 4d ago
The AI Industry Spent Billions on Memory Before a Surprisingly Simple Idea Inspired Google’s Open…
The next edge in enterprise AI won’t come from a bigger model. It will come from better-organized knowledge. Continue reading on Towards Dev »
LLMs: The New Compiler, NOT The New Deity
Medium · LLM 🧠 Large Language Models ⚡ AI Lesson 4d ago
LLMs: The New Compiler, NOT The New Deity
Treat it as anything else and you’ll get burned. Continue reading on Salty Deprecated Software Engineer »
Nano Banana 2 Lite with MCP, and Antigravity CLI
Medium · Python 🧠 Large Language Models ⚡ AI Lesson 4d ago
Nano Banana 2 Lite with MCP, and Antigravity CLI
This article covers the MCP setup and configuration for using Google Nano Banana 2 Lite and underlying Gemini LLM with Antigravity CLI to… Continue reading on G
Medium · Machine Learning 🧠 Large Language Models ⚡ AI Lesson 4d ago
My AI Lied to a Client — Here’s How I Fixed It.
Stop AI hallucinations without going insane Continue reading on Medium »
Reddit r/MachineLearning 🧠 Large Language Models ⚡ AI Lesson 4d ago
Are all LLM research papers nowadays 100+ pages beasts?[D]
Was reading some research papers put out by Anthropic (and some other organizations/researchers) and one thing I've noticed is that these research papers consis
Feeding the LLM the Whole Thread: Prompt Chaining for Contextual Replies
Dev.to · HelperX 🧠 Large Language Models ⚡ AI Lesson 4d ago
Feeding the LLM the Whole Thread: Prompt Chaining for Contextual Replies
I wrote earlier about our persona engine — how we make one LLM sound like many different people. But...
Redaction is Not Enough: When an LLM can still Infer the PII You Stripped Out
Dev.to · Trevor 🧠 Large Language Models ⚡ AI Lesson 4d ago
Redaction is Not Enough: When an LLM can still Infer the PII You Stripped Out
A reader left a sharp question on my last post about redacting PII before sending prompts to an LLM....