Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

25,183
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,693 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
LHAW: Controllable Underspecification for Long-Horizon Tasks
arXiv:2602.10525v2 Announce Type: replace-cross Abstract: Long-horizon workflow agents that operate effectively over extended periods are essential for truly au
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
The Art of Efficient Reasoning: Data, Reward, and Optimization
arXiv:2602.20945v3 Announce Type: replace-cross Abstract: Large Language Models (LLMs) consistently benefit from scaled Chain-of-Thought (CoT) reasoning, but al
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
On the Structural Non-Preservation of Epistemic Behaviour under Policy Transformation
arXiv:2602.21424v2 Announce Type: replace-cross Abstract: Reinforcement learning (RL) agents under partial observability often condition actions on internally a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
CIRCUS: Circuit Consensus under Uncertainty via Stability Ensembles
arXiv:2603.00523v2 Announce Type: replace-cross Abstract: Every mechanistic circuit carries an invisible asterisk: it reflects not just the model's computation,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Face Anti-Spoofing
arXiv:2603.01038v2 Announce Type: replace-cross Abstract: Face recognition remains vulnerable to presentation attacks, calling for robust Face Anti-Spoofing (FA
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
arXiv:2603.12180v2 Announce Type: replace-cross Abstract: Multimodal agents offer a promising path to automating complex document-intensive workflows. Yet, a cr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Prompt Injection as Role Confusion
arXiv:2603.12277v2 Announce Type: replace-cross Abstract: Language models remain vulnerable to prompt injection attacks despite extensive safety training. We tr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems
arXiv:2603.15727v2 Announce Type: replace-cross Abstract: Autonomous LLM-based agents increasingly operate as long-running processes forming densely interconnec
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
FEAT: A Linear-Complexity Foundation Model for Extremely Large Structured Data
arXiv:2603.16513v2 Announce Type: replace-cross Abstract: Structured data is foundational to healthcare, finance, e-commerce, and scientific data management. La
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
S3T-Former: A Purely Spike-Driven State-Space Topology Transformer for Skeleton Action Recognition
arXiv:2603.18062v2 Announce Type: replace-cross Abstract: Skeleton-based action recognition is crucial for multimedia applications but heavily relies on power-h
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Understanding Task Aggregation for Generalizable Ultrasound Foundation Models
arXiv:2603.18123v2 Announce Type: replace-cross Abstract: Foundation models promise to unify multiple clinical tasks within a single framework, but recent ultra
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Retrieval-Augmented LLMs for Security Incident Analysis
arXiv:2603.18196v2 Announce Type: replace-cross Abstract: Investigating cybersecurity incidents requires collecting and analyzing evidence from multiple log sou
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
R2-Dreamer: Redundancy-Reduced World Models without Decoders or Augmentation
arXiv:2603.18202v2 Announce Type: replace-cross Abstract: A central challenge in image-based Model-Based Reinforcement Learning (MBRL) is to learn representatio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
PlanTwin: Privacy-Preserving Planning Abstractions for Cloud-Assisted LLM Agents
arXiv:2603.18377v2 Announce Type: replace-cross Abstract: Cloud-hosted large language models (LLMs) have become the de facto planners in agentic systems, coordi
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Creating with Sora Safely
To address the novel safety challenges posed by a state-of-the-art video model as well as a new social creation platform, we’ve built Sora 2 and the Sora app wi
All We Need Is Memory, Dealing With The AI RAMpocalypse
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
All We Need Is Memory, Dealing With The AI RAMpocalypse
Nvidia announcements show the current shortage of storage and memory could continue into the future, driving up prices and the value of the companies that produ
Lossy self-improvement
Interconnects 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Lossy self-improvement
The case for why self-improvement is real but it doesn't lead to fast takeoff.
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Cursor admits its new coding model was built on top of Moonshot AI’s Kimi
Building on top of a Chinese model feels particularly fraught right now.
Amazon Alexa Plus: Panos Panay On How The ‘Brilliant’ New AI Is Ready Now
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Amazon Alexa Plus: Panos Panay On How The ‘Brilliant’ New AI Is Ready Now
Alexa+ is now available in the U.K., with careful localization. Amazon’s Panos Panay explains why the generative AI upgrade is ready for British homes.
AI Agents Wrote 80% Of Karpathy's Code. Junior Developers Are Paying The Price
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
AI Agents Wrote 80% Of Karpathy's Code. Junior Developers Are Paying The Price
OpenAI co-founder Andrej Karpathy says December 2025 was the inflection point. The data — and the job market — are beginning to agree.
2 Reasons I Turned Off My OpenClaw, My Personal AI Assistant
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
2 Reasons I Turned Off My OpenClaw, My Personal AI Assistant
this article explains the reasons that Paul Baier stopped using OpenClaw. These are the rawness of the software and lack of security
Vehicle AI Has A Blind Spot: Tesla FSD And GM Super Cruise In Focus
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Vehicle AI Has A Blind Spot: Tesla FSD And GM Super Cruise In Focus
Tesla Full Self-Driving and General Motors Super Cruise are seminal technologies. But vehicle AI is not flawless and drivers don’t always understand its limitat
Towards Data Science 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Prompt Caching with the OpenAI API: A Full Hands-On Python tutorial
A step-by-step guide to making your OpenAI apps faster, cheaper, and more efficient The post Prompt Caching with the OpenAI API: A Full Hands-On Python tutorial
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
An exclusive tour of Amazon’s Trainium lab, the chip that’s won over Anthropic, OpenAI, even Apple
Shortly after Amazon announced its $50 billion investment in OpenAI, AWS invited me on a private tour of the chip lab at the heart of the deal.
The Real AI Race Is Not The One You Think
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
The Real AI Race Is Not The One You Think
Beneath the highly visible yet often counterproductive AI consumption race lies a far more consequential one: the race for AI production.
Why We Don’t Have More AI Power Users In The Age Of AI
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Why We Don’t Have More AI Power Users In The Age Of AI
It's critical is to identify and bring along the power users who will expand the capabilities of AI. But where are they?
InfoQ AI/ML 🧠 Large Language Models ⚡ AI Lesson 1mo ago
QCon London AI Coding State of the Game: More Capable, More Expensive, More Dangerous Coding Agents
In her QCon London keynote, Birgitta Böckeler, AI-Coding lead at Thoughtworks, reflected on the changes in the AI coding space over the past year. She emphasise
Taxonomy For Creating AI Personas In Mental Health Encompassing Therapists, Clients, Supervisors, Evaluators
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Taxonomy For Creating AI Personas In Mental Health Encompassing Therapists, Clients, Supervisors, Evaluators
I have created four sets of taxonomies checklists to invoke AI personas for a synthetic therapist, client, therapist-supervisor, and therapy evaluator. An AI In
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Publisher pulls horror novel ‘Shy Girl’ over AI concerns
Hachette Book Group said it will not be publishing “Shy Girl” over concerns that artificial intelligence was used to generate the text.
Apple Blocks Vibe Coding Tools From Store
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Apple Blocks Vibe Coding Tools From Store
Apple restricts vibe-coding apps; coding itself legal, but publishing insecure AI apps may trigger liability.
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Why Wall Street wasn’t won over by Nvidia’s big conference
Despite investor fears of an AI bubble, Nvidia's latest conference shows that most in the industry aren't concerned by that possibility.
AI’s Missing Capability Is Not Intelligence But Integrity
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
AI’s Missing Capability Is Not Intelligence But Integrity
Recent developments in AI make this clear: an AI system with intelligence but without integrity is structurally unfit for civilization.
Hacker News (AI) 🧠 Large Language Models ⚡ AI Lesson 1mo ago
The next phase of artificial intelligence may require different processors
Article URL: https://www.economist.com/science-and-technology/2026/03/18/the-next-phase-of-artificial-intelligence-may-require-very-different-processors Comment
I Tried DoorDash’s Tasks App and Saw the Bleak Future of AI Gig Work
Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
I Tried DoorDash’s Tasks App and Saw the Bleak Future of AI Gig Work
I recorded videos of myself doing laundry, scrambling eggs, and walking around the park in DoorDash’s new Tasks app, where gig workers are paid to train AI.
ZDNet AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
4 tips for building better AI agents that your business can trust
Agents are coming. Here are four ways to prepare for the AI-powered workplace revolution.
ZDNet AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
These 7 handy ChatGPT settings are off by default - here's what you're missing
Stop using ChatGPT on factory settings. Here are the top adjustments I use to make it a pro tool.
Inside ByteDance’s Monolith: The Engine Powering Smarter, Faster Content Feeds
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Inside ByteDance’s Monolith: The Engine Powering Smarter, Faster Content Feeds
Monolith is ByteDance’s real-time recommendation system that updates itself using live user behavior instead of waiting for batch retraining. It solves major is
This New AI Model Could Replace Half Your Coding Workflow
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 1mo ago
This New AI Model Could Replace Half Your Coding Workflow
IBM’s Granite Code models are a new family of AI systems built to handle real-world coding tasks—writing, fixing, explaining, and translating code across 116 la
OpenAI Is Toning Down The Cringe Factor Of ChatGPT But Those Smarmy AI Responses Will Still Make You Shudder
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
OpenAI Is Toning Down The Cringe Factor Of ChatGPT But Those Smarmy AI Responses Will Still Make You Shudder
AI is being cringy. OpenAI has decided to reduce the cringe factor of ChatGPT. Here's the deal. Don't expect miracles. An AI Insider scoop.
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
New court filing reveals Pentagon told Anthropic the two sides were nearly aligned — a week after Trump declared the relationship kaput
Anthropic submitted two sworn declarations to a California federal court late Friday afternoon, pushing back on the Pentagon's assertion that the AI company pos
Anthropic Denies It Could Sabotage AI Tools During War
Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Anthropic Denies It Could Sabotage AI Tools During War
The Department of Defense alleges the AI developer could manipulate models in the middle of war. Company executives argue that’s impossible.
Nvidia Bet $1 Trillion On AI That Never Clocks Out
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Nvidia Bet $1 Trillion On AI That Never Clocks Out
Every GTC announcement made sense once you understood the single question Nvidia was answering: what happens to compute demand when AI stops waiting to be asked
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Microsoft rolls back some of its Copilot AI bloat on Windows
The company is reducing Copilot entry points on Windows, starting with Photos, Widgets, Notepad, and other apps.
Why Your AI Doesn’t Need the Cloud to Run Faster Anymore
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Why Your AI Doesn’t Need the Cloud to Run Faster Anymore
This paper shows how to speed up AI on edge devices by splitting neural networks across multiple machines instead of relying on the cloud. It introduces a metho
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
What happened at Nvidia GTC: NemoClaw, Robot Olaf, and a $1 trillion bet
CEO Jensen Huang took the stage at Nvidia’s GTC conference this week in his signature leather jacket to deliver a two-and-a-half-hour keynote, projecting $1 tri
Bits, Atoms, And What Comes Next
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Bits, Atoms, And What Comes Next
AI is now foundational infrastructure. The next decade belongs to cyber-physical systems, distributed compute and those who control the stack from algorithm to
Gamers Hate Nvidia's DLSS 5. Developers Aren’t Crazy About It, Either
Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Gamers Hate Nvidia's DLSS 5. Developers Aren’t Crazy About It, Either
Nvidia’s new AI upscaling gaming technology struck gamers as uncanny and off-putting. Developers don't seem to like it, either, but it could be “the default” in
ZDNet AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
OpenAI's rumored 'superapp' could finally solve one of my biggest issues with ChatGPT
Commentary: OpenAI reportedly wants one app for everything, and I'm here for it.