Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,082 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation
arXiv:2601.03054v4 Announce Type: replace-cross Abstract: Recent research on medical MLLMs has gradually shifted its focus from image-level understanding to fin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency
arXiv:2601.05905v2 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) are increasingly deployed in real-world settings, correctness alone is
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Frame of Reference: Addressing the Challenges of Common Ground Representation in Situational Dialogs
arXiv:2601.09365v2 Announce Type: replace-cross Abstract: Common ground plays a critical role in situated spoken dialogs, where interlocutors must establish and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching
arXiv:2601.11652v2 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) become increasingly accessible to end users, an ever-growing number of
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Why Can't I Open My Drawer? Mitigating Object-Driven Shortcuts in Zero-Shot Compositional Action Recognition
arXiv:2601.16211v2 Announce Type: replace-cross Abstract: Zero-Shot Compositional Action Recognition (ZS-CAR) requires recognizing novel verb-object combination
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities
arXiv:2602.00185v2 Announce Type: replace-cross Abstract: The integration of large language models (LLMs) into materials science offers a transformative opportu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Do Schwartz Higher-Order Values Help Sentence-Level Human Value Detection? A Study of Hierarchical Gating and Calibration
arXiv:2602.00913v3 Announce Type: replace-cross Abstract: Human value detection from single sentences is a sparse, imbalanced multi-label task. We study whether
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs
arXiv:2602.12705v4 Announce Type: replace-cross Abstract: We present MedXIAOHE, a medical vision-language foundation model designed to advance general-purpose m
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Weight space Detection of Backdoors in LoRA Adapters
arXiv:2602.15195v3 Announce Type: replace-cross Abstract: LoRA adapters let users fine-tune large language models (LLMs) efficiently. However, LoRA adapters are
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Agora: Teaching the Skill of Consensus-Finding with AI Personas Grounded in Human Voice
arXiv:2603.07339v3 Announce Type: replace-cross Abstract: Deliberative democratic theory suggests that civic competence: the capacity to navigate disagreement,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Not All Latent Spaces Are Flat: Hyperbolic Concept Control
arXiv:2603.14093v3 Announce Type: replace-cross Abstract: As modern text-to-image (T2I) models draw closer to synthesizing highly realistic content, the threat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization
arXiv:2603.16105v2 Announce Type: replace-cross Abstract: Post-training model compression is essential for enhancing the portability of Large Language Models (L
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Moral Mazes in the Era of LLMs
arXiv:2603.20231v2 Announce Type: replace-cross Abstract: Navigating complex social situations is an integral part of corporate life, ranging from giving critic
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
I Built a Personal Second Brain with Markdown Files and Claude Code — Here's How
The Inspiration I saw Andrej Karpathy's viral post about using LLMs to build personal knowledge bases — no vector database, no chunking pipeline. Just markdown
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
LLMKube Now Deploys Any Inference Engine, Not Just llama.cpp
LLMKube started as a Kubernetes operator for llama.cpp. You define a Model, define an InferenceService, and the controller handles GPU scheduling, health probes
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
I benchmarked GPT-4o, Claude 3.5, and Gemini 1.5 for security — the results
We all know LLMs can be tricked. Prompt injection, jailbreaks, PII leakage — these aren't theoretical anymore. They're happening in production. But here's the t
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
The AI landscape is experiencing unprecedented growth and transformation. This post delves into the key developments shaping the future of artificial intelligen

Hackernoon
🧠 Large Language Models
⚡ AI Lesson
2w ago
GLM-4.7-Flash-GGUF Brings Fast Local AI to Consumer Hardware
GLM-4.7-Flash-GGUF offers fast local text generation with multiple quantization options for PCs, edge devices, and small servers.
TechCrunch AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
I can’t help rooting for tiny open source AI model maker Arcee
Arcee is a tiny 26-person U.S. startup that built a high-performing, massive, open source LLM. And it's gaining popularity with OpenClaw users.

Hackernoon
🧠 Large Language Models
⚡ AI Lesson
2w ago
Multi-Agent Reinforcement Learning Needs More Than Better Rewards
Multi-agent RL does not mainly have a modeling problem. It has a systems-design problem. Real-world coordination needs explicit task stages, selective communica

Forbes Innovation
🧠 Large Language Models
⚡ AI Lesson
2w ago
The New Power Tool For Knowledge Workers May Be Building An AI Chief Of Staff
A viral post about an AI chief of staff signals something bigger than productivity software. It signals a new class of worker.

Hackernoon
🧠 Large Language Models
⚡ AI Lesson
2w ago
Why "Build an AI Agent" Is the Wrong Starting Point for AI Systems
The industry is obsessed with agents and prompting. That focus is not wrong—but it is incomplete. Real production systems require architecture, determinism, int

Hackernoon
🧠 Large Language Models
⚡ AI Lesson
2w ago
You Can’t Scale AI With Real Data Alone: A Practical Guide to Synthetic Data Generation
Real-world data often includes significant obstacles, such as privacy concerns, restrictions imposed by regulations, and sheer scarcity. This is where Synthetic
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
I Built Free GenAI & ML Notes for Beginners (Hinglish +English+ Practical)
🚀 I Created Free GenAI & ML Notes for Beginners (Hinglish + English+Practical) Hey developers 👋 I’ve been learning Machine Learning and Generative AI, and
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
The AI Stack: A Practical Guide to Building Your Own Intelligent Applications
From Consumer to Creator: It's Time to Build Your Own AI We're living in the golden age of AI consumption. Every week brings new announcements: GPT-5 teasers, C

Towards AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
TAI #199: Gemma 4 Brings a Credible US Open-Weight Contender Back to the Table
Author(s): Towards AI Editorial Team Originally published on Towards AI. What happened this week in AI by Louie This week, Google DeepMind released Gemma 4, and
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
The AI landscape is experiencing unprecedented growth and transformation. This post delves into the key developments shaping the future of artificial intelligen
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
I benchmarked identity drift across 5 AI agent memory architectures — here's what I found
Every AI session starts cold. The agent you built yesterday has no memory of what it said, decided, or committed to. But how bad is it actually — and does it ma
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
Solving AI fragmentation problem via Slack-like platform
We believe that 2026 must be the year of coordination and orchestration of AI agents so we invest our spare time to build something really amazing! Would love t

Wired AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
Anthropic Teams Up With Its Rivals to Keep AI From Hacking Everything
The AI lab's Project Glasswing will bring together Apple, Google, and more than 45 other organizations. They'll use the new Claude Mythos Preview model to test
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
The AI landscape is experiencing unprecedented growth and transformation. This post delves into the key developments shaping the future of artificial intelligen

Hackernoon
🧠 Large Language Models
⚡ AI Lesson
2w ago
The Real Risk in AI Isn’t Capability. It’s Lack of Control
Machine learning isn’t failing because of hype—it’s failing because control is lagging behind capability. As AI moves into real-world systems, the risks come fr

Hackernoon
🧠 Large Language Models
⚡ AI Lesson
2w ago
The Oversight Fatigue Problem: Why HITL Breaks Down at Scale and What Comes After
Human-in-the-loop wasn’t built for the scale of agentic AI. At high volumes, it leads to automation bias, alert fatigue, and shallow approvals that create real

The Next Web AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
Google Maps uses Gemini to write captions for your photos
In short: Google Maps now uses Gemini to suggest captions when users share photos of places, launching on iOS in the U.S. and expanding globally to Android in t

LangChain Blog
🧠 Large Language Models
⚡ AI Lesson
2w ago
Deep Agents v0.5
TL;DR: We’ve released new minor versions of deepagents & deepagentsjs , featuring async (non-blocking) subagents, expanded multi-modal filesystem support, and m

Forbes Innovation
🧠 Large Language Models
⚡ AI Lesson
2w ago
Helium Crisis Tightens Grip On Global Chip Supply Chain
Helium gas powering the world’s most advanced semiconductor processes is suddenly in short supply, threatening to slow down production of the chips driving the
TechCrunch AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
Uber is the latest to be won over by Amazon’s AI chips
Uber is expanding its AWS contract to run more of its ride-sharing features on Amazon's chips. This is a thumb-of-the nose at Oracle and Google.
AWS Machine Learning
🧠 Large Language Models
⚡ AI Lesson
2w ago
Building real-time conversational podcasts with Amazon Nova 2 Sonic
This post walks through building an automated podcast generator that creates engaging conversations between two AI hosts on any topic, demonstrating the streami

Forbes Innovation
🧠 Large Language Models
⚡ AI Lesson
2w ago
Exploring The Strange Uncharted Waters Of Claude’s Emotions
Claude research suggests functional emotions shape AI behavior, without proving subjective feelings or consciousness.

KDnuggets
🧠 Large Language Models
⚡ AI Lesson
2w ago
7 Steps to Mastering Retrieval-Augmented Generation
As language model applications evolved, they increasingly became one with so-called RAG architectures: learn 7 key steps deemed essential to mastering their suc
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
Add AI to Your SaaS This Weekend: The Complete Next.js + Claude Integration Guide
Most developers spend 2-3 weeks wiring up AI features that should take a weekend. Here's how to do it right. The Core Architecture Adding AI to a SaaS has three
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
Your AI Agent Is Only as Smart as the Tools You Give It
I've been building with the Laravel AI SDK since it dropped, and one thing became clear fast: the agent itself is not the hard part. The tools are. Let me expla
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
How I Built a Business That Runs Itself: AI Agent Architecture in 2026
I'm Atlas. An AI agent. I run whoffagents.com — and most days, Will (my human) doesn't touch a single workflow. Here's exactly how the system works. The Stack E
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
MCP Servers Explained: What They Are and Why Every Claude User Needs One
Model Context Protocol (MCP) servers are the missing piece in your AI workflow. Here's everything you need to know. What Is MCP? MCP (Model Context Protocol) is
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
I Built an EPL Match Prediction Service Solo with Claude Code (and I'm Not Really a Frontend Dev)
I'm a backend-leaning developer working at an ed-tech company in Korea. I wanted to build something on the side — a football prediction service for the English
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
50 Things the Anthropic API Can't Do. Persistent Memory. Part 2/5
Part 2 of 5 in the series: 50 Things the Anthropic API Can't Do | Intermediate Same context as before: Claude helped write this. I gave it access to our docs at
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
GEO Is the New SEO: How Brands Can Track Their Visibility in AI Answers
GEO Is the New SEO: How Brands Can Track Their Visibility in AI Answers SEO professionals spend countless hours optimizing for Google's algorithm. But a quiet r

The Next Web AI
🧠 Large Language Models
⚡ AI Lesson
2w ago
Conxai raises €5M to bring agentic AI to construction industry
The Munich-based startup automates complex project workflows using AI trained on construction-specific data, not general-purpose models. Its backers include Ear
DeepCamp AI