Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,472
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,082 reads from curated sources

Anthropic in talks to invest $200m in private equity venture to push Claude deeper into enterprise
The Next Web AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
Anthropic in talks to invest $200m in private equity venture to push Claude deeper into enterprise
In short: Anthropic is in negotiations to anchor a new joint venture with Blackstone, Hellman & Friedman, and Permira that would embed Claude across private equ
10 LLM Engineering Concepts Explained in 10 Minutes
KDnuggets 🧠 Large Language Models ⚡ AI Lesson 2w ago
10 LLM Engineering Concepts Explained in 10 Minutes
The 10 concepts every LLM engineer swears by to build reliable AI systems.
World Models Are Shaping the Next Frontier of AI
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 2w ago
World Models Are Shaping the Next Frontier of AI
AMI Labs has raised over $1 billion to develop “world models,” a new class of AI systems focused on understanding physical reality rather than predicting langua
AI And Criminal Justice: How Does It Work?
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 2w ago
AI And Criminal Justice: How Does It Work?
AI in criminal justice demands human oversight, bias awareness, and education to protect fairness and humanity
The Claude Code Leak Didn’t Hurt Cursor. It Forced a More Honest Competition.
Towards AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
The Claude Code Leak Didn’t Hurt Cursor. It Forced a More Honest Competition.
Author(s): Siddhant Nitin Patil Originally published on Towards AI. On March 31, 2026, 512,000 lines of Claude Code’s source code hit the public internet. Withi
ChatGPT’s Secret Codes: 30 Commands That Can Save You Hours
Towards AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
ChatGPT’s Secret Codes: 30 Commands That Can Save You Hours
Author(s): Yelpin Sergey Originally published on Towards AI. Picture this: you ask ChatGPT to write copy for a landing page. Technically, the result looks fine.
This 15B Model Generates Talking Videos With Synced Audio From Text
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 2w ago
This 15B Model Generates Talking Videos With Synced Audio From Text
daVinci-MagiHuman is a 15B unified model that generates synchronized video and speech from text prompts with fast, high-quality results.
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
Where Does Truth Live in AI-Generated Code?
The Problem Isn't Tests—It's Authority Talk about AI-generated code long enough, and you'll hit this question. "Tests pass, so what's the problem?" Not wrong. B
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
ChatGPT Prompt Engineering for Freelancers: Unlocking the Power of AI for Business Growth
ChatGPT Prompt Engineering for Freelancers: Unlocking the Power of AI for Business Growth As a freelancer, staying ahead of the curve is crucial for success. On
Emotionally Manipulating AI And Not Letting AI Sneakily Emotionally Manipulate You
Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 2w ago
Emotionally Manipulating AI And Not Letting AI Sneakily Emotionally Manipulate You
Research reveals that AI contains emotion vectors. You can invoke those vectors. You can avoid them. This is about emotional manipulations. An AI Insider scoop.
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
How AI Automates Real Estate Lead Response in 2026
In today’s real estate market, speed is everything. The difference between winning and losing a deal often comes down to how quickly you respond to a new inquir
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
Build Your First MCP Server in Python: From Zero to Claude Integration
Building an MCP server gives Claude direct access to your services, databases, and APIs. Here's how to build one from scratch in Python. What You're Building An
Anthropic signs biggest compute deal yet with Google and Broadcom as revenue run rate hits $30bn
The Next Web AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
Anthropic signs biggest compute deal yet with Google and Broadcom as revenue run rate hits $30bn
In short: Anthropic has agreed to access approximately 3.5 gigawatts of next-generation Google TPU compute capacity via Broadcom from 2027, its largest infrastr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
IC3-Evolve: Proof-/Witness-Gated Offline LLM-Driven Heuristic Evolution for IC3 Hardware Model Checking
arXiv:2604.03232v1 Announce Type: new Abstract: IC3, also known as property-directed reachability (PDR), is a commonly-used algorithm for hardware safety model
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Position: Science of AI Evaluation Requires Item-level Benchmark Data
arXiv:2604.03244v1 Announce Type: new Abstract: AI evaluations have become the primary evidence for deploying generative AI systems across high-stakes domains.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Toward Full Autonomous Laboratory Instrumentation Control with Large Language Models
arXiv:2604.03286v1 Announce Type: new Abstract: The control of complex laboratory instrumentation often requires significant programming expertise, creating a b
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Evaluating Artificial Intelligence Through a Christian Understanding of Human Flourishing
arXiv:2604.03356v1 Announce Type: new Abstract: Artificial intelligence (AI) alignment is fundamentally a formation problem, not only a safety problem. As Large
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
VERT: Reliable LLM Judges for Radiology Report Evaluation
arXiv:2604.03376v1 Announce Type: new Abstract: Current literature on radiology report evaluation has focused primarily on designing LLM-based metrics and fine-
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
TABQAWORLD: Optimizing Multimodal Reasoning for Multi-Turn Table Question Answering
arXiv:2604.03393v1 Announce Type: new Abstract: Multimodal reasoning has emerged as a powerful framework for enhancing reasoning capabilities of reasoning model
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Contextual Control without Memory Growth in a Context-Switching Task
arXiv:2604.03479v1 Announce Type: new Abstract: Context-dependent sequential decision making is commonly addressed either by providing context explicitly as an
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Beyond Predefined Schemas: TRACE-KG for Context-Enriched Knowledge Graphs from Complex Documents
arXiv:2604.03496v1 Announce Type: new Abstract: Knowledge graph construction typically relies either on predefined ontologies or on schema-free extraction. Onto
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Resource-Conscious Modeling for Next- Day Discharge Prediction Using Clinical Notes
arXiv:2604.03498v1 Announce Type: new Abstract: Timely discharge prediction is essential for optimizing bed turnover and resource allocation in elective spine s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
BioAlchemy: Distilling Biological Literature into Reasoning-Ready Reinforcement Learning Training Data
arXiv:2604.03506v1 Announce Type: new Abstract: Despite the large corpus of biology training text, the impact of reasoning models on biological research general
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Structural Rigidity and the 57-Token Predictive Window: A Physical Framework for Inference-Layer Governability in Large Language Models
arXiv:2604.03524v1 Announce Type: new Abstract: Current AI safety relies on behavioral monitoring and post-training alignment, yet empirical measurement shows t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Automated Analysis of Global AI Safety Initiatives: A Taxonomy-Driven LLM Approach
arXiv:2604.03533v1 Announce Type: new Abstract: We present an automated crosswalk framework that compares an AI safety policy document pair under a shared taxon
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
When Do Hallucinations Arise? A Graph Perspective on the Evolution of Path Reuse and Path Compression
arXiv:2604.03557v1 Announce Type: new Abstract: Reasoning hallucinations in large language models (LLMs) often appear as fluent yet unsupported conclusions that
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
When Adaptive Rewards Hurt: Causal Probing and the Switching-Stability Dilemma in LLM-Guided LEO Satellite Scheduling
arXiv:2604.03562v1 Announce Type: new Abstract: Adaptive reward design for deep reinforcement learning (DRL) in multi-beam LEO satellite scheduling is motivated
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Selective Forgetting for Large Reasoning Models
arXiv:2604.03571v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) generate structured chains of thought (CoTs) before producing final answers, makin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Entropy and Attention Dynamics in Small Language Models: A Trace-Level Structural Analysis on the TruthfulQA Benchmark
arXiv:2604.03589v1 Announce Type: new Abstract: Small language models (SLMs) have been increasingly deployed in edge devices and other resource-constrained sett
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
A Multimodal Foundation Model of Spatial Transcriptomics and Histology for Biological Discovery and Clinical Prediction
arXiv:2604.03630v1 Announce Type: new Abstract: Spatial transcriptomics (ST) enables gene expression mapping within anatomical context but remains costly and lo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Beyond Retrieval: Modeling Confidence Decay and Deterministic Agentic Platforms in Generative Engine Optimization
arXiv:2604.03656v1 Announce Type: new Abstract: Generative Engine Optimization (GEO) is rapidly reshaping digital marketing paradigms in the era of Large Langua
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
TableVision: A Large-Scale Benchmark for Spatially Grounded Reasoning over Complex Hierarchical Tables
arXiv:2604.03660v1 Announce Type: new Abstract: Structured tables are essential for conveying high-density information in professional domains such as finance,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
PRAISE: Prefix-Based Rollout Reuse in Agentic Search Training
arXiv:2604.03675v1 Announce Type: new Abstract: In agentic search, large language models (LLMs) are trained to perform multi-turn retrieval and reasoning for co
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Structured Multi-Criteria Evaluation of Large Language Models with Fuzzy Analytic Hierarchy Process and DualJudge
arXiv:2604.03742v1 Announce Type: new Abstract: Effective evaluation of large language models (LLMs) remains a critical bottleneck, as conventional direct scori
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Affording Process Auditability with QualAnalyzer: An Atomistic LLM Analysis Tool for Qualitative Research
arXiv:2604.03820v1 Announce Type: new Abstract: Large language models are increasingly used for qualitative data analysis, but many workflows obscure how analyt
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
PolySwarm: A Multi-Agent Large Language Model Framework for Prediction Market Trading and Latency Arbitrage
arXiv:2604.03888v1 Announce Type: new Abstract: This paper presents PolySwarm, a novel multi-agent large language model (LLM) framework designed for real-time p
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
FeynmanBench: Benchmarking Multimodal LLMs on Diagrammatic Physics Reasoning
arXiv:2604.03893v1 Announce Type: new Abstract: Breakthroughs in frontier theory often depend on the combination of concrete diagrammatic notations with rigorou
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
LLM-Agent-based Social Simulation for Attitude Diffusion
arXiv:2604.03898v1 Announce Type: new Abstract: This paper introduces discourse_simulator, an open-source framework that combines LLMs with agent-based modellin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
CODE-GEN: A Human-in-the-Loop RAG-Based Agentic AI System for Multiple-Choice Question Generation
arXiv:2604.03926v1 Announce Type: new Abstract: We present CODE-GEN, a human-in-the-Loop, retrieval-augmented generation (RAG)-based agentic AI system for gener
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
FactReview: Evidence-Grounded Reviews with Literature Positioning and Execution-Based Claim Verification
arXiv:2604.04074v1 Announce Type: new Abstract: Peer review in machine learning is under growing pressure from rising submission volume and limited reviewer tim
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
InsTraj: Instructing Diffusion Models with Travel Intentions to Generate Real-world Trajectories
arXiv:2604.04106v1 Announce Type: new Abstract: The generation of realistic and controllable GPS trajectories is a fundamental task for applications in urban pl
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Profile-Then-Reason: Bounded Semantic Complexity for Tool-Augmented Language Agents
arXiv:2604.04131v1 Announce Type: new Abstract: Large language model agents that use external tools are often implemented through reactive execution, in which r
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Solar-VLM: Multimodal Vision-Language Models for Augmented Solar Power Forecasting
arXiv:2604.04145v1 Announce Type: new Abstract: Photovoltaic (PV) power forecasting plays a critical role in power system dispatch and market participation. Bec
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents
arXiv:2604.04157v1 Announce Type: new Abstract: Theory of Mind (ToM) -- the ability to model others' mental states -- is fundamental to human social cognition.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
A Model of Understanding in Deep Learning Systems
arXiv:2604.04171v1 Announce Type: new Abstract: I propose a model of systematic understanding, suitable for machine learning systems. On this account, an agent
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
CoALFake: Collaborative Active Learning with Human-LLM Co-Annotation for Cross-Domain Fake News Detection
arXiv:2604.04174v1 Announce Type: new Abstract: The proliferation of fake news across diverse domains highlights critical limitations in current detection syste
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Comparative reversal learning reveals rigid adaptation in LLMs under non-stationary uncertainty
arXiv:2604.04182v1 Announce Type: new Abstract: Non-stationary environments require agents to revise previously learned action values when contingencies change.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Schema-Aware Planning and Hybrid Knowledge Toolset for Reliable Knowledge Graph Triple Verification
arXiv:2604.04190v1 Announce Type: new Abstract: Knowledge Graphs (KGs) serve as a critical foundation for AI systems, yet their automated construction inevitabl