Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,925
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,465 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
From Prompts to Packets: A View from the Network on ChatGPT, Copilot, and Gemini
arXiv:2510.11269v2 Announce Type: replace-cross Abstract: GenAI chatbots are now pervasive in digital ecosystems, fundamentally reshaping user interactions over
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Beyond Multi-Token Prediction: Pretraining LLMs with Future Summaries
arXiv:2510.14751v2 Announce Type: replace-cross Abstract: Next-token prediction (NTP) has driven the success of large language models (LLMs), but it struggles w
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
OffSim: Offline Simulator for Model-based Offline Inverse Reinforcement Learning
arXiv:2510.15495v2 Announce Type: replace-cross Abstract: Reinforcement learning algorithms typically utilize an interactive simulator (i.e., environment) with
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Is Multilingual LLM Watermarking Truly Multilingual? Scaling Robustness to 100+ Languages via Back-Translation
arXiv:2510.18019v2 Announce Type: replace-cross Abstract: Multilingual watermarking aims to make large language model (LLM) outputs traceable across languages,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations
arXiv:2511.06767v2 Announce Type: replace-cross Abstract: Transformer-based models have revolutionized computer vision (CV) and natural language processing (NLP
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Uncertainty Makes It Stable: Curiosity-Driven Quantized Mixture-of-Experts
arXiv:2511.11743v3 Announce Type: replace-cross Abstract: Deploying deep neural networks on resource-constrained devices faces two critical challenges: maintain
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Reward Engineering for Spatial Epidemic Simulations: A Reinforcement Learning Platform for Individual Behavioral Learning
arXiv:2511.18000v2 Announce Type: replace-cross Abstract: We present ContagionRL, a Gymnasium-compatible reinforcement learning platform specifically designed f
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image Generation
arXiv:2511.18281v3 Announce Type: replace-cross Abstract: Diffusion models (DMs) produce high-quality images, yet their sampling remains costly when adapted to
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
E0: Enhancing Generalization and Fine-Grained Control in VLA Models via Tweedie Discrete Diffusion
arXiv:2511.21542v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models offer a unified framework for robotic manipulation by integrating
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Goal-Oriented Multi-Agent Semantic Networking: Unifying Intents, Semantics, and Intelligence
arXiv:2512.01035v2 Announce Type: replace-cross Abstract: 6G services are evolving toward goal-oriented and AI-native communication, which are expected to deliv
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
From Panel to Pixel: Zoom-In Vision-Language Pretraining from Biomedical Scientific Literature
arXiv:2512.02566v2 Announce Type: replace-cross Abstract: There is a growing interest in developing strong biomedical vision-language models. A popular approach
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
arXiv:2512.04000v2 Announce Type: replace-cross Abstract: The application of Large Multimodal Models (LMMs) to long-form video understanding is constrained by l
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Collaborative Causal Sensemaking: Closing the Complementarity Gap in Human-AI Decision Support
arXiv:2512.07801v5 Announce Type: replace-cross Abstract: LLM-based agents are increasingly deployed for expert decision support, yet human-AI teams in high-sta
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
ODMA: On-Demand Memory Allocation Strategy for LLM Serving on LPDDR-Class Accelerators
arXiv:2512.09427v3 Announce Type: replace-cross Abstract: Existing memory management techniques severely hinder efficient Large Language Model serving on accele
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Physics-driven human-like working memory outperforms digital networks in dynamic vision
arXiv:2512.15829v3 Announce Type: replace-cross Abstract: While the unsustainable energy cost of artificial intelligence necessitates physics-driven computing,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Deep Neural Networks as Discrete Dynamical Systems: Implications for Physics-Informed Learning
arXiv:2601.00473v2 Announce Type: replace-cross Abstract: We revisit the analogy between feed-forward deep neural networks (DNNs) and discrete dynamical systems
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection
arXiv:2601.09195v2 Announce Type: replace-cross Abstract: Supervised fine-tuning (SFT) is a fundamental post-training strategy to align Large Language Models (L
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset
arXiv:2601.10305v3 Announce Type: replace-cross Abstract: Vision-Language Pre-training (VLP) models have achieved remarkable success by leveraging large-scale i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
PASTA: A Scalable Framework for Multi-Policy AI Compliance Evaluation
arXiv:2601.11702v2 Announce Type: replace-cross Abstract: AI compliance is becoming increasingly critical as AI systems grow more powerful and pervasive. Yet th
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
HalluJudge: A Reference-Free Hallucination Detection for Context Misalignment in Code Review Automation
arXiv:2601.19072v2 Announce Type: replace-cross Abstract: Large Language models (LLMs) have shown strong capabilities in code review automation, such as review
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making
arXiv:2602.02378v2 Announce Type: replace-cross Abstract: As LLMs expand from assistance to decision support, a dangerous pattern emerges: fluent agreement with
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
SPARE: Self-distillation for PARameter-Efficient Removal
arXiv:2602.07058v2 Announce Type: replace-cross Abstract: Machine Unlearning aims to remove the influence of specific data or concepts from trained models while
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
On Randomness in Agentic Evals
arXiv:2602.07150v3 Announce Type: replace-cross Abstract: Agentic systems are evaluated on benchmarks where agents interact with environments to solve tasks. Mo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering
arXiv:2602.07906v4 Announce Type: replace-cross Abstract: Autonomous Machine Learning Engineering (MLE) requires agents to perform sustained, iterative optimiza
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
arXiv:2602.16485v2 Announce Type: replace-cross Abstract: Existing Multi-Agent Systems (MAS) typically rely on homogeneous model configurations, failing to expl
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Smooth Gate Functions for Soft Advantage Policy Optimization
arXiv:2602.19345v2 Announce Type: replace-cross Abstract: Group Relative Policy Optimization (GRPO) has significantly advanced the training of large language mo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parametric Policies
arXiv:2602.23811v3 Announce Type: replace-cross Abstract: We investigate the theoretical aspects of offline reinforcement learning (RL) under general function a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings
arXiv:2603.09643v3 Announce Type: replace-cross Abstract: Current evaluation frameworks and benchmarks for LLM powered agents focus on text chat driven agents,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Exploring Collatz Dynamics with Human-LLM Collaboration
arXiv:2603.11066v3 Announce Type: replace-cross Abstract: We develop a structural and quantitative framework for analyzing the Collatz map through modular dynam
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Red-Teaming Vision-Language-Action Models via Quality Diversity Prompt Generation for Robust Robot Policies
arXiv:2603.12510v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models have significant potential to enable general-purpose robotic syste
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents
arXiv:2603.12564v3 Announce Type: replace-cross Abstract: Tool-augmented LLM agents increasingly serve as multi-turn advisors in high-stakes domains, yet their
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Geometry-Guided Camera Motion Understanding in VideoLLMs
arXiv:2603.13119v2 Announce Type: replace-cross Abstract: Camera motion is a fundamental geometric signal that shapes visual perception and cinematic style, yet
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning
arXiv:2603.14867v2 Announce Type: replace-cross Abstract: Many strategic decision-making problems, such as environment design for warehouse robots, can be natur
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models
arXiv:2603.15970v3 Announce Type: replace-cross Abstract: Several data warehouse and database providers have recently introduced extensions to SQL called AI Que
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval
arXiv:2603.17872v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have achieved unprecedented fluency but remain susceptible to "hallucinat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Evolutionarily Stable Stackelberg Equilibrium
arXiv:2603.18385v2 Announce Type: replace-cross Abstract: We present a new solution concept called evolutionarily stable Stackelberg equilibrium (SESS). We stud
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models
arXiv:2603.20957v2 Announce Type: replace-cross Abstract: Frontier LLM companies have repeatedly assured courts and regulators that their models do not store co
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection
arXiv:2603.21576v2 Announce Type: replace-cross Abstract: Long-context LLM inference is bottlenecked not by compute but by the O(n) memory bandwidth cost of sca
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Sim-to-Real of Humanoid Locomotion Policies via Joint Torque Space Perturbation Injection
arXiv:2603.21853v2 Announce Type: replace-cross Abstract: This paper proposes a novel alternative to existing sim-to-real methods for training control policies
Qwen3.5-9b-uncensored-hauhaucs-Aggressive Model: A Beginner's Guide to Get You Started
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Qwen3.5-9b-uncensored-hauhaucs-Aggressive Model: A Beginner's Guide to Get You Started
Qwen3.5-9B-Uncensored-HauhauCS-Aggressive is an uncensored variant of the base model created by Hauhau CS. This 9-billion parameter model removes safety filters
AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Unlocking video insights at scale with Amazon Bedrock multimodal models
In this post, we explore how the multimodal foundation models (FMs) of Amazon Bedrock enable scalable video understanding through three distinct architectural a
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Melania Trump wants a robot to homeschool your child
The first lady sees AI and robotics playing a prominent role in the future of American education.
AWS Machine Learning 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Deploy voice agents with Pipecat and Amazon Bedrock AgentCore Runtime – Part 1
In this series of posts, you will learn how streaming architectures help address these challenges using Pipecat voice agents on Amazon Bedrock AgentCore Runtime
There’s Something Very Dark About a Lot of Those Viral AI Fruit Videos
Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
There’s Something Very Dark About a Lot of Those Viral AI Fruit Videos
With female AI fruit being fart-shamed and even sexually assaulted, there’s a misogynistic undercurrent to the fruit slop microdramas, even as they appear to be
OpenClaw Agents Can Be Guilt-Tripped Into Self-Sabotage
Wired AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
OpenClaw Agents Can Be Guilt-Tripped Into Self-Sabotage
In a controlled experiment, OpenClaw agents proved prone to panic and vulnerable to manipulation. They even disabled their own functionality when gaslit by huma
The Verge 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Spotify is letting artists manually approve releases to combat AI fakes
Spotify is beta-testing a new feature called Artist Profile Protection that lets artists review releases before they go live. Sometimes songs end up on the wron
Protecting people from harmful manipulation
DeepMind Blog 🧠 Large Language Models ⚡ AI Lesson 1mo ago
Protecting people from harmful manipulation
Google DeepMind researches AI's harmful manipulation risks across areas like finance and health, leading to new safety measures.
London’s Granola raises $125M to turn meeting recordings into enterprise AI infrastructure
The Next Web AI 🧠 Large Language Models ⚡ AI Lesson 1mo ago
London’s Granola raises $125M to turn meeting recordings into enterprise AI infrastructure
Granola, the London-based AI meeting app that records conversations without dropping a bot into the call, has raised $125 million in a Series C round led by Dan