Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,450

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,389 Reads 5,061

Showing 5,061 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

SUMMIR: A Hallucination-Aware Framework for Ranking Sports Insights from LLMs

arXiv:2604.04947v1 Announce Type: cross Abstract: With the rapid proliferation of online sports journalism, extracting meaningful pre-game and post-game insight

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Learning to Retrieve from Agent Trajectories

arXiv:2604.04949v1 Announce Type: cross Abstract: Information retrieval (IR) systems have traditionally been designed and trained for human users, with learning

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Generative AI for Video Trailer Synthesis: From Extractive Heuristics to Autoregressive Creativity

arXiv:2604.04953v1 Announce Type: cross Abstract: The domain of automatic video trailer generation is currently undergoing a profound paradigm shift, transition

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

The Planetary Cost of AI Acceleration, Part II: The 10th Planetary Boundary and the 6.5-Year Countdown

arXiv:2604.04956v1 Announce Type: cross Abstract: The recent, super-exponential scaling of autonomous Large Language Model (LLM) agents signals a broader, funda

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Self-Supervised Foundation Model for Calcium-imaging Population Dynamics

arXiv:2604.04958v1 Announce Type: cross Abstract: Recent work suggests that large-scale, multi-animal modeling can significantly improve neural recording analys

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

MG$^2$-RAG: Multi-Granularity Graph for Multimodal Retrieval-Augmented Generation

arXiv:2604.04969v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) mitigates hallucinations in Multimodal Large Language Models (MLLMs), yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code's Auto Mode

arXiv:2604.04978v1 Announce Type: cross Abstract: Claude Code's auto mode is the first deployed permission system for AI coding agents, using a two-stage transc

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

CURE:Circuit-Aware Unlearning for LLM-based Recommendation

arXiv:2604.04982v1 Announce Type: cross Abstract: Recent advances in large language models (LLMs) have opened new opportunities for recommender systems by enabl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling

arXiv:2604.04987v1 Announce Type: cross Abstract: Speculative sampling (SpS) has been successful in accelerating the decoding throughput of auto-regressive larg

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

FreakOut-LLM: The Effect of Emotional Stimuli on Safety Alignment

arXiv:2604.04992v1 Announce Type: cross Abstract: Safety-aligned LLMs go through refusal training to reject harmful requests, but whether these mechanisms remai

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Evaluation of Embedding-Based and Generative Methods for LLM-Driven Document Classification: Opportunities and Challenges

arXiv:2604.04997v1 Announce Type: cross Abstract: This work presents a comparative analysis of embedding-based and generative models for classifying geoscience

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

EduIllustrate: Towards Scalable Automated Generation Of Multimodal Educational Content

arXiv:2604.05005v1 Announce Type: cross Abstract: Large language models are increasingly used as educational assistants, yet evaluation of their educational cap

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Generalizable Audio-Visual Navigation via Binaural Difference Attention and Action Transition Prediction

arXiv:2604.05007v1 Announce Type: cross Abstract: In Audio-Visual Navigation (AVN), agents must locate sound sources in unseen 3D environments using visual and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Comparative Characterization of KV Cache Management Strategies for LLM Inference

arXiv:2604.05012v1 Announce Type: cross Abstract: Efficient inference with Large Language Models (LLMs) increasingly relies on Key-Value (KV) caches to store pr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Scaling Coding Agents via Atomic Skills

arXiv:2604.05013v1 Announce Type: cross Abstract: Current LLM coding agents are predominantly trained on composite benchmarks (e.g., bug fixing), which often le

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

arXiv:2604.05014v1 Announce Type: cross Abstract: Building generalist embodied agents requires integrating perception, language understanding, and action, which

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Phase-Associative Memory: Sequence Modeling in Complex Hilbert Space

arXiv:2604.05030v1 Announce Type: cross Abstract: We present Phase-Associative Memory (PAM), a recurrent sequence model in which all representations are complex

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

This Treatment Works, Right? Evaluating LLM Sensitivity to Patient Question Framing in Medical QA

arXiv:2604.05051v1 Announce Type: cross Abstract: Patients are increasingly turning to large language models (LLMs) with medical questions that are complex and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Feature-Aware Anisotropic Local Differential Privacy for Utility-Preserving Graph Representation Learning in Metal Additive Manufacturing

arXiv:2604.05077v1 Announce Type: cross Abstract: Metal additive manufacturing (AM) enables the fabrication of safety-critical components, but reliable quality

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation

arXiv:2604.05083v1 Announce Type: cross Abstract: While Large Language Models (LLMs) are increasingly adopted as automated judges for evaluating generated text,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks

arXiv:2604.05100v1 Announce Type: cross Abstract: Instructed code editing, where an LLM modifies existing code based on a natural language instruction, accounts

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner

arXiv:2604.05112v1 Announce Type: cross Abstract: Recent progress in in-context reinforcement learning (ICRL) has demonstrated its potential for training genera

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

arXiv:2604.05117v1 Announce Type: cross Abstract: It is critical for vision-language models (VLMs) to comprehensively understand visual, temporal, and textual c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Offline RL for Adaptive Policy Retrieval in Prior Authorization

arXiv:2604.05125v1 Announce Type: cross Abstract: Prior authorization (PA) requires interpretation of complex and fragmented coverage policies, yet existing ret

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

arXiv:2604.05134v1 Announce Type: cross Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

EffiPair: Improving the Efficiency of LLM-generated Code with Relative Contrastive Feedback

arXiv:2604.05137v1 Announce Type: cross Abstract: Large language models (LLMs) often generate code that is functionally correct but inefficient in runtime and m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Compiled AI: Deterministic Code Generation for LLM-Based Workflow Automation

arXiv:2604.05150v1 Announce Type: cross Abstract: We study compiled AI, a paradigm in which large language models generate executable code artifacts during a co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Planning to Explore: Curiosity-Driven Planning for LLM Test Generation

arXiv:2604.05159v1 Announce Type: cross Abstract: The use of LLMs for code generation has naturally extended to code testing and evaluation. As codebases grow i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Not All Turns Are Equally Hard: Adaptive Thinking Budgets For Efficient Multi-Turn Reasoning

arXiv:2604.05164v1 Announce Type: cross Abstract: As LLM reasoning performance plateau, improving inference-time compute efficiency is crucial to mitigate overt

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

From Use to Oversight: How Mental Models Influence User Behavior and Output in AI Writing Assistants

arXiv:2604.05166v1 Announce Type: cross Abstract: AI-based writing assistants are ubiquitous, yet little is known about how users' mental models shape their use

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters for Diffusion Models

arXiv:2604.05183v1 Announce Type: cross Abstract: In a rapidly growing field of model training there is a constant practical interest in parameter-efficient fin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Improving Clinical Trial Recruitment using Clinical Narratives and Large Language Models

arXiv:2604.05190v1 Announce Type: cross Abstract: Screening patients for enrollment is a well-known, labor-intensive bottleneck that leads to under-enrollment a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

arXiv:2604.05242v1 Announce Type: cross Abstract: Multi-bit watermarking has emerged as a promising solution for embedding imperceptible binary messages into La

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Exemplar Retrieval Without Overhypothesis Induction: Limits of Distributional Sequence Learning in Early Word Learning

arXiv:2604.05243v1 Announce Type: cross Abstract: Background: Children do not simply learn that balls are round and blocks are square. They learn that shape is

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Extending Tabular Denoising Diffusion Probabilistic Models for Time-Series Data Generation

arXiv:2604.05257v1 Announce Type: cross Abstract: Diffusion models are increasingly being utilised to create synthetic tabular and time series data for privacy-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Region-R1: Reinforcing Query-Side Region Cropping for Multi-Modal Re-Ranking

arXiv:2604.05268v1 Announce Type: cross Abstract: Multi-modal retrieval-augmented generation (MM-RAG) relies heavily on re-rankers to surface the most relevant

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Broken by Default: A Formal Verification Study of Security Vulnerabilities in AI-Generated Code

arXiv:2604.05292v1 Announce Type: cross Abstract: AI coding assistants are now used to generate production code in security-sensitive domains, yet the exploitab

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LLMs Should Express Uncertainty Explicitly

arXiv:2604.05306v1 Announce Type: cross Abstract: Large language models are increasingly used in settings where uncertainty must drive decisions such as abstent

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

DQA: Diagnostic Question Answering for IT Support

arXiv:2604.05350v1 Announce Type: cross Abstract: Enterprise IT support interactions are fundamentally diagnostic: effective resolution requires iterative evide

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

OGA-AID: Clinician-in-the-loop AI Report Drafting Assistant for Multimodal Observational Gait Analysis in Post-Stroke Rehabilitation

arXiv:2604.05360v1 Announce Type: cross Abstract: Gait analysis is essential in post-stroke rehabilitation but remains time-intensive and cognitively demanding,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

VideoStir: Understanding Long Videos via Spatio-Temporally Structured and Intent-Aware RAG

arXiv:2604.05418v1 Announce Type: cross Abstract: Scaling multimodal large language models (MLLMs) to long videos is constrained by limited context windows. Whi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

arXiv:2604.05426v1 Announce Type: cross Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language mo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Your LLM Agent Can Leak Your Data: Data Exfiltration via Backdoored Tool Use

arXiv:2604.05432v1 Announce Type: cross Abstract: Tool-use large language model (LLM) agents are increasingly deployed to support sensitive workflows, relying o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling

arXiv:2604.05445v1 Announce Type: cross Abstract: Vision-language reward modeling faces a dilemma: generative approaches are interpretable but slow, while discr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LLM Evaluation as Tensor Completion: Low Rank Structure and Semiparametric Efficiency

arXiv:2604.05460v1 Announce Type: cross Abstract: Large language model (LLM) evaluation platforms increasingly rely on pairwise human judgments. These data are

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

On the Role of Fault Localization Context for LLM-Based Program Repair

arXiv:2604.05481v1 Announce Type: cross Abstract: Fault Localization (FL) is a key component of Large Language Model (LLM)-based Automated Program Repair (APR),

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Unifying VLM-Guided Flow Matching and Spectral Anomaly Detection for Interpretable Veterinary Diagnosis

arXiv:2604.05482v1 Announce Type: cross Abstract: Automatic diagnosis of canine pneumothorax is challenged by data scarcity and the need for trustworthy models.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Turbulence-like 5/3 spectral scaling in contextual representations of language as a complex system

arXiv:2604.05536v1 Announce Type: cross Abstract: Natural language is a complex system that exhibits robust statistical regularities. Here, we represent text as