Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,483 lessons

Skills in this topic

5 skills

LLM Foundations: Explain how transformers generate text; Write zero-shot and few-shot prompts

LLM Engineering: Call LLM APIs with function/tool use

Fine-tuning LLMs: Prepare fine-tuning datasets

Multimodal LLMs: Use GPT-4V / Claude Vision for image understanding

Videos 19,394 Reads 5,089

Showing 5,089 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks

arXiv:2604.05100v1 Announce Type: cross Abstract: Instructed code editing, where an LLM modifies existing code based on a natural language instruction, accounts

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner

arXiv:2604.05112v1 Announce Type: cross Abstract: Recent progress in in-context reinforcement learning (ICRL) has demonstrated its potential for training genera

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

arXiv:2604.05117v1 Announce Type: cross Abstract: It is critical for vision-language models (VLMs) to comprehensively understand visual, temporal, and textual c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Offline RL for Adaptive Policy Retrieval in Prior Authorization

arXiv:2604.05125v1 Announce Type: cross Abstract: Prior authorization (PA) requires interpretation of complex and fragmented coverage policies, yet existing ret

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning

arXiv:2604.05134v1 Announce Type: cross Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

EffiPair: Improving the Efficiency of LLM-generated Code with Relative Contrastive Feedback

arXiv:2604.05137v1 Announce Type: cross Abstract: Large language models (LLMs) often generate code that is functionally correct but inefficient in runtime and m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Compiled AI: Deterministic Code Generation for LLM-Based Workflow Automation

arXiv:2604.05150v1 Announce Type: cross Abstract: We study compiled AI, a paradigm in which large language models generate executable code artifacts during a co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Planning to Explore: Curiosity-Driven Planning for LLM Test Generation

arXiv:2604.05159v1 Announce Type: cross Abstract: The use of LLMs for code generation has naturally extended to code testing and evaluation. As codebases grow i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Not All Turns Are Equally Hard: Adaptive Thinking Budgets For Efficient Multi-Turn Reasoning

arXiv:2604.05164v1 Announce Type: cross Abstract: As LLM reasoning performance plateaus, improving inference-time compute efficiency is crucial to mitigate overt

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

From Use to Oversight: How Mental Models Influence User Behavior and Output in AI Writing Assistants

arXiv:2604.05166v1 Announce Type: cross Abstract: AI-based writing assistants are ubiquitous, yet little is known about how users' mental models shape their use

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters for Diffusion Models

arXiv:2604.05183v1 Announce Type: cross Abstract: In a rapidly growing field of model training there is a constant practical interest in parameter-efficient fin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Improving Clinical Trial Recruitment using Clinical Narratives and Large Language Models

arXiv:2604.05190v1 Announce Type: cross Abstract: Screening patients for enrollment is a well-known, labor-intensive bottleneck that leads to under-enrollment a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts

arXiv:2604.05242v1 Announce Type: cross Abstract: Multi-bit watermarking has emerged as a promising solution for embedding imperceptible binary messages into La

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Exemplar Retrieval Without Overhypothesis Induction: Limits of Distributional Sequence Learning in Early Word Learning

arXiv:2604.05243v1 Announce Type: cross Abstract: Background: Children do not simply learn that balls are round and blocks are square. They learn that shape is

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Extending Tabular Denoising Diffusion Probabilistic Models for Time-Series Data Generation

arXiv:2604.05257v1 Announce Type: cross Abstract: Diffusion models are increasingly being utilised to create synthetic tabular and time series data for privacy-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Region-R1: Reinforcing Query-Side Region Cropping for Multi-Modal Re-Ranking

arXiv:2604.05268v1 Announce Type: cross Abstract: Multi-modal retrieval-augmented generation (MM-RAG) relies heavily on re-rankers to surface the most relevant

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Broken by Default: A Formal Verification Study of Security Vulnerabilities in AI-Generated Code

arXiv:2604.05292v1 Announce Type: cross Abstract: AI coding assistants are now used to generate production code in security-sensitive domains, yet the exploitab

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LLMs Should Express Uncertainty Explicitly

arXiv:2604.05306v1 Announce Type: cross Abstract: Large language models are increasingly used in settings where uncertainty must drive decisions such as abstent

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

DQA: Diagnostic Question Answering for IT Support

arXiv:2604.05350v1 Announce Type: cross Abstract: Enterprise IT support interactions are fundamentally diagnostic: effective resolution requires iterative evide

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

OGA-AID: Clinician-in-the-loop AI Report Drafting Assistant for Multimodal Observational Gait Analysis in Post-Stroke Rehabilitation

arXiv:2604.05360v1 Announce Type: cross Abstract: Gait analysis is essential in post-stroke rehabilitation but remains time-intensive and cognitively demanding,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

VideoStir: Understanding Long Videos via Spatio-Temporally Structured and Intent-Aware RAG

arXiv:2604.05418v1 Announce Type: cross Abstract: Scaling multimodal large language models (MLLMs) to long videos is constrained by limited context windows. Whi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads

arXiv:2604.05426v1 Announce Type: cross Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language mo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Your LLM Agent Can Leak Your Data: Data Exfiltration via Backdoored Tool Use

arXiv:2604.05432v1 Announce Type: cross Abstract: Tool-use large language model (LLM) agents are increasingly deployed to support sensitive workflows, relying o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling

arXiv:2604.05445v1 Announce Type: cross Abstract: Vision-language reward modeling faces a dilemma: generative approaches are interpretable but slow, while discr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LLM Evaluation as Tensor Completion: Low Rank Structure and Semiparametric Efficiency

arXiv:2604.05460v1 Announce Type: cross Abstract: Large language model (LLM) evaluation platforms increasingly rely on pairwise human judgments. These data are

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

On the Role of Fault Localization Context for LLM-Based Program Repair

arXiv:2604.05481v1 Announce Type: cross Abstract: Fault Localization (FL) is a key component of Large Language Model (LLM)-based Automated Program Repair (APR),

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Unifying VLM-Guided Flow Matching and Spectral Anomaly Detection for Interpretable Veterinary Diagnosis

arXiv:2604.05482v1 Announce Type: cross Abstract: Automatic diagnosis of canine pneumothorax is challenged by data scarcity and the need for trustworthy models.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Turbulence-like 5/3 spectral scaling in contextual representations of language as a complex system

arXiv:2604.05536v1 Announce Type: cross Abstract: Natural language is a complex system that exhibits robust statistical regularities. Here, we represent text as

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

FastDiSS: Few-step Match Many-step Diffusion Language Model on Sequence-to-Sequence Generation--Full Version

arXiv:2604.05551v1 Announce Type: cross Abstract: Self-conditioning has been central to the success of continuous diffusion language models, as it allows models

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue

arXiv:2604.05552v1 Announce Type: cross Abstract: Large Language Models demonstrate outstanding performance in many language tasks but still face fundamental ch

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

AI-Driven Modular Services for Accessible Multilingual Education in Immersive Extended Reality Settings: Integrating Speech Processing, Translation, and Sign Language Rendering

arXiv:2604.05591v1 Announce Type: cross Abstract: This work introduces a modular platform that brings together six AI services, automatic speech recognition via

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Analogical Reasoning as a Doctor: A Foundation Model for Gastrointestinal Endoscopy Diagnosis

arXiv:2604.05649v1 Announce Type: cross Abstract: Gastrointestinal diseases impose a growing global health burden, and endoscopy is a primary tool for early dia

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Multiscale Physics-Informed Neural Network for Complex Fluid Flows with Long-Range Dependencies

arXiv:2604.05652v1 Announce Type: cross Abstract: Fluid flows are governed by the nonlinear Navier-Stokes equations, which can manifest multiscale dynamics even

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals

arXiv:2604.05655v1 Announce Type: cross Abstract: This work characterizes large language models' chain-of-thought generation as a structured trajectory through

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Rectified Schrödinger Bridge Matching for Few-Step Visual Navigation

arXiv:2604.05673v1 Announce Type: cross Abstract: Visual navigation is a core challenge in Embodied AI, requiring autonomous agents to translate high-dimensiona

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

From Incomplete Architecture to Quantified Risk: Multimodal LLM-Driven Security Assessment for Cyber-Physical Systems

arXiv:2604.05674v1 Announce Type: cross Abstract: Cyber-physical systems often contend with incomplete architectural documentation or outdated information resul

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion

arXiv:2604.05688v1 Announce Type: cross Abstract: Key-Value (KV) cache memory and bandwidth increasingly dominate large language model inference cost in long-co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Hackers or Hallucinators? A Comprehensive Analysis of LLM-Based Automated Penetration Testing

arXiv:2604.05719v1 Announce Type: cross Abstract: The rapid advancement of Large Language Models (LLMs) has created new opportunities for Automated Penetration

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

CAKE: Cloud Architecture Knowledge Evaluation of Large Language Models

arXiv:2604.05755v1 Announce Type: cross Abstract: In today's software architecture, large language models (LLMs) serve as software architecture co-pilots. Howev

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

What Models Know, How Well They Know It: Knowledge-Weighted Fine-Tuning for Learning When to Say "I Don't Know"

arXiv:2604.05779v1 Announce Type: cross Abstract: While large language models (LLMs) demonstrate strong capabilities across diverse user queries, they still suf

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

EEG-MFTNet: An Enhanced EEGNet Architecture with Multi-Scale Temporal Convolutions and Transformer Fusion for Cross-Session Motor Imagery Decoding

arXiv:2604.05843v1 Announce Type: cross Abstract: Brain-computer interfaces (BCIs) enable direct communication between the brain and external devices, providing

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Neural Network Pruning via QUBO Optimization

arXiv:2604.05856v1 Announce Type: cross Abstract: Neural network pruning can be formulated as a combinatorial optimization problem, yet most existing approaches

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Swiss-Bench 003: Evaluating LLM Reliability and Adversarial Security for Swiss Regulatory Contexts

arXiv:2604.05872v1 Announce Type: cross Abstract: The deployment of large language models (LLMs) in Swiss financial and regulatory contexts demands empirical ev

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Selective Aggregation of Attention Maps Improves Diffusion-Based Visual Interpretation

arXiv:2604.05906v1 Announce Type: cross Abstract: Numerous studies on text-to-image (T2I) generative models have utilized cross-attention maps to boost applicat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

"I See What You Did There": Can Large Vision-Language Models Understand Multimodal Puns?

arXiv:2604.05930v1 Announce Type: cross Abstract: Puns are a common form of rhetorical wordplay that exploits polysemy and phonetic similarity to create humor.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning

arXiv:2604.05931v1 Announce Type: cross Abstract: Zero-shot unsupervised reinforcement learning (URL) offers a promising direction for building generalist agent

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Does Pass Rate Tell the Whole Story? Evaluating Design Constraint Compliance in LLM-based Issue Resolution

arXiv:2604.05955v1 Announce Type: cross Abstract: Repository-level issue resolution benchmarks have become a standard testbed for evaluating LLM-based agents, y

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

The Model Agreed, But Didn't Learn: Diagnosing Surface Compliance in Large Language Models

arXiv:2604.05995v1 Announce Type: cross Abstract: Large Language Models (LLMs) internalize vast world knowledge as parametric memory, yet inevitably inherit the