Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,061 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
SUMMIR: A Hallucination-Aware Framework for Ranking Sports Insights from LLMs
arXiv:2604.04947v1 Announce Type: cross Abstract: With the rapid proliferation of online sports journalism, extracting meaningful pre-game and post-game insight
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Learning to Retrieve from Agent Trajectories
arXiv:2604.04949v1 Announce Type: cross Abstract: Information retrieval (IR) systems have traditionally been designed and trained for human users, with learning
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Generative AI for Video Trailer Synthesis: From Extractive Heuristics to Autoregressive Creativity
arXiv:2604.04953v1 Announce Type: cross Abstract: The domain of automatic video trailer generation is currently undergoing a profound paradigm shift, transition
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
The Planetary Cost of AI Acceleration, Part II: The 10th Planetary Boundary and the 6.5-Year Countdown
arXiv:2604.04956v1 Announce Type: cross Abstract: The recent, super-exponential scaling of autonomous Large Language Model (LLM) agents signals a broader, funda
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Self-Supervised Foundation Model for Calcium-imaging Population Dynamics
arXiv:2604.04958v1 Announce Type: cross Abstract: Recent work suggests that large-scale, multi-animal modeling can significantly improve neural recording analys
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MG$^2$-RAG: Multi-Granularity Graph for Multimodal Retrieval-Augmented Generation
arXiv:2604.04969v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) mitigates hallucinations in Multimodal Large Language Models (MLLMs), yet
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code's Auto Mode
arXiv:2604.04978v1 Announce Type: cross Abstract: Claude Code's auto mode is the first deployed permission system for AI coding agents, using a two-stage transc
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
CURE:Circuit-Aware Unlearning for LLM-based Recommendation
arXiv:2604.04982v1 Announce Type: cross Abstract: Recent advances in large language models (LLMs) have opened new opportunities for recommender systems by enabl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling
arXiv:2604.04987v1 Announce Type: cross Abstract: Speculative sampling (SpS) has been successful in accelerating the decoding throughput of auto-regressive larg
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
FreakOut-LLM: The Effect of Emotional Stimuli on Safety Alignment
arXiv:2604.04992v1 Announce Type: cross Abstract: Safety-aligned LLMs go through refusal training to reject harmful requests, but whether these mechanisms remai
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Evaluation of Embedding-Based and Generative Methods for LLM-Driven Document Classification: Opportunities and Challenges
arXiv:2604.04997v1 Announce Type: cross Abstract: This work presents a comparative analysis of embedding-based and generative models for classifying geoscience
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
EduIllustrate: Towards Scalable Automated Generation Of Multimodal Educational Content
arXiv:2604.05005v1 Announce Type: cross Abstract: Large language models are increasingly used as educational assistants, yet evaluation of their educational cap
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Generalizable Audio-Visual Navigation via Binaural Difference Attention and Action Transition Prediction
arXiv:2604.05007v1 Announce Type: cross Abstract: In Audio-Visual Navigation (AVN), agents must locate sound sources in unseen 3D environments using visual and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Comparative Characterization of KV Cache Management Strategies for LLM Inference
arXiv:2604.05012v1 Announce Type: cross Abstract: Efficient inference with Large Language Models (LLMs) increasingly relies on Key-Value (KV) caches to store pr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Scaling Coding Agents via Atomic Skills
arXiv:2604.05013v1 Announce Type: cross Abstract: Current LLM coding agents are predominantly trained on composite benchmarks (e.g., bug fixing), which often le
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
arXiv:2604.05014v1 Announce Type: cross Abstract: Building generalist embodied agents requires integrating perception, language understanding, and action, which
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Phase-Associative Memory: Sequence Modeling in Complex Hilbert Space
arXiv:2604.05030v1 Announce Type: cross Abstract: We present Phase-Associative Memory (PAM), a recurrent sequence model in which all representations are complex
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
This Treatment Works, Right? Evaluating LLM Sensitivity to Patient Question Framing in Medical QA
arXiv:2604.05051v1 Announce Type: cross Abstract: Patients are increasingly turning to large language models (LLMs) with medical questions that are complex and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Feature-Aware Anisotropic Local Differential Privacy for Utility-Preserving Graph Representation Learning in Metal Additive Manufacturing
arXiv:2604.05077v1 Announce Type: cross Abstract: Metal additive manufacturing (AM) enables the fabrication of safety-critical components, but reliable quality
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation
arXiv:2604.05083v1 Announce Type: cross Abstract: While Large Language Models (LLMs) are increasingly adopted as automated judges for evaluating generated text,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks
arXiv:2604.05100v1 Announce Type: cross Abstract: Instructed code editing, where an LLM modifies existing code based on a natural language instruction, accounts
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner
arXiv:2604.05112v1 Announce Type: cross Abstract: Recent progress in in-context reinforcement learning (ICRL) has demonstrated its potential for training genera
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Watch Before You Answer: Learning from Visually Grounded Post-Training
arXiv:2604.05117v1 Announce Type: cross Abstract: It is critical for vision-language models (VLMs) to comprehensively understand visual, temporal, and textual c
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Offline RL for Adaptive Policy Retrieval in Prior Authorization
arXiv:2604.05125v1 Announce Type: cross Abstract: Prior authorization (PA) requires interpretation of complex and fragmented coverage policies, yet existing ret
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning
arXiv:2604.05134v1 Announce Type: cross Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolve
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
EffiPair: Improving the Efficiency of LLM-generated Code with Relative Contrastive Feedback
arXiv:2604.05137v1 Announce Type: cross Abstract: Large language models (LLMs) often generate code that is functionally correct but inefficient in runtime and m
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Compiled AI: Deterministic Code Generation for LLM-Based Workflow Automation
arXiv:2604.05150v1 Announce Type: cross Abstract: We study compiled AI, a paradigm in which large language models generate executable code artifacts during a co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Planning to Explore: Curiosity-Driven Planning for LLM Test Generation
arXiv:2604.05159v1 Announce Type: cross Abstract: The use of LLMs for code generation has naturally extended to code testing and evaluation. As codebases grow i
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Not All Turns Are Equally Hard: Adaptive Thinking Budgets For Efficient Multi-Turn Reasoning
arXiv:2604.05164v1 Announce Type: cross Abstract: As LLM reasoning performance plateau, improving inference-time compute efficiency is crucial to mitigate overt
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
From Use to Oversight: How Mental Models Influence User Behavior and Output in AI Writing Assistants
arXiv:2604.05166v1 Announce Type: cross Abstract: AI-based writing assistants are ubiquitous, yet little is known about how users' mental models shape their use
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters for Diffusion Models
arXiv:2604.05183v1 Announce Type: cross Abstract: In a rapidly growing field of model training there is a constant practical interest in parameter-efficient fin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Improving Clinical Trial Recruitment using Clinical Narratives and Large Language Models
arXiv:2604.05190v1 Announce Type: cross Abstract: Screening patients for enrollment is a well-known, labor-intensive bottleneck that leads to under-enrollment a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts
arXiv:2604.05242v1 Announce Type: cross Abstract: Multi-bit watermarking has emerged as a promising solution for embedding imperceptible binary messages into La
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Exemplar Retrieval Without Overhypothesis Induction: Limits of Distributional Sequence Learning in Early Word Learning
arXiv:2604.05243v1 Announce Type: cross Abstract: Background: Children do not simply learn that balls are round and blocks are square. They learn that shape is
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Extending Tabular Denoising Diffusion Probabilistic Models for Time-Series Data Generation
arXiv:2604.05257v1 Announce Type: cross Abstract: Diffusion models are increasingly being utilised to create synthetic tabular and time series data for privacy-
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Region-R1: Reinforcing Query-Side Region Cropping for Multi-Modal Re-Ranking
arXiv:2604.05268v1 Announce Type: cross Abstract: Multi-modal retrieval-augmented generation (MM-RAG) relies heavily on re-rankers to surface the most relevant
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Broken by Default: A Formal Verification Study of Security Vulnerabilities in AI-Generated Code
arXiv:2604.05292v1 Announce Type: cross Abstract: AI coding assistants are now used to generate production code in security-sensitive domains, yet the exploitab
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLMs Should Express Uncertainty Explicitly
arXiv:2604.05306v1 Announce Type: cross Abstract: Large language models are increasingly used in settings where uncertainty must drive decisions such as abstent
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
DQA: Diagnostic Question Answering for IT Support
arXiv:2604.05350v1 Announce Type: cross Abstract: Enterprise IT support interactions are fundamentally diagnostic: effective resolution requires iterative evide
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
OGA-AID: Clinician-in-the-loop AI Report Drafting Assistant for Multimodal Observational Gait Analysis in Post-Stroke Rehabilitation
arXiv:2604.05360v1 Announce Type: cross Abstract: Gait analysis is essential in post-stroke rehabilitation but remains time-intensive and cognitively demanding,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
VideoStir: Understanding Long Videos via Spatio-Temporally Structured and Intent-Aware RAG
arXiv:2604.05418v1 Announce Type: cross Abstract: Scaling multimodal large language models (MLLMs) to long videos is constrained by limited context windows. Whi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads
arXiv:2604.05426v1 Announce Type: cross Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Your LLM Agent Can Leak Your Data: Data Exfiltration via Backdoored Tool Use
arXiv:2604.05432v1 Announce Type: cross Abstract: Tool-use large language model (LLM) agents are increasingly deployed to support sensitive workflows, relying o
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling
arXiv:2604.05445v1 Announce Type: cross Abstract: Vision-language reward modeling faces a dilemma: generative approaches are interpretable but slow, while discr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLM Evaluation as Tensor Completion: Low Rank Structure and Semiparametric Efficiency
arXiv:2604.05460v1 Announce Type: cross Abstract: Large language model (LLM) evaluation platforms increasingly rely on pairwise human judgments. These data are
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
On the Role of Fault Localization Context for LLM-Based Program Repair
arXiv:2604.05481v1 Announce Type: cross Abstract: Fault Localization (FL) is a key component of Large Language Model (LLM)-based Automated Program Repair (APR),
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Unifying VLM-Guided Flow Matching and Spectral Anomaly Detection for Interpretable Veterinary Diagnosis
arXiv:2604.05482v1 Announce Type: cross Abstract: Automatic diagnosis of canine pneumothorax is challenged by data scarcity and the need for trustworthy models.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Turbulence-like 5/3 spectral scaling in contextual representations of language as a complex system
arXiv:2604.05536v1 Announce Type: cross Abstract: Natural language is a complex system that exhibits robust statistical regularities. Here, we represent text as
DeepCamp AI