Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills
LLM Foundations (beginner): Explain how transformers generate text
Prompt Craft (beginner): Write zero-shot and few-shot prompts
LLM Engineering (intermediate): Call LLM APIs with function/tool use
Fine-tuning LLMs (advanced): Prepare fine-tuning datasets
Multimodal LLMs (advanced): Use GPT-4V / Claude Vision for image understanding
Showing 5,034 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
CuraLight: Debate-Guided Data Curation for LLM-Centered Traffic Signal Control
arXiv:2604.05663v1 Announce Type: new Abstract: Traffic signal control (TSC) is a core component of intelligent transportation systems (ITS), aiming to reduce c
LUDOBENCH: Evaluating LLM Behavioural Decision-Making Through Spot-Based Board Game Scenarios in Ludo
arXiv:2604.05681v1 Announce Type: new Abstract: We introduce LudoBench, a benchmark for evaluating LLM strategic reasoning in Ludo, a stochastic multi-agent boa
QA-MoE: Towards a Continuous Reliability Spectrum with Quality-Aware Mixture of Experts for Robust Multimodal Sentiment Analysis
arXiv:2604.05704v1 Announce Type: new Abstract: Multimodal Sentiment Analysis (MSA) aims to infer human sentiment from textual, acoustic, and visual signals. In
Can Large Language Models Reinvent Foundational Algorithms?
arXiv:2604.05716v1 Announce Type: new Abstract: LLMs have shown strong potential to advance scientific discovery. Whether they possess the capacity for foundati
Emergent social transmission of model-based representations without inference
arXiv:2604.05777v1 Announce Type: new Abstract: How do people acquire rich, flexible knowledge about their environment from others despite limited cognitive cap
Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents
arXiv:2604.05808v1 Announce Type: new Abstract: Large language model (LLM) agents have demonstrated strong capabilities in complex interactive decision-making t
Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring
arXiv:2604.05854v1 Announce Type: new Abstract: We present Deep Researcher Agent, an open-source framework that enables large language model (LLM) agen
When Do We Need LLMs? A Diagnostic for Language-Driven Bandits
arXiv:2604.05859v1 Announce Type: new Abstract: We study Contextual Multi-Armed Bandits (CMABs) for non-episodic sequential decision making problems where the c
JTON: A Token-Efficient JSON Superset with Zen Grid Tabular Encoding for Large Language Models
arXiv:2604.05865v1 Announce Type: new Abstract: When LLMs process structured data, the serialization format directly affects cost and context utilization. Stand
Joint Knowledge Base Completion and Question Answering by Combining Large Language Models and Small Language Models
arXiv:2604.05875v1 Announce Type: new Abstract: Knowledge Bases (KBs) play a key role in various applications. As two representative KB-related tasks, knowledge
HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference
arXiv:2604.05887v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) have advanced unified reasoning over text, images, and videos, but thei
Context-Value-Action Architecture for Value-Driven Large Language Model Agents
arXiv:2604.05939v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown promise in simulating human behavior, yet existing agents often exhibit
MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning
arXiv:2604.05943v1 Announce Type: new Abstract: Recent advances in multi-agent reinforcement learning (MARL) have demonstrated success in numerous challenging d
Towards Trustworthy Report Generation: A Deep Research Agent with Progressive Confidence Estimation and Calibration
arXiv:2604.05952v1 Announce Type: new Abstract: As agent-based systems continue to evolve, deep research agents are capable of automatically generating research
Beyond Compromise: Pareto-Lenient Consensus for Efficient Multi-Preference LLM Alignment
arXiv:2604.05965v1 Announce Type: new Abstract: Transcending the single-preference paradigm, aligning LLMs with diverse human values is pivotal for robust deplo
Epistemic Blinding: An Inference-Time Protocol for Auditing Prior Contamination in LLM-Assisted Analysis
arXiv:2604.06013v1 Announce Type: new Abstract: This paper presents epistemic blinding in the context of an agentic system that uses large language models to re
How LLMs Follow Instructions: Skillful Coordination, Not a Universal Mechanism
arXiv:2604.06015v1 Announce Type: new Abstract: Instruction tuning is commonly assumed to endow language models with a domain-general ability to follow instruct
Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems
arXiv:2604.04936v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) systems critically depend on effective document chunking strategies to ba
TDA-RC: Task-Driven Alignment for Knowledge-Based Reasoning Chains in Large Language Models
arXiv:2604.04942v1 Announce Type: cross Abstract: Enhancing the reasoning capability of large language models (LLMs) remains a core challenge in natural languag
The Illusion of Latent Generalization: Bi-directionality and the Reversal Curse
arXiv:2604.04943v1 Announce Type: cross Abstract: The reversal curse describes a failure of autoregressive language models to retrieve a fact in reverse order (
Inclusion-of-Thoughts: Mitigating Preference Instability via Purifying the Decision Space
arXiv:2604.04944v1 Announce Type: cross Abstract: Multiple-choice questions (MCQs) are widely used to evaluate large language models (LLMs). However, LLMs remai
SUMMIR: A Hallucination-Aware Framework for Ranking Sports Insights from LLMs
arXiv:2604.04947v1 Announce Type: cross Abstract: With the rapid proliferation of online sports journalism, extracting meaningful pre-game and post-game insight
Learning to Retrieve from Agent Trajectories
arXiv:2604.04949v1 Announce Type: cross Abstract: Information retrieval (IR) systems have traditionally been designed and trained for human users, with learning
Generative AI for Video Trailer Synthesis: From Extractive Heuristics to Autoregressive Creativity
arXiv:2604.04953v1 Announce Type: cross Abstract: The domain of automatic video trailer generation is currently undergoing a profound paradigm shift, transition
The Planetary Cost of AI Acceleration, Part II: The 10th Planetary Boundary and the 6.5-Year Countdown
arXiv:2604.04956v1 Announce Type: cross Abstract: The recent, super-exponential scaling of autonomous Large Language Model (LLM) agents signals a broader, funda
Self-Supervised Foundation Model for Calcium-imaging Population Dynamics
arXiv:2604.04958v1 Announce Type: cross Abstract: Recent work suggests that large-scale, multi-animal modeling can significantly improve neural recording analys
MG$^2$-RAG: Multi-Granularity Graph for Multimodal Retrieval-Augmented Generation
arXiv:2604.04969v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) mitigates hallucinations in Multimodal Large Language Models (MLLMs), yet
Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code's Auto Mode
arXiv:2604.04978v1 Announce Type: cross Abstract: Claude Code's auto mode is the first deployed permission system for AI coding agents, using a two-stage transc
CURE: Circuit-Aware Unlearning for LLM-based Recommendation
arXiv:2604.04982v1 Announce Type: cross Abstract: Recent advances in large language models (LLMs) have opened new opportunities for recommender systems by enabl
Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling
arXiv:2604.04987v1 Announce Type: cross Abstract: Speculative sampling (SpS) has been successful in accelerating the decoding throughput of auto-regressive larg
FreakOut-LLM: The Effect of Emotional Stimuli on Safety Alignment
arXiv:2604.04992v1 Announce Type: cross Abstract: Safety-aligned LLMs go through refusal training to reject harmful requests, but whether these mechanisms remai
Evaluation of Embedding-Based and Generative Methods for LLM-Driven Document Classification: Opportunities and Challenges
arXiv:2604.04997v1 Announce Type: cross Abstract: This work presents a comparative analysis of embedding-based and generative models for classifying geoscience
EduIllustrate: Towards Scalable Automated Generation Of Multimodal Educational Content
arXiv:2604.05005v1 Announce Type: cross Abstract: Large language models are increasingly used as educational assistants, yet evaluation of their educational cap
Generalizable Audio-Visual Navigation via Binaural Difference Attention and Action Transition Prediction
arXiv:2604.05007v1 Announce Type: cross Abstract: In Audio-Visual Navigation (AVN), agents must locate sound sources in unseen 3D environments using visual and
Comparative Characterization of KV Cache Management Strategies for LLM Inference
arXiv:2604.05012v1 Announce Type: cross Abstract: Efficient inference with Large Language Models (LLMs) increasingly relies on Key-Value (KV) caches to store pr
Scaling Coding Agents via Atomic Skills
arXiv:2604.05013v1 Announce Type: cross Abstract: Current LLM coding agents are predominantly trained on composite benchmarks (e.g., bug fixing), which often le
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
arXiv:2604.05014v1 Announce Type: cross Abstract: Building generalist embodied agents requires integrating perception, language understanding, and action, which
Phase-Associative Memory: Sequence Modeling in Complex Hilbert Space
arXiv:2604.05030v1 Announce Type: cross Abstract: We present Phase-Associative Memory (PAM), a recurrent sequence model in which all representations are complex
This Treatment Works, Right? Evaluating LLM Sensitivity to Patient Question Framing in Medical QA
arXiv:2604.05051v1 Announce Type: cross Abstract: Patients are increasingly turning to large language models (LLMs) with medical questions that are complex and
Feature-Aware Anisotropic Local Differential Privacy for Utility-Preserving Graph Representation Learning in Metal Additive Manufacturing
arXiv:2604.05077v1 Announce Type: cross Abstract: Metal additive manufacturing (AM) enables the fabrication of safety-critical components, but reliable quality
Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation
arXiv:2604.05083v1 Announce Type: cross Abstract: While Large Language Models (LLMs) are increasingly adopted as automated judges for evaluating generated text,
Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks
arXiv:2604.05100v1 Announce Type: cross Abstract: Instructed code editing, where an LLM modifies existing code based on a natural language instruction, accounts
Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner
arXiv:2604.05112v1 Announce Type: cross Abstract: Recent progress in in-context reinforcement learning (ICRL) has demonstrated its potential for training genera
Watch Before You Answer: Learning from Visually Grounded Post-Training
arXiv:2604.05117v1 Announce Type: cross Abstract: It is critical for vision-language models (VLMs) to comprehensively understand visual, temporal, and textual c
Offline RL for Adaptive Policy Retrieval in Prior Authorization
arXiv:2604.05125v1 Announce Type: cross Abstract: Prior authorization (PA) requires interpretation of complex and fragmented coverage policies, yet existing ret
Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning
arXiv:2604.05134v1 Announce Type: cross Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolve
EffiPair: Improving the Efficiency of LLM-generated Code with Relative Contrastive Feedback
arXiv:2604.05137v1 Announce Type: cross Abstract: Large language models (LLMs) often generate code that is functionally correct but inefficient in runtime and m
Compiled AI: Deterministic Code Generation for LLM-Based Workflow Automation
arXiv:2604.05150v1 Announce Type: cross Abstract: We study compiled AI, a paradigm in which large language models generate executable code artifacts during a co
DeepCamp AI