Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,265 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Hierarchical Semantic Correlation-Aware Masked Autoencoder for Unsupervised Audio-Visual Representation Learning
arXiv:2604.04229v1 Announce Type: cross Abstract: Learning aligned multimodal embeddings from weakly paired, label-free corpora is challenging: pipelines often
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Three Phases of Expert Routing: How Load Balance Evolves During Mixture-of-Experts Training
arXiv:2604.04230v1 Announce Type: cross Abstract: We model Mixture-of-Experts (MoE) token routing as a congestion game with a single effective parameter, the co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
APPA: Adaptive Preference Pluralistic Alignment for Fair Federated RLHF of LLMs
arXiv:2604.04261v1 Announce Type: cross Abstract: Aligning large language models (LLMs) with diverse human preferences requires pluralistic alignment, where a s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Commercial Persuasion in AI-Mediated Conversations
arXiv:2604.04263v1 Announce Type: cross Abstract: As Large Language Models (LLMs) become a primary interface between users and the web, companies face growing e
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6
arXiv:2604.04289v1 Announce Type: cross Abstract: When an LLM deobfuscates JavaScript, can poisoned identifier names in the string table survive into the model'
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
HighFM: Towards a Foundation Model for Learning Representations from High-Frequency Earth Observation Data
arXiv:2604.04306v1 Announce Type: cross Abstract: The increasing frequency and severity of climate related disasters have intensified the need for real time mon
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Effects of Generative AI Errors on User Reliance Across Task Difficulty
arXiv:2604.04319v1 Announce Type: cross Abstract: The capabilities of artificial intelligence (AI) lie along a jagged frontier, where AI systems surprisingly fa
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
GROUNDEDKG-RAG: Grounded Knowledge Graph Index for Long-document Question Answering
arXiv:2604.04359v1 Announce Type: cross Abstract: Retrieval-augmented generation (RAG) systems have been widely adopted in contemporary large language models (L
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Compressible Softmax-Attended Language under Incompressible Attention
arXiv:2604.04384v1 Announce Type: cross Abstract: Across every attention head in five transformer language models (124M--7B parameters, four architecture famili
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
How Alignment Routes: Localizing, Scaling, and Controlling Policy Circuits in Language Models
arXiv:2604.04385v1 Announce Type: cross Abstract: We identify a recurring sparse routing mechanism in alignment-trained language models: a gate attention head r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Relative Density Ratio Optimization for Stable and Statistically Consistent Model Alignment
arXiv:2604.04410v1 Announce Type: cross Abstract: Aligning language models with human preferences is essential for ensuring their safety and reliability. Althou
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Responses Fall Short of Understanding: Revealing the Gap between Internal Representations and Responses in Visual Document Understanding
arXiv:2604.04411v1 Announce Type: cross Abstract: Visual document understanding (VDU) is a challenging task for large vision language models (LVLMs), requiring
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality
arXiv:2604.04418v1 Announce Type: cross Abstract: As LLMs are deployed in high-stakes settings, users must judge the correctness of individual responses, often
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Is Prompt Selection Necessary for Task-Free Online Continual Learning?
arXiv:2604.04420v1 Announce Type: cross Abstract: Task-free online continual learning has recently emerged as a realistic paradigm for addressing continual lear
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Training Transformers in Cosine Coefficient Space
arXiv:2604.04440v1 Announce Type: cross Abstract: We parameterize the weight matrices of a transformer in the two-dimensional discrete cosine transform (DCT) do
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Conversational Control with Ontologies for Large Language Models: A Lightweight Framework for Constrained Generation
arXiv:2604.04450v1 Announce Type: cross Abstract: Conversational agents based on Large Language Models (LLMs) have recently emerged as powerful tools for human-
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
DP-OPD: Differentially Private On-Policy Distillation for Language Models
arXiv:2604.04461v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly adapted to proprietary and domain-specific corpora that contain
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Discrete Prototypical Memories for Federated Time Series Foundation Models
arXiv:2604.04475v1 Announce Type: cross Abstract: Leveraging Large Language Models (LLMs) as federated learning (FL)-based time series foundation models offers
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
SLaB: Sparse-Lowrank-Binary Decomposition for Efficient Large Language Models
arXiv:2604.04493v1 Announce Type: cross Abstract: The rapid growth of large language models (LLMs) presents significant deployment challenges due to their massi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
One Model for All: Multi-Objective Controllable Language Models
arXiv:2604.04497v1 Announce Type: cross Abstract: Aligning large language models (LLMs) with human preferences is critical for enhancing LLMs' safety, helpfulne
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
GAIN: Multiplicative Modulation for Domain Adaptation
arXiv:2604.04516v1 Announce Type: cross Abstract: Adapting LLMs to new domains causes forgetting because standard methods (full fine-tuning, LoRA) inject new di
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Multilingual Prompt Localization for Agent-as-a-Judge: Language and Backbone Sensitivity in Requirement-Level Evaluation
arXiv:2604.04532v1 Announce Type: cross Abstract: Evaluation language is typically treated as a fixed English default in agentic code benchmarks, yet we show th
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Mapping the Exploitation Surface: A 10,000-Trial Taxonomy of What Makes LLM Agents Exploit Vulnerabilities
arXiv:2604.04561v1 Announce Type: cross Abstract: LLM agents with tool access can discover and exploit security vulnerabilities. This is known. What is not know
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Paper Espresso: From Paper Overload to Research Insight
arXiv:2604.04562v1 Announce Type: cross Abstract: The accelerating pace of scientific publishing makes it increasingly difficult for researchers to stay current
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervised Finetuning
arXiv:2604.04565v1 Announce Type: cross Abstract: Large Language Models (LLMs) have achieved strong performance in question answering and retrieval-augmented ge
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Ruling Out to Rule In: Contrastive Hypothesis Retrieval for Medical Question Answering
arXiv:2604.04593v1 Announce Type: cross Abstract: Retrieval-augmented generation (RAG) grounds large language models in external medical knowledge, yet standard
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
An AI Teaching Assistant for Motion Picture Engineering
arXiv:2604.04670v1 Announce Type: cross Abstract: The rapid rise of LLMs over the last few years has promoted growing experimentation with LLM-driven AI tutors.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MUXQ: Mixed-to-Uniform Precision MatriX Quantization via Low-Rank Outlier Decomposition
arXiv:2604.04701v1 Announce Type: cross Abstract: Large language models (LLMs) have achieved outstanding performance across a wide range of natural language pro
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
BiST: A Gold Standard Bangla-English Bilingual Corpus for Sentence Structure and Tense Classification with Inter-Annotator Agreement
arXiv:2604.04708v1 Announce Type: cross Abstract: High-quality bilingual resources remain a critical bottleneck for advancing multilingual NLP in low-resource s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
What Makes Good Multilingual Reasoning? Disentangling Reasoning Traces with Measurable Features
arXiv:2604.04720v1 Announce Type: cross Abstract: Large Reasoning Models (LRMs) still exhibit large performance gaps between English and other languages, yet mu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Individual and Combined Effects of English as a Second Language and Typos on LLM Performance
arXiv:2604.04723v1 Announce Type: cross Abstract: Large language models (LLMs) are used globally, and because much of their training data is in English, they ty
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs
arXiv:2604.04732v1 Announce Type: cross Abstract: Large language models (LLMs) are often described as multilingual because they can understand and respond in ma
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Discovering Failure Modes in Vision-Language Models using RL
arXiv:2604.04733v1 Announce Type: cross Abstract: Vision-language Models (VLMs), despite achieving strong performance on multimodal benchmarks, often misinterpr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Hallucination Basins: A Dynamic Framework for Understanding and Controlling LLM Hallucinations
arXiv:2604.04743v1 Announce Type: cross Abstract: Large language models (LLMs) hallucinate: they produce fluent outputs that are factually incorrect. We present
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Cog-DRIFT: Exploration on Adaptively Reformulated Instances Enables Learning from Hard Reasoning Problems
arXiv:2604.04767v1 Announce Type: cross Abstract: Reinforcement learning from verifiable rewards (RLVR) has improved the reasoning abilities of LLMs, yet a fund
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
SkillX: Automatically Constructing Skill Knowledge Bases for Agents
arXiv:2604.04804v1 Announce Type: cross Abstract: Learning from experience is critical for building capable large language model (LLM) agents, yet prevailing se
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection
arXiv:2604.04815v1 Announce Type: cross Abstract: The rapid development of Large Language Models (LLMs) has transformed fake news detection and fact-checking ta
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Plausibility as Commonsense Reasoning: Humans Succeed, Large Language Models Do not
arXiv:2604.04825v1 Announce Type: cross Abstract: Large language models achieve strong performance on many language tasks, yet it remains unclear whether they i
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
InfBaGel: Human-Object-Scene Interaction Generation with Dynamic Perception and Iterative Refinement
arXiv:2604.04843v1 Announce Type: cross Abstract: Human-object-scene interactions (HOSI) generation has broad applications in embodied AI, simulation, and anima
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Strengthening Human-Centric Chain-of-Thought Reasoning Integrity in LLMs via a Structured Prompt Framework
arXiv:2604.04852v1 Announce Type: cross Abstract: Chain-of-Thought (CoT) prompting has been used to enhance the reasoning capability of LLMs. However, its relia
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Noise Immunity in In-Context Tabular Learning: An Empirical Robustness Analysis of TabPFN's Attention Mechanisms
arXiv:2604.04868v1 Announce Type: cross Abstract: Tabular foundation models (TFMs) such as TabPFN (Tabular Prior-Data Fitted Network) are designed to generalize
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Rethinking Exploration in RLVR: From Entropy Regularization to Refinement via Bidirectional Entropy Modulation
arXiv:2604.04894v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has significantly advanced the reasoning capabilities of
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Agentic Federated Learning: The Future of Distributed Training Orchestration
arXiv:2604.04895v1 Announce Type: cross Abstract: Although Federated Learning (FL) promises privacy and distributed collaboration, its effectiveness in real-wor
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Vero: An Open RL Recipe for General Visual Reasoning
arXiv:2604.04917v1 Announce Type: cross Abstract: What does it take to build a visual reasoner that works across charts, science, spatial understanding, and ope
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Your Pre-trained Diffusion Model Secretly Knows Restoration
arXiv:2604.04924v1 Announce Type: cross Abstract: Pre-trained diffusion models have enabled significant advancements in All-in-One Restoration (AiOR), offering
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Early Stopping for Large Reasoning Models via Confidence Dynamics
arXiv:2604.04930v1 Announce Type: cross Abstract: Large reasoning models rely on long chain-of-thought generation to solve complex problems, but extended reason
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning
arXiv:2302.00797v4 Announce Type: replace Abstract: Opponent modeling methods typically involve two crucial steps: building a belief distribution over opponents
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Barriers to Complexity-Theoretic Proofs that "AGI" Using Machine Learning is Impossible
arXiv:2411.06498v2 Announce Type: replace Abstract: A recent paper (van Rooij et al. 2024) claims to have proved that achieving human-like intelligence using le
DeepCamp AI