Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,572

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,408 Reads 5,164

Showing 5,164 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

BioCOMPASS: Integrating Biomarkers into Transformer-Based Immunotherapy Response Prediction

arXiv:2604.00739v1 Announce Type: cross Abstract: Datasets used in immunotherapy response prediction are typically small in size, as well as diverse in cancer t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

IWP: Token Pruning as Implicit Weight Pruning in Large Vision Language Models

arXiv:2604.00757v1 Announce Type: cross Abstract: Large Vision Language Models show impressive performance across image and video understanding tasks, yet their

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Thinking Wrong in Silence: Backdoor Attacks on Continuous Latent Reasoning

arXiv:2604.00770v1 Announce Type: cross Abstract: A new generation of language models reasons entirely in continuous hidden states, producing no tokens and leav

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Scalable Pretraining of Large Mixture of Experts Language Models on Aurora Super Computer

arXiv:2604.00785v1 Announce Type: cross Abstract: Pretraining Large Language Models (LLMs) from scratch requires massive amount of compute. Aurora super compute

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Routing-Free Mixture-of-Experts

arXiv:2604.00801v1 Announce Type: cross Abstract: Standard Mixture-of-Experts (MoE) models rely on centralized routing mechanisms that introduce rigid inductive

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Emotion Entanglement and Bayesian Inference for Multi-Dimensional Emotion Understanding

arXiv:2604.00819v1 Announce Type: cross Abstract: Understanding emotions in natural language is inherently a multi-dimensional reasoning problem, where multiple

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

arXiv:2604.00830v1 Announce Type: cross Abstract: Test-Time Learning (TTL) enables language agents to iteratively refine their performance through repeated inte

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

KUET at StanceNakba Shared Task: StanceMoE: Mixture-of-Experts Architecture for Stance Detection

arXiv:2604.00878v1 Announce Type: cross Abstract: Actor-level stance detection aims to determine an author expressed position toward specific geopolitical actor

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding

arXiv:2604.00886v1 Announce Type: cross Abstract: Document understanding and GUI interaction are among the highest-value applications of Vision-Language Models

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

WARP: Guaranteed Inner-Layer Repair of NLP Transformers

arXiv:2604.00938v1 Announce Type: cross Abstract: Transformer-based NLP models remain vulnerable to adversarial perturbations, yet existing repair methods face

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Dual Optimal: Make Your LLM Peer-like with Dignity

arXiv:2604.00979v1 Announce Type: cross Abstract: Current aligned language models exhibit a dual failure mode we term the Evasive Servant: they sycophantically

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Query-Conditioned Evidential Keyframe Sampling for MLLM-Based Long-Form Video Understanding

arXiv:2604.01002v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) have shown strong performance on video question answering, but their

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Fast and Accurate Probing of In-Training LLMs' Downstream Performances

arXiv:2604.01025v1 Announce Type: cross Abstract: The paradigm of scaling Large Language Models (LLMs) in both parameter size and test time has pushed the bound

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Revision or Re-Solving? Decomposing Second-Pass Gains in Multi-LLM Pipelines

arXiv:2604.01029v1 Announce Type: cross Abstract: Multi-LLM revision pipelines, in which a second model reviews and improves a draft produced by a first, are wi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Automated Framework to Evaluate and Harden LLM System Instructions against Encoding Attacks

arXiv:2604.01039v1 Announce Type: cross Abstract: System Instructions in Large Language Models (LLMs) are commonly used to enforce safety policies, define agent

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

TRACE: Training-Free Partial Audio Deepfake Detection via Embedding Trajectory Analysis of Speech Foundation Models

arXiv:2604.01083v1 Announce Type: cross Abstract: Partial audio deepfakes, where synthesized segments are spliced into genuine recordings, are particularly dece

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Temporal Dependencies in In-Context Learning: The Role of Induction Heads

arXiv:2604.01094v1 Announce Type: cross Abstract: Large language models (LLMs) exhibit strong in-context learning capabilities, but how they track and retrieve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning

arXiv:2604.01152v1 Announce Type: cross Abstract: We present Brainstacks, a modular architecture for continual multi-domain fine-tuning of large language models

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Online Reasoning Calibration: Test-Time Training Enables Generalizable Conformal LLM Reasoning

arXiv:2604.01170v1 Announce Type: cross Abstract: While test-time scaling has enabled large language models to solve highly difficult tasks, state-of-the-art re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Screening Is Enough

arXiv:2604.01178v1 Announce Type: cross Abstract: A core limitation of standard softmax attention is that it does not define a notion of absolute query--key rel

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget

arXiv:2604.01195v1 Announce Type: cross Abstract: Search agents, which integrate language models (LMs) with web search, are becoming crucial for answering compl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Code Comprehension then Auditing for Unsupervised LLM Evaluation

arXiv:2410.03131v4 Announce Type: replace Abstract: Large Language Models (LLMs) for unsupervised code correctness evaluation have recently gained attention bec

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG

arXiv:2501.09136v4 Announce Type: replace Abstract: Large Language Models (LLMs) have advanced artificial intelligence by enabling human-like text generation an

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment

arXiv:2503.02976v3 Announce Type: replace Abstract: Large language models (LLMs), initially developed for generative AI, are now evolving into agentic AI system

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering

arXiv:2505.12189v3 Announce Type: replace Abstract: Large language models (LLMs) exhibit reasoning biases, often conflating content plausibility with formal log

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

LocationReasoner: Evaluating LLMs on Real-World Site Selection Reasoning

arXiv:2506.13841v3 Announce Type: replace Abstract: Recent advances in large language models (LLMs), particularly those enhanced through reinforced post-trainin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

HiMA-Ecom: Enabling Joint Training of Hierarchical Multi-Agent E-commerce Assistants

arXiv:2506.19846v2 Announce Type: replace Abstract: Hierarchical multi-agent systems based on large language models (LLMs) have become a common paradigm for bui

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Auto-Formulating Dynamic Programming Problems with Large Language Models

arXiv:2507.11737v2 Announce Type: replace Abstract: Dynamic programming (DP) is a fundamental method in operations research, but formulating DP models has tradi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Retrieval-of-Thought: Efficient Reasoning via Reusing Thoughts

arXiv:2509.21743v2 Announce Type: replace Abstract: Large reasoning models improve accuracy by producing long reasoning traces, but this inflates latency and co

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents

arXiv:2509.25302v2 Announce Type: replace Abstract: The prevalent deployment of Large Language Model agents such as OpenClaw unlocks potential in real-world app

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming

arXiv:2510.18314v2 Announce Type: replace Abstract: As large language model (LLM) agents increasingly automate complex web tasks, they boost productivity while

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

EHRStruct: A Comprehensive Benchmark Framework for Evaluating Large Language Models on Structured Electronic Health Record Tasks

arXiv:2511.08206v4 Announce Type: replace Abstract: Structured Electronic Health Record (EHR) data stores patient information in relational tables and plays a c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper 3w ago

DR-LoRA: Dynamic Rank LoRA for Fine-Tuning Mixture-of-Experts Models

arXiv:2601.04823v4 Announce Type: replace Abstract: Mixture-of-Experts (MoE) has become a prominent paradigm for scaling Large Language Models (LLMs). Parameter

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Distilling the Thought, Watermarking the Answer: A Principle Semantic Guided Watermark for Large Reasoning Models

arXiv:2601.05144v2 Announce Type: replace Abstract: Reasoning Large Language Models (RLLMs) excelling in complex tasks present unique challenges for digital wat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Finite-State Controllers for (Hidden-Model) POMDPs using Deep Reinforcement Learning

arXiv:2602.08734v2 Announce Type: replace Abstract: Solving partially observable Markov decision processes (POMDPs) requires computing policies under imperfect

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Meta-Learning and Meta-Reinforcement Learning -- Tracing the Path towards DeepMind's Adaptive Agent

arXiv:2602.19837v2 Announce Type: replace Abstract: Humans are highly effective at utilizing prior knowledge to adapt to novel tasks, a capability that standard

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Epistemic Filtering and Collective Hallucination: A Jury Theorem for Confidence-Calibrated Agents

arXiv:2602.22413v2 Announce Type: replace Abstract: We investigate the collective accuracy of heterogeneous agents who learn to estimate their own reliability o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

When Agents Persuade: Rhetoric Generation and Mitigation in LLMs

arXiv:2603.04636v2 Announce Type: replace Abstract: Despite their wide-ranging benefits, LLM-based agents deployed in open environments can be exploited to prod

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

How Blind and Low-Vision Individuals Prefer Large Vision-Language Model-Generated Scene Descriptions

arXiv:2502.14883v3 Announce Type: replace-cross Abstract: For individuals with blindness or low vision (BLV), navigating complex environments can pose serious r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Neural Conditional Transport Maps

arXiv:2505.15808v2 Announce Type: replace-cross Abstract: We present a neural framework for learning conditional optimal transport (OT) maps between probability

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

But what is your honest answer? Aiding LLM-judges with honest alternatives using steering vectors

arXiv:2505.17760v3 Announce Type: replace-cross Abstract: LLM-as-a-judge is widely used as a scalable substitute for human evaluation, yet current approaches re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Graceful Forgetting in Generative Language Models

arXiv:2505.19715v2 Announce Type: replace-cross Abstract: Recently, the pretrain-finetune paradigm has become a cornerstone in various deep learning areas. Whil

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

How Does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

arXiv:2505.21505v3 Announce Type: replace-cross Abstract: Multilingual Alignment is an effective and representative paradigm to enhance LLMs' multilingual capab

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

"Is This Really a Human Peer Supporter?": Misalignments Between Peer Supporters and Experts in LLM-Supported Interactions

arXiv:2506.09354v2 Announce Type: replace-cross Abstract: Mental health is a growing global concern, prompting interest in AI-driven solutions to expand access

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

MemeMind: A Large-Scale Multimodal Dataset with Chain-of-Thought Reasoning for Harmful Meme Detection

arXiv:2506.18919v4 Announce Type: replace-cross Abstract: As a multimodal medium combining images and text, memes frequently convey implicit harmful content thr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

arXiv:2508.07629v4 Announce Type: replace-cross Abstract: We present Klear-Reasoner, a model with long reasoning capabilities that demonstrates careful delibera

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

FedKLPR: KL-Guided Pruning-Aware Federated Learning for Person Re-Identification

arXiv:2508.17431v2 Announce Type: replace-cross Abstract: Person re-identification (re-ID) is a fundamental task in intelligent surveillance and public safety.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Polychromic Objectives for Reinforcement Learning

arXiv:2509.25424v4 Announce Type: replace-cross Abstract: Reinforcement learning fine-tuning (RLFT) is a dominant paradigm for improving pretrained policies for