Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,786

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,453 Reads 5,333

Showing 5,333 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction

arXiv:2603.25209v1 Announce Type: cross Abstract: Generating long videos using pre-trained video diffusion models, which are typically trained on short clips, p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Wireless World Model for AI-Native 6G Networks

arXiv:2603.25216v1 Announce Type: cross Abstract: Integrating AI into the physical layer is a cornerstone of 6G networks. However, current data-driven approache

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

WebTestBench: Evaluating Computer-Use Agents towards End-to-End Automated Web Testing

arXiv:2603.25226v1 Announce Type: cross Abstract: The emergence of Large Language Models (LLMs) has catalyzed a paradigm shift in programming, giving rise to "v

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA

arXiv:2603.25243v1 Announce Type: cross Abstract: Large language models and autonomous agents are increasingly explored for EDA automation, but many existing in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Activation Matters: Test-time Activated Negative Labels for OOD Detection with Vision-Language Models

arXiv:2603.25250v1 Announce Type: cross Abstract: Out-of-distribution (OOD) detection aims to identify samples that deviate from in-distribution (ID). One popul

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Elucidation

arXiv:2603.25253v1 Announce Type: cross Abstract: Large language models (LLMs) hold considerable potential for advancing scientific discovery, yet systematic as

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CRAFT: Grounded Multi-Agent Coordination Under Partial Information

arXiv:2603.25268v1 Announce Type: cross Abstract: We introduce CRAFT, a multi-agent benchmark for evaluating pragmatic communication in large language models un

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Revealing the influence of participant failures on model quality in cross-silo Federated Learning

arXiv:2603.25289v1 Announce Type: cross Abstract: Federated Learning (FL) is a paradigm for training machine learning (ML) models in collaborative settings whil

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer's Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study

arXiv:2603.25322v1 Announce Type: cross Abstract: Alzheimer's disease (AD) is a growing global health challenge as populations age, and timely, accurate diagnos

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

arXiv:2603.25325v1 Announce Type: cross Abstract: Weight pruning is a standard technique for compressing large language models, yet its effect on learned intern

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

GlowQ: Group-Shared LOw-Rank Approximation for Quantized LLMs

arXiv:2603.25385v1 Announce Type: cross Abstract: Quantization techniques such as BitsAndBytes, AWQ, and GPTQ are widely used as a standard method in deploying

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Causal Framework for Evaluating ICU Discharge Strategies

arXiv:2603.25397v1 Announce Type: cross Abstract: In this applied paper, we address the difficult open problem of when to discharge patients from the Intensive

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Shape and Substance: Dual-Layer Side-Channel Attacks on Local Vision-Language Models

arXiv:2603.25403v1 Announce Type: cross Abstract: On-device Vision-Language Models (VLMs) promise data privacy via local execution. However, we show that the ar

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Decidable By Construction: Design-Time Verification for Trustworthy AI

arXiv:2603.25414v1 Announce Type: cross Abstract: A prevailing assumption in machine learning is that model correctness must be enforced after the fact. We obse

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Temporally Decoupled Diffusion Planning for Autonomous Driving

arXiv:2603.25462v1 Announce Type: cross Abstract: Motion planning in dynamic urban environments requires balancing immediate safety with long-term goals. While

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Maximum Entropy Behavior Exploration for Sim2Real Zero-Shot Reinforcement Learning

arXiv:2603.25464v1 Announce Type: cross Abstract: Zero-shot reinforcement learning (RL) algorithms aim to learn a family of policies from a reward-free dataset,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

arXiv:2603.25562v1 Announce Type: cross Abstract: On-policy distillation (OPD) is appealing for large language model (LLM) post-training because it evaluates te

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Are LLMs Overkill for Databases?: A Study on the Finiteness of SQL

arXiv:2603.25568v1 Announce Type: cross Abstract: Translating natural language to SQL for data retrieval has become more accessible thanks to code generation LL

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

TAAC: A gate into Trustable Audio Affective Computing

arXiv:2603.25570v1 Announce Type: cross Abstract: With the emergence of AI techniques for depression diagnosis, the conflict between high demand and limited sup

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification

arXiv:2603.25613v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) have recently been explored as face verification systems that determi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers

arXiv:2603.25638v1 Announce Type: cross Abstract: Through an analysis of arXiv papers, we report several shifts in word usage that are likely driven by large la

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots

arXiv:2603.25646v1 Announce Type: cross Abstract: This paper presents an experimental platform for studying intentional-state attribution toward a non-humanoid

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Measuring What Matters -- or What's Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors

arXiv:2603.25674v1 Announce Type: cross Abstract: Automated systems have been widely adopted across the educational testing industry for open-response assessmen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Unified Memory Perspective for Probabilistic Trustworthy AI

arXiv:2603.25692v1 Announce Type: cross Abstract: Trustworthy artificial intelligence increasingly relies on probabilistic computation to achieve robustness, in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase

arXiv:2603.25697v1 Announce Type: cross Abstract: Code production is now a commodity; the bottleneck is knowing what to build and proving it works. We present t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Neural Network Conversion of Machine Learning Pipelines

arXiv:2603.25699v1 Announce Type: cross Abstract: Transfer learning and knowledge distillation has recently gained a lot of attention in the deep learning commu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

arXiv:2603.25716v1 Announce Type: cross Abstract: Video world models have shown immense potential in simulating the physical world, yet existing memory mechanis

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Natural-Language Agent Harnesses

arXiv:2603.25723v1 Announce Type: cross Abstract: Agent performance increasingly depends on \emph{harness engineering}, yet harness design is usually buried in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving

arXiv:2603.25740v1 Announce Type: cross Abstract: Human driving behavior is inherently personal, which is shaped by long-term habits and influenced by short-ter

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Working Paper: Active Causal Structure Learning with Latent Variables: Towards Learning to Detour in Autonomous Robots

arXiv:2410.20894v3 Announce Type: replace Abstract: Artificial General Intelligence (AGI) Agents and Robots must be able to cope with everchanging environments

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Semi-Strongly solved: a New Definition Leading Computer to Perfect Gameplay

arXiv:2411.01029v2 Announce Type: replace Abstract: Strong solving of perfect-information games certifies optimal play from every reachable position, but the re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Research on environment perception and behavior prediction of intelligent UAV based on semantic communication

arXiv:2501.04480v2 Announce Type: replace Abstract: The convergence of drone delivery systems, virtual worlds, and blockchain has transformed logistics and supp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Concepts Learned Visually by Infants Can Contribute to Visual Learning and Understanding in AI Models

arXiv:2503.03361v3 Announce Type: replace Abstract: Early in development, infants learn to extract surprisingly complex aspects of visual scenes. This early lea

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

arXiv:2504.15780v3 Announce Type: replace Abstract: Geometric problem solving (GPS) requires precise multimodal understanding and rigorous, step-by-step logical

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Interactive Query Answering on Knowledge Graphs with Soft Entity Constraints

arXiv:2508.13663v4 Announce Type: replace Abstract: Methods for query answering over incomplete knowledge graphs retrieve entities that are \emph{likely} to be

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive Reasoning

arXiv:2509.03345v2 Announce Type: replace Abstract: Non-deductive reasoning, encompassing inductive and abductive reasoning, is essential in addressing complex

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

arXiv:2509.23768v2 Announce Type: replace Abstract: The chemical reaction recommendation is to select proper reaction condition parameters for chemical reaction

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Planned Diffusion

arXiv:2510.18087v2 Announce Type: replace Abstract: Most large language models are autoregressive: they generate tokens one at a time. Discrete diffusion langua

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Analysing Environmental Efficiency in AI for X-Ray Diagnosis

arXiv:2511.07436v2 Announce Type: replace Abstract: The integration of AI tools into medical applications has aimed to improve the efficiency of diagnosis. The

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

XGrammar-2: Efficient Dynamic Structured Generation Engine for Agentic LLMs

arXiv:2601.04426v2 Announce Type: replace Abstract: Modern LLM agents increasingly rely on dynamic structured generation, such as tool calling and response prot

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback

arXiv:2603.08561v4 Announce Type: replace Abstract: Standard reinforcement learning (RL) for large language model (LLM) agents typically optimizes extrinsic rew

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Consequentialist Objectives and Catastrophe

arXiv:2603.15017v2 Announce Type: replace Abstract: Because human preferences are too complex to codify, AIs operate with misspecified objectives. Optimizing su

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Characterizing Linear Alignment Across Language Models

arXiv:2603.18908v3 Announce Type: replace Abstract: Language models increasingly appear to learn similar representations, despite differences in training object

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Man and machine: artificial intelligence and judicial decision making

arXiv:2603.19042v2 Announce Type: replace Abstract: The integration of artificial intelligence (AI) technologies into judicial decision-making, particularly in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers

arXiv:2401.11605v2 Announce Type: replace-cross Abstract: We present the Hourglass Diffusion Transformer (HDiT), an image generative model that exhibits linear

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Future of AI-Driven Software Engineering

arXiv:2406.07737v2 Announce Type: replace-cross Abstract: A paradigm shift is underway in Software Engineering, with AI systems such as LLMs playing an increasi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CodeRefine: A Pipeline for Enhancing LLM-Generated Code Implementations of Research Papers

arXiv:2408.13366v2 Announce Type: replace-cross Abstract: This paper presents CodeRefine, a novel framework for automatically transforming research paper method

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts

arXiv:2410.10700v3 Announce Type: replace-cross Abstract: Safety concerns in large language models (LLMs) have gained significant attention due to their exposur