Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,966

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,461 Reads 5,505

Showing 5,505 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Self Paced Gaussian Contextual Reinforcement Learning

arXiv:2603.23755v1 Announce Type: cross Abstract: Curriculum learning improves reinforcement learning (RL) efficiency by sequencing tasks from simple to complex

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Human-in-the-Loop Pareto Optimization: Trade-off Characterization for Assist-as-Needed Training and Performance Evaluation

arXiv:2603.23777v1 Announce Type: cross Abstract: During human motor skill training and physical rehabilitation, there is an inherent trade-off between task dif

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models

arXiv:2603.23783v2 Announce Type: cross Abstract: Adapting large-scale foundation models to new domains with limited supervision remains a fundamental challenge

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Cognitive Firewall:Securing Browser Based AI Agents Against Indirect Prompt Injection Via Hybrid Edge Cloud Defense

arXiv:2603.23791v1 Announce Type: cross Abstract: Deploying large language models (LLMs) as autonomous browser agents exposes a significant attack surface in th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Object Search in Partially-Known Environments via LLM-informed Model-based Planning and Prompt Selection

arXiv:2603.23800v1 Announce Type: cross Abstract: We present a novel LLM-informed model-based planning framework, and a novel prompt selection method, for objec

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Deep Neural Regression Collapse

arXiv:2603.23805v1 Announce Type: cross Abstract: Neural Collapse is a phenomenon that helps identify sparse and low rank structures in deep classifiers. Recent

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Willful Disobedience: Automatically Detecting Failures in Agentic Traces

arXiv:2603.23806v1 Announce Type: cross Abstract: AI agents are increasingly embedded in real software systems, where they execute multi-step workflows through

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Perturbation: A simple and efficient adversarial tracer for representation learning in language models

arXiv:2603.23821v1 Announce Type: cross Abstract: Linguistic representation learning in deep neural language models (LMs) has been studied for decades, for both

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Circuit Complexity of Hierarchical Knowledge Tracing and Implications for Log-Precision Transformers

arXiv:2603.23823v1 Announce Type: cross Abstract: Knowledge tracing models mastery over interconnected concepts, often organized by prerequisites. We analyze hi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

PoliticsBench: Benchmarking Political Values in Large Language Models with Multi-Turn Roleplay

arXiv:2603.23841v1 Announce Type: cross Abstract: While Large Language Models (LLMs) are increasingly used as primary sources of information, their potential fo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Generative AI User Experience: Developing Human--AI Epistemic Partnership

arXiv:2603.23863v1 Announce Type: cross Abstract: Generative AI (GenAI) has rapidly entered education, yet its user experience is often explained through adopti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Can VLMs Reason Robustly? A Neuro-Symbolic Investigation

arXiv:2603.23867v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) have been applied to a wide range of reasoning tasks, yet it remains unclear whe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

arXiv:2603.23871v1 Announce Type: cross Abstract: Large language models trained with reinforcement learning (RL) for mathematical reasoning face a fundamental c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Luna Bound Propagator for Formal Analysis of Neural Networks

arXiv:2603.23878v1 Announce Type: cross Abstract: The parameterized CROWN analysis, a.k.a., alpha-CROWN, has emerged as a practically successful bound propagati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Latent Bias Alignment for High-Fidelity Diffusion Inversion in Real-World Image Reconstruction and Manipulation

arXiv:2603.23903v1 Announce Type: cross Abstract: Recent research has shown that text-to-image diffusion models are capable of generating high-quality images gu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Self-Distillation for Multi-Token Prediction

arXiv:2603.23911v1 Announce Type: cross Abstract: As Large Language Models (LLMs) scale up, inference efficiency becomes a critical bottleneck. Multi-Token Pred

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DecepGPT: Schema-Driven Deception Detection with Multicultural Datasets and Robust Multimodal Learning

arXiv:2603.23916v1 Announce Type: cross Abstract: Multimodal deception detection aims to identify deceptive behavior by analyzing audiovisual cues for forensics

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Policy-Guided Threat Hunting: An LLM enabled Framework with Splunk SOC Triage

arXiv:2603.23966v1 Announce Type: cross Abstract: With frequently evolving Advanced Persistent Threats (APTs) in cyberspace, traditional security solutions appr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More

arXiv:2603.23971v1 Announce Type: cross Abstract: Developers and consumers increasingly choose reasoning language models (RLMs) based on their listed API prices

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Untamed Black Box to Interpretable Pedagogical Orchestration: The Ensemble of Specialized LLMs Architecture for Adaptive Tutoring

arXiv:2603.23990v1 Announce Type: cross Abstract: Monolithic Large Language Models (LLMs) used in educational dialogue often behave as "black boxes," where peda

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Understanding the Challenges in Iterative Generative Optimization with LLMs

arXiv:2603.23994v1 Announce Type: cross Abstract: Generative optimization uses large language models (LLMs) to iteratively improve artifacts (such as code, work

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Schema on the Inside: A Two-Phase Fine-Tuning Method for High-Efficiency Text-to-SQL at Scale

arXiv:2603.24023v1 Announce Type: cross Abstract: Applying large, proprietary API-based language models to text-to-SQL tasks poses a significant industry challe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs

arXiv:2603.24034v1 Announce Type: cross Abstract: Contextual automatic speech recognition (ASR) with Speech-LLMs is typically trained with oracle conversation h

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Mitigating Object Hallucinations in LVLMs via Attention Imbalance Rectification

arXiv:2603.24058v1 Announce Type: cross Abstract: Object hallucination in Large Vision-Language Models (LVLMs) severely compromises their reliability in real-wo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm

arXiv:2603.24079v1 Announce Type: cross Abstract: Recently, multimodal large language models (MLLMs) have emerged as a unified paradigm for language and image g

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Knowledge-Guided Manipulation Using Multi-Task Reinforcement Learning

arXiv:2603.24083v1 Announce Type: cross Abstract: This paper introduces Knowledge Graph based Massively Multi-task Model-based Policy Optimization (KG-M3PO), a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization

arXiv:2603.24093v1 Announce Type: cross Abstract: Recently, reinforcement learning~(RL) has become an important approach for improving the capabilities of large

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation

arXiv:2603.24124v1 Announce Type: cross Abstract: RLHF-aligned language models exhibit response homogenization: on TruthfulQA (n=790), 40-79% of questions produ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare

arXiv:2603.24132v1 Announce Type: cross Abstract: Conversational artificial intelligence has the potential to assist users in preliminary medical consultations,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula

arXiv:2603.24202v1 Announce Type: cross Abstract: Reinforcement learning (RL) has emerged as a powerful paradigm for improving large language models beyond supe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search

arXiv:2603.24203v1 Announce Type: cross Abstract: Recent advances in the Model Context Protocol (MCP) have enabled large language models (LLMs) to invoke extern

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Powerful Teachers Matter: Text-Guided Multi-view Knowledge Distillation with Visual Prior Enhancement

arXiv:2603.24208v1 Announce Type: cross Abstract: Knowledge distillation transfers knowledge from large teacher models to smaller students for efficient inferen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Uncovering Memorization in Timeseries Imputation models: LBRM Membership Inference and its link to attribute Leakage

arXiv:2603.24213v1 Announce Type: cross Abstract: Deep learning models for time series imputation are now essential in fields such as healthcare, the Internet o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias

arXiv:2603.24218v1 Announce Type: cross Abstract: Large Language Models (LLMs) enhanced with Retrieval-Augmented Generation (RAG) have achieved substantial impr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing

arXiv:2603.24221v1 Announce Type: cross Abstract: The increasing complexity and interconnectivity of digital infrastructures make scalable and reliable security

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DVM: Real-Time Kernel Generation for Dynamic AI Models

arXiv:2603.24239v1 Announce Type: cross Abstract: Dynamism is common in AI computation, e.g., the dynamic tensor shapes and the dynamic control flows in models.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep

arXiv:2603.24260v1 Announce Type: cross Abstract: Diffusion-based video editing has emerged as an important paradigm for high-quality and flexible content gener

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

The Specification Gap: Coordination Failure Under Partial Knowledge in Code Agents

arXiv:2603.24284v1 Announce Type: cross Abstract: When multiple LLM-based code agents independently implement parts of the same class, they must agree on shared

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning

arXiv:2603.24324v1 Announce Type: cross Abstract: Designing effective auxiliary rewards for cooperative multi-agent systems remains a precarious task; misaligne

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

arXiv:2603.24329v1 Announce Type: cross Abstract: Multimodal LLMs are increasingly deployed as perceptual backbones for autonomous agents in 3D environments, fr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Evidence of an Emergent "Self" in Continual Robot Learning

arXiv:2603.24350v1 Announce Type: cross Abstract: A key challenge to understanding self-awareness has been a principled way of quantifying whether an intelligen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

MolEvolve: LLM-Guided Evolutionary Search for Interpretable Molecular Optimization

arXiv:2603.24382v1 Announce Type: cross Abstract: Despite deep learning's success in chemistry, its impact is hindered by a lack of interpretability and an inab

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools

arXiv:2603.24389v1 Announce Type: cross Abstract: High-quality teacher-child interaction (TCI) is fundamental to early childhood development, yet traditional ex

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

arXiv:2603.24414v1 Announce Type: cross Abstract: OpenClaw has rapidly established itself as a leading open-source autonomous agent runtime, offering powerful c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework

arXiv:2603.24422v1 Announce Type: cross Abstract: Generative Retrieval (GR) has emerged as a promising paradigm for modern search systems. Compared to multi-sta

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Enes Causal Discovery

arXiv:2603.24436v1 Announce Type: cross Abstract: Enes The proposed architecture is a mixture of experts, which allows for the model entities, such as the causa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

arXiv:2603.24440v1 Announce Type: cross Abstract: Computer-use agents (CUAs) hold great promise for automating complex desktop workflows, yet progress toward ge

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

arXiv:2603.24511v1 Announce Type: cross Abstract: LLM agents like Claude Code can not only write code but also be used for autonomous AI research and engineerin