Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

51,159

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 21,471 Reads 29,688

All Reads (29,688) Articles (12625)Blog Posts (5609)Tutorials (2350)Research Papers (8231)News (873)

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Team MKC at CLPsych 2026: Capturing and Characterizing Mental Health Changes through Social Media Timeline Dynamics

arXiv:2606.31464v1 Announce Type: cross Abstract: Recent advances in Large Language Models (LLMs) have motivated their adoption across a wide range of domains,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

FinPersona-Bench: A Benchmark for Longitudinal Psychometric Stability of Autonomous Financial Agents

arXiv:2606.31522v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed as autonomous financial agents initialized with explici

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

On the Convergence of Self-Improving Online LLM Alignment

arXiv:2606.31524v1 Announce Type: cross Abstract: The Self-Improving Alignment (SAIL) algorithm addresses distribution shift by reducing a bilevel formulation o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Mitigating Positional Leakage in 3D Masked Autoencoders for Robust Representation Learning

arXiv:2606.31570v1 Announce Type: cross Abstract: Masked autoencoding has emerged as a prominent paradigm for self-supervised learning on 3D point clouds, achie

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

ZEBRA: Zero-Shot Entropy-Regularized Prompt Learning for Base-to-Novel Generalization in Audio-Language Models

arXiv:2606.31587v1 Announce Type: cross Abstract: Audio-Language Models (ALMs) achieve strong zero-shot performance by aligning audio with textual class descrip

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Evil Spectra: How Optimisers can Amplify or Suppress Emergent Misalignment

arXiv:2606.31591v1 Announce Type: cross Abstract: Emergent misalignment (EM) is a recently discovered phenomenon in LLMs where fine-tuning on a narrow misaligne

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Token-Sparse Medical Multimodal Reasoning via Dual-Stream Reinforcement Learning

arXiv:2606.31599v1 Announce Type: cross Abstract: Vision-language models (VLMs) combining reinforcement learning (RL) ignite remarkable progress in multimodal r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Automating Cause-Effect Specification with Knowledge Graphs and Large Language Models

arXiv:2606.31614v1 Announce Type: cross Abstract: Engineering specifications such as interlocks, alarm rationalization tables, and cause-and-effect (C&E) matric

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

A Tutorial on Autonomous Fault-Tolerant Control Using Knowledge-Grounded LLM Agents

arXiv:2606.31635v1 Announce Type: cross Abstract: Fault recovery in process plants still relies heavily on plant operators, especially when faults fall outside

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

A Lifecycle and Application-Stack Survey of Large Language Model Vulnerabilities: Attacks, Risks, Defenses, and Open Problems

arXiv:2606.31639v1 Announce Type: cross Abstract: Large language models are no longer only text generators. They are increasingly embedded in retrieval pipeline

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

ShopX: A Foundation Model for Intent-to-Item Fulfillment in Agentic Shopping

arXiv:2606.31693v1 Announce Type: cross Abstract: The wave of AI-native applications is moving shopping beyond page- and feed-based browsing toward intent-drive

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Look But Don't Touch with Sparse Autoencoders for Unlearning in Diffusion Models

arXiv:2606.31699v1 Announce Type: cross Abstract: Sparse autoencoders (SAEs) have recently been proposed as interpretable tools for concept-level manipulation,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Cross-lingual Relation Extraction with Large Language Models: Zero-Shot, Few-Shot, and Fine-Tuned Evaluation on Romanian

arXiv:2606.31718v1 Announce Type: cross Abstract: Relation extraction (RE) for low-resource languages is typically constrained by the lack of annotated corpora.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Seeing Is Not Sharing: Some Vision-Language Models Overestimate Common Ground in Asymmetric Dialogue

arXiv:2606.31719v1 Announce Type: cross Abstract: In collaborative dialogue, shared perception does not guarantee shared interpretation. Mutual understanding mu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

STEB: Style Text Embedding Benchmark

arXiv:2606.31741v1 Announce Type: cross Abstract: While semantic embeddings are rigorously evaluated on the Massive Text Embedding Benchmark, the evaluation of

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning

arXiv:2606.31742v1 Announce Type: cross Abstract: Explainable AI (XAI) methods have demonstrated significant success in recent years at identifying relevant fea

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

CHERRY: Compressed Hierarchical Experts with Recurrent Representational Yield

arXiv:2606.31796v1 Announce Type: cross Abstract: We study three complementary techniques for training compute-efficient language models. (1) Selective supervis

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Geometry-Preserving Orthonormal Initialization for Low-Rank Adaptation in RLVR

arXiv:2606.31813v1 Announce Type: cross Abstract: Low-rank adaptation (LoRA) and its variants enable parameter-efficient fine-tuning of large language models un

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning

arXiv:2606.31825v1 Announce Type: cross Abstract: Recent multimodal large language models have shown great promise in clinical image reasoning, but existing pos

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Real-Time Source-Free Object Detection

arXiv:2606.31834v1 Announce Type: cross Abstract: Real-world detectors for autonomous driving, surveillance, and robotics must handle domain-shifts under strict

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Z-1: Efficient Reinforcement Learning for Vision-Language-Action Models

arXiv:2606.31846v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models offer a promising framework for robotic manipulation by connecting languag

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Attend, Transform, or Silence: Operator-Level Visual Skipping for Efficient Multimodal LLM Inference

arXiv:2606.31903v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) increasingly process long visual-token sequences, increasing the over

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

GR2 Technical Report

arXiv:2606.31984v1 Announce Type: cross Abstract: Industrial recommendation systems serve billions of users through a multi-stage funnel -- retrieval, early-sta

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Amplifying Membership Signal Through Chained Regeneration

arXiv:2606.31991v1 Announce Type: cross Abstract: The tendency of large generative models to memorize training data makes sample verification critical for priva

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

AdaJEPA: An Adaptive Latent World Model

arXiv:2606.32026v1 Announce Type: cross Abstract: Latent world models enable planning from high-dimensional observations by predicting future states in a compac

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

When LLMs Read Tables Carelessly: Measuring and Reducing Data Referencing Errors

arXiv:2606.32029v1 Announce Type: cross Abstract: While large language models (LLMs) perform well on table tasks, they still make data referencing errors (DREs)

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs

arXiv:2606.32032v1 Announce Type: cross Abstract: Metacognition is a critical component of intelligence that describes the ability to monitor and regulate one's

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

QVal: Cheaply Evaluating Dense Supervision Signals for Long-Horizon LLM Agents

arXiv:2606.32034v1 Announce Type: cross Abstract: LLM agents increasingly act over long horizons, where a single trajectory can contain hundreds or thousands of

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Introspective Coupling: Self-Explanation Training Tracks Behavioral Change Despite Fixed Supervision

arXiv:2606.32038v1 Announce Type: cross Abstract: When does training language models (LMs) to generate explanations of their predictions yield faithful introspe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Disentangling Reasoning Logic to Resolve Explicit Knowledge Conflicts

arXiv:2508.01273v3 Announce Type: replace Abstract: Explicit knowledge conflicts, occurring when retrieved contexts contain contradictory information, pose a fu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Deductive Logic in Language Models: Horizontal vs Vertical Reasoning

arXiv:2510.09340v2 Announce Type: replace Abstract: Recent language models exhibit significant logical reasoning abilities, yet the mechanisms supporting deduct

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

LLM-Empowered Agentic MAC Protocols: A Dynamic Stackelberg Game Approach

arXiv:2510.10895v2 Announce Type: replace Abstract: Medium Access Control (MAC) protocols, essential for wireless networks, are typically manually configured. W

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Improving LLM Reasoning with Homophily-aware Structural and Semantic Text-Attributed Graph Compression

arXiv:2601.08187v3 Announce Type: replace Abstract: Large language models (LLMs) have demonstrated promising capabilities in Text-Attributed Graph (TAG) underst

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Learning by Surprise: Adaptive Mitigation of Model Collapse in Large Language Models

arXiv:2410.12341v4 Announce Type: replace-cross Abstract: As AI-generated content increasingly populates the web, generative AI models are at growing risk of be

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Verify when Uncertain: Beyond Self-Consistency in Black Box Hallucination Detection

arXiv:2502.15845v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) often hallucinate, limiting their reliability in sensitive applications.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

SAGE: A Search-AuGmented Evaluation of Large Language Models on Free-Form QA

arXiv:2504.07385v3 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) become increasingly used for question-answering (QA), relying on stati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

TraCeS: Learning Per-Timestep Constraint-Violation Credit from Sparse Trajectory-Level Labels

arXiv:2504.12557v3 Announce Type: replace-cross Abstract: Ensuring safe behavior in reinforcement learning (RL) is challenging when safety constraints are impli

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

A Reproducible Benchmark of Lightweight CNNs: Accuracy, Efficiency, and the Impact of Pretrained Initialization

arXiv:2505.03303v3 Announce Type: replace-cross Abstract: Lightweight convolutional neural networks are often compared using results obtained with different tra

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Dataset Construction for Training LLM to Learn Analog Circuit Knowledge

arXiv:2508.10409v3 Announce Type: replace-cross Abstract: This paper constructs a textual dataset for training large language models (LLMs) to learn analog circ

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Optimal Self-Consistency for Efficient Reasoning with Large Language Models

arXiv:2511.12309v2 Announce Type: replace-cross Abstract: Self-consistency (SC) is a widely used test-time inference technique for improving performance in chai

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Revisiting Audio-language Pretraining for Learning General-purpose Audio Representation

arXiv:2511.16757v2 Announce Type: replace-cross Abstract: Audio-language pretraining (ALP) holds promise for learning general-purpose audio representation, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation

arXiv:2512.21002v3 Announce Type: replace-cross Abstract: Distilling the capabilities from a large reasoning model (LRM) to a smaller student model often involv

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

arXiv:2601.04126v3 Announce Type: replace-cross Abstract: GUI agents that interact with graphical interfaces on behalf of users represent a promising direction

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

From Similarity to Vulnerability: Key Collision Attack on LLM Semantic Caching

arXiv:2601.23088v2 Announce Type: replace-cross Abstract: Semantic caching has emerged as a pivotal technique for scaling LLM applications, widely adopted by ma

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

DeXposure-FM: A Time-series, Graph Foundation Model for Credit Exposures and Stability on Decentralized Financial Networks

arXiv:2602.03981v2 Announce Type: replace-cross Abstract: Credit exposure in Decentralized Finance (DeFi) is often implicit and token-mediated, creating a dense

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models

arXiv:2603.12893v2 Announce Type: replace-cross Abstract: Reinforcement learning (RL) has become a standard technique for post-training diffusion-based image sy

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Visual Prompt Discovery via Semantic Exploration

arXiv:2603.16250v2 Announce Type: replace-cross Abstract: LVLMs encounter significant challenges in image understanding and visual reasoning, leading to critica

Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5 [03:59:05]

Dev.to · anon1 anon1 🧠 Large Language Models ⚡ AI Lesson 3d ago

Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5 [03:59:05]

Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5 ...