Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,483

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,394 Reads 5,089

Showing 5,089 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

CritBench: A Framework for Evaluating Cybersecurity Capabilities of Large Language Models in IEC 61850 Digital Substation Environments

arXiv:2604.06019v1 Announce Type: cross Abstract: The advancement of Large Language Models (LLMs) has raised concerns regarding their dual-use potential in cybe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

A Multi-Stage Validation Framework for Trustworthy Large-scale Clinical Information Extraction using Large Language Models

arXiv:2604.06028v1 Announce Type: cross Abstract: Large language models (LLMs) show promise for extracting clinically meaningful information from unstructured h

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Stories of Your Life as Others: A Round-Trip Evaluation of LLM-Generated Life Stories Conditioned on Rich Psychometric Profiles

arXiv:2604.06071v1 Announce Type: cross Abstract: Personality traits are richly encoded in natural language, and large language models (LLMs) trained on human t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning

arXiv:2604.06079v1 Announce Type: cross Abstract: Graphics Program Synthesis is pivotal for interpreting and editing visual data, effectively facilitating the r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LAG-XAI: A Lie-Inspired Affine Geometric Framework for Interpretable Paraphrasing in Transformer Latent Spaces

arXiv:2604.06086v1 Announce Type: cross Abstract: Modern Transformer-based language models achieve strong performance in natural language processing tasks, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Social Dynamics as Critical Vulnerabilities that Undermine Objective Decision-Making in LLM Collectives

arXiv:2604.06091v1 Announce Type: cross Abstract: Large language model (LLM) agents are increasingly acting as human delegates in multi-agent environments, wher

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LLM4CodeRE: Generative AI for Code Decompilation Analysis and Reverse Engineering

arXiv:2604.06095v1 Announce Type: cross Abstract: Code decompilation analysis is a fundamental yet challenging task in malware reverse engineering, particularly

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer

arXiv:2604.06129v1 Announce Type: cross Abstract: This paper introduces the Polynomial Mixer (PoM), a novel token mixing mechanism with linear complexity that s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Shot-Based Quantum Encoding: A Data-Loading Paradigm for Quantum Neural Networks

arXiv:2604.06135v1 Announce Type: cross Abstract: Efficient data loading remains a bottleneck for near-term quantum machine-learning. Existing schemes (angle, a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Generating Synthetic Doctor-Patient Conversations for Long-form Audio Summarization

arXiv:2604.06138v1 Announce Type: cross Abstract: Long-context audio reasoning is underserved in both training data and evaluation. Existing benchmarks target s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement

arXiv:2604.06155v1 Announce Type: cross Abstract: Whether Large Language Models (LLMs) develop coherent internal world models remains a core debate. While conve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control

arXiv:2604.06156v1 Announce Type: cross Abstract: MLLMs have been successfully applied to multimodal embedding tasks, yet their generative reasoning capabilitie

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

In-Place Test-Time Training

arXiv:2604.06169v1 Announce Type: cross Abstract: The static ``train then deploy" paradigm fundamentally limits Large Language Models (LLMs) from dynamically ad

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Advancing AI Research Assistants with Expert-Involved Learning

arXiv:2505.04638v5 Announce Type: replace Abstract: Large language models (LLMs) and large multimodal models (LMMs) promise to accelerate biomedical discovery,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Beyond Syntax: Action Semantics Learning for App Agents

arXiv:2506.17697v3 Announce Type: replace Abstract: The recent development of Large Language Models (LLMs) enables the rise of App agents that interpret user in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

URSA: The Universal Research and Scientific Agent

arXiv:2506.22653v2 Announce Type: replace Abstract: Large language models (LLMs) have moved far beyond their initial form as simple chatbots, now carrying out c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

MedGemma Technical Report

arXiv:2507.05201v4 Announce Type: replace Abstract: Artificial intelligence (AI) has significant potential in healthcare applications, but its training and depl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Multiplayer Nash Preference Optimization

arXiv:2509.23102v3 Announce Type: replace Abstract: Reinforcement learning from human feedback (RLHF) has emerged as the standard paradigm for aligning large la

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

arXiv:2509.25454v4 Announce Type: replace Abstract: Although RLVR has become an essential component for developing advanced reasoning skills in language models,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Hypothesis-Driven Feature Manifold Analysis in LLMs via Supervised Multi-Dimensional Scaling

arXiv:2510.01025v2 Announce Type: replace Abstract: The linear representation hypothesis states that language models (LMs) encode concepts as directions in thei

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering

arXiv:2510.07432v2 Announce Type: replace Abstract: Large language models (LLMs) exhibit strong symbolic and compositional reasoning, yet they struggle with tim

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

DRIFT: Decompose, Retrieve, Illustrate, then Formalize Theorems

arXiv:2510.10815v4 Announce Type: replace Abstract: Automating the formalization of mathematical statements for theorem proving remains a major challenge for La

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Toward Virtuous Reinforcement Learning: A Critique and Roadmap

arXiv:2512.04246v2 Announce Type: replace Abstract: This paper critiques common patterns in machine ethics for Reinforcement Learning (RL) and argues for a virt

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

RL-VLA$^3$: A Flexible and Asynchronous Reinforcement Learning Framework for VLA Training

arXiv:2602.05765v2 Announce Type: replace Abstract: Reinforcement learning (RL) has emerged as a critical paradigm for post-training Vision-Language-Action (VLA

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Emergent Introspection in AI is Content-Agnostic

arXiv:2603.05414v2 Announce Type: replace Abstract: Introspection is a foundational cognitive ability, but its mechanism is not well understood. Recent work has

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling

arXiv:2603.21357v2 Announce Type: replace Abstract: LLM agents fail on the majority of real-world tasks -- GPT-4o succeeds on fewer than 15% of WebArena navigat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

arXiv:2604.01591v2 Announce Type: replace Abstract: We introduce ThinkTwice, a simple two-phase framework that jointly optimizes LLMs to solve reasoning problem

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models

arXiv:2407.14971v3 Announce Type: replace-cross Abstract: Vision-Language Models (VLMs) rely heavily on pretrained vision encoders to support downstream tasks s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

TransAgent: Enhancing LLM-Based Code Translation via Fine-Grained Execution Alignment

arXiv:2409.19894v5 Announce Type: replace-cross Abstract: Code translation transforms code between programming languages while preserving functionality, which i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Cobblestone: A Divide-and-Conquer Approach for Automating Formal Verification

arXiv:2410.19940v4 Announce Type: replace-cross Abstract: Formal verification using proof assistants, such as Coq, is an effective way of improving software qua

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap

arXiv:2410.20791v3 Announce Type: replace-cross Abstract: The rapid expansion of foundation models (FMs), such as large language models (LLMs), has given rise t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models

arXiv:2411.05961v2 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) are central to Visual Question Answering (VQA) systems and are typically

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Retrieval Augmented Time Series Forecasting

arXiv:2411.08249v2 Announce Type: replace-cross Abstract: Retrieval-augmented generation (RAG) is a central component of modern LLM systems, particularly in sce

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

ENTER: Event Based Interpretable Reasoning for VideoQA

arXiv:2501.14194v2 Announce Type: replace-cross Abstract: In this paper, we present ENTER, an interpretable Video Question Answering (VideoQA) system based on e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification

arXiv:2502.17421v3 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) can now process extremely long contexts, efficient inference over thes

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Hedging and Non-Affirmation: Quantifying LLM Alignment on Questions of Human Rights

arXiv:2502.19463v2 Announce Type: replace-cross Abstract: Hedging and non-affirmation are behaviors exhibited by large language models (LLMs) that limit the cle

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

NativQA Framework: Enabling LLMs and VLMs with Native, Local, and Everyday Knowledge

arXiv:2504.05995v3 Announce Type: replace-cross Abstract: The rapid progress of large language models (LLMs) raises concerns about cultural bias, fairness, and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Phonetic Perturbations Reveal Tokenizer-Rooted Safety Gaps in LLMs

arXiv:2505.14226v5 Announce Type: replace-cross Abstract: Safety-aligned LLMs remain vulnerable to digital phenomena like textese that introduce non-canonical p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Synthesis of discrete-continuous quantum circuits with multimodal diffusion models

arXiv:2506.01666v3 Announce Type: replace-cross Abstract: Efficiently compiling quantum operations remains a major bottleneck in scaling quantum computing. Toda

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

HeartcareGPT: A Unified Multimodal ECG Suite for Dual Signal-Image Modeling and Understanding

arXiv:2506.05831v4 Announce Type: replace-cross Abstract: Although electrocardiograms (ECG) play a dominant role in cardiovascular diagnosis and treatment, thei

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

A Survey of Continual Reinforcement Learning

arXiv:2506.21872v2 Announce Type: replace-cross Abstract: Reinforcement Learning (RL) is an important machine learning paradigm for solving sequential decision-

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Enhancing Hallucination Detection via Future Context

arXiv:2507.20546v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are widely used to generate plausible text on online platforms, without r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization

arXiv:2509.17183v2 Announce Type: replace-cross Abstract: Alignment plays a crucial role in Large Language Models (LLMs) in aligning with human preferences on a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

A State-Update Prompting Strategy for Efficient and Robust Multi-turn Dialogue

arXiv:2509.17766v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) struggle with information forgetting and inefficiency in long-horizon, mu

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Dissecting Transformers: A CLEAR Perspective towards Green AI

arXiv:2510.02810v2 Announce Type: replace-cross Abstract: The rapid adoption of Large Language Models (LLMs) has raised significant environmental concerns. Unli

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Reveal-to-Revise: Explainable Bias-Aware Generative Modeling with Multimodal Attention

arXiv:2510.12957v3 Announce Type: replace-cross Abstract: We present an explainable, bias-aware generative framework that unifies cross-modal attention fusion,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Unlocking the Potential of Diffusion Language Models through Template Infilling

arXiv:2510.13870v2 Announce Type: replace-cross Abstract: Diffusion Language Models (DLMs) have emerged as a promising alternative to Autoregressive Language Mo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago

Knowledge Reasoning Language Model: Unifying Knowledge and Language for Inductive Knowledge Graph Reasoning

arXiv:2510.13909v2 Announce Type: replace-cross Abstract: Inductive Knowledge Graph Reasoning (KGR) aims to discover facts in open-domain KGs containing unknown