Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,805

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,454 Reads 5,351

Showing 5,351 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Guidelines to Guarantees: A Graph-Based Evaluation Harness for Domain-Specific Evaluation of LLMs

arXiv:2508.20810v2 Announce Type: replace Abstract: Rigorous evaluation of domain-specific language models requires benchmarks that are comprehensive, contamina

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

GeoSketch: A Neural-Symbolic Approach to Geometric Multimodal Reasoning with Auxiliary Line Construction and Affine Transformation

arXiv:2509.22460v3 Announce Type: replace Abstract: Geometric Problem Solving (GPS) poses a unique challenge for Multimodal Large Language Models (MLLMs), requi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

SAG-Agent: Enabling Long-Horizon Reasoning in Strategy Games via Dynamic Knowledge Graphs

arXiv:2510.15259v3 Announce Type: replace Abstract: Most commodity software lacks accessible Application Programming Interfaces (APIs), requiring autonomous age

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

arXiv:2512.16917v3 Announce Type: replace Abstract: Large language models (LLMs) with explicit reasoning capabilities excel at mathematical reasoning yet still

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering

arXiv:2601.10402v5 Announce Type: replace Abstract: The advancement of artificial intelligence toward agentic science is currently bottlenecked by the challenge

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Are LLMs Smarter Than Chimpanzees? An Evaluation on Perspective Taking and Knowledge State Estimation

arXiv:2601.12410v2 Announce Type: replace Abstract: Cognitive anthropology suggests that the distinction of human intelligence lies in the ability to infer othe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

CollectiveKV: Decoupling and Sharing Collaborative Information in Sequential Recommendation

arXiv:2601.19178v2 Announce Type: replace Abstract: Sequential recommendation models are widely used in applications, yet they face stringent latency requiremen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Agentified Assessment of Logical Reasoning Agents

arXiv:2603.02788v3 Announce Type: replace Abstract: We present a framework for evaluating and benchmarking logical reasoning agents when assessment itself must

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning

arXiv:2603.03072v2 Announce Type: replace Abstract: Large language models (LLMs) are increasingly used to assist scientists across diverse workflows. A key chal

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

GPT4o-Receipt: A Dataset and Human Study for AI-Generated Document Forensics

arXiv:2603.11442v2 Announce Type: replace Abstract: Can humans detect AI-generated financial documents better than machines? We present GPT4o-Receipt, a benchma

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Relationship-Aware Safety Unlearning for Multimodal LLMs

arXiv:2603.14185v3 Announce Type: replace Abstract: Generative multimodal models can exhibit safety failures that are inherently relational: two benign concepts

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation

arXiv:2603.21430v2 Announce Type: replace Abstract: Large language models (LLMs) have shown impressive capabilities in code generation. However, because most LL

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Human strategic decision making in parametrized games

arXiv:2104.14744v5 Announce Type: replace-cross Abstract: Many real-world games contain parameters which can affect payoffs, action spaces, and information stat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

A Comprehensive Survey on Enterprise Financial Risk Analysis from Big Data and LLMs Perspective

arXiv:2211.14997v5 Announce Type: replace-cross Abstract: Enterprise financial risk analysis aims at predicting the future financial risk of enterprises. Due to

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Moonwalk: Inverse-Forward Differentiation

arXiv:2402.14212v2 Announce Type: replace-cross Abstract: Backpropagation's main limitation is its need to store intermediate activations (residuals) during the

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Dynamic Neural Potential Field: Online Trajectory Optimization in the Presence of Moving Obstacles

arXiv:2410.06819v3 Announce Type: replace-cross Abstract: Generalist robot policies must operate safely and reliably in everyday human environments such as home

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Symmetry-Guided Memory Augmentation for Efficient Locomotion Learning

arXiv:2502.01521v4 Announce Type: replace-cross Abstract: Training reinforcement learning (RL) policies for legged locomotion often requires extensive environme

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Evaluation of Large Language Models via Coupled Token Generation

arXiv:2502.01754v3 Announce Type: replace-cross Abstract: State of the art large language models rely on randomization to respond to a prompt. As an immediate c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control

arXiv:2503.11488v2 Announce Type: replace-cross Abstract: Adaptive traffic signal control (ATSC) is crucial in reducing congestion, maximizing throughput, and i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

KINESIS: Motion Imitation for Human Musculoskeletal Locomotion

arXiv:2503.14637v3 Announce Type: replace-cross Abstract: How do humans move? Advances in reinforcement learning (RL) have produced impressive results in captur

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej

arXiv:2504.03486v2 Announce Type: replace-cross Abstract: Automating legal document drafting can improve efficiency and reduce the burden of manual legal work.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Linguistic Comparison of AI- and Human-Written Responses to Online Mental Health Queries

arXiv:2504.09271v2 Announce Type: replace-cross Abstract: The ubiquity and widespread use of digital and online technologies have transformed mental health supp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Explainable embeddings with Distance Explainer

arXiv:2505.15516v2 Announce Type: replace-cross Abstract: While eXplainable AI (XAI) has advanced significantly, few methods address interpretability in embedde

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

arXiv:2505.16950v4 Announce Type: replace-cross Abstract: Transformer LLMs have been shown to exhibit strong reasoning ability that scales with inference-time c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Reward Is Enough: LLMs Are In-Context Reinforcement Learners

arXiv:2506.06303v5 Announce Type: replace-cross Abstract: Reinforcement learning (RL) is a framework for solving sequential decision-making problems. In this wo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Enhancing Jailbreak Attacks on LLMs via Persona Prompts

arXiv:2507.22171v3 Announce Type: replace-cross Abstract: Jailbreak attacks aim to exploit large language models (LLMs) by inducing them to generate harmful con

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

SafeSieve: From Heuristics to Experience in Progressive Pruning for LLM-based Multi-Agent Communication

arXiv:2508.11733v3 Announce Type: replace-cross Abstract: LLM-based multi-agent systems exhibit strong collaborative capabilities but often suffer from redundan

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

PromptLoop: Plug-and-Play Prompt Refinement via Latent Feedback for Diffusion Model Alignment

arXiv:2510.00430v2 Announce Type: replace-cross Abstract: Despite recent progress, reinforcement learning (RL)-based fine-tuning of diffusion models often strug

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Imperative to Declarative: Towards LLM-friendly OS Interfaces for Boosted Computer-Use Agents

arXiv:2510.04607v2 Announce Type: replace-cross Abstract: Computer-use agents (CUAs) powered by large language models (LLMs) have emerged as a promising approac

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

You only need 4 extra tokens: Synergistic Test-time Adaptation for LLMs

arXiv:2510.10223v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly deployed in specialized domains such as finance, medicin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Prompts to Packets: A View from the Network on ChatGPT, Copilot, and Gemini

arXiv:2510.11269v2 Announce Type: replace-cross Abstract: GenAI chatbots are now pervasive in digital ecosystems, fundamentally reshaping user interactions over

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Beyond Multi-Token Prediction: Pretraining LLMs with Future Summaries

arXiv:2510.14751v2 Announce Type: replace-cross Abstract: Next-token prediction (NTP) has driven the success of large language models (LLMs), but it struggles w

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

OffSim: Offline Simulator for Model-based Offline Inverse Reinforcement Learning

arXiv:2510.15495v2 Announce Type: replace-cross Abstract: Reinforcement learning algorithms typically utilize an interactive simulator (i.e., environment) with

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Is Multilingual LLM Watermarking Truly Multilingual? Scaling Robustness to 100+ Languages via Back-Translation

arXiv:2510.18019v2 Announce Type: replace-cross Abstract: Multilingual watermarking aims to make large language model (LLM) outputs traceable across languages,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations

arXiv:2511.06767v2 Announce Type: replace-cross Abstract: Transformer-based models have revolutionized computer vision (CV) and natural language processing (NLP

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Uncertainty Makes It Stable: Curiosity-Driven Quantized Mixture-of-Experts

arXiv:2511.11743v3 Announce Type: replace-cross Abstract: Deploying deep neural networks on resource-constrained devices faces two critical challenges: maintain

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Reward Engineering for Spatial Epidemic Simulations: A Reinforcement Learning Platform for Individual Behavioral Learning

arXiv:2511.18000v2 Announce Type: replace-cross Abstract: We present ContagionRL, a Gymnasium-compatible reinforcement learning platform specifically designed f

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image Generation

arXiv:2511.18281v3 Announce Type: replace-cross Abstract: Diffusion models (DMs) produce high-quality images, yet their sampling remains costly when adapted to

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

E0: Enhancing Generalization and Fine-Grained Control in VLA Models via Tweedie Discrete Diffusion

arXiv:2511.21542v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models offer a unified framework for robotic manipulation by integrating

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Goal-Oriented Multi-Agent Semantic Networking: Unifying Intents, Semantics, and Intelligence

arXiv:2512.01035v2 Announce Type: replace-cross Abstract: 6G services are evolving toward goal-oriented and AI-native communication, which are expected to deliv

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

From Panel to Pixel: Zoom-In Vision-Language Pretraining from Biomedical Scientific Literature

arXiv:2512.02566v2 Announce Type: replace-cross Abstract: There is a growing interest in developing strong biomedical vision-language models. A popular approach

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding

arXiv:2512.04000v2 Announce Type: replace-cross Abstract: The application of Large Multimodal Models (LMMs) to long-form video understanding is constrained by l

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Collaborative Causal Sensemaking: Closing the Complementarity Gap in Human-AI Decision Support

arXiv:2512.07801v5 Announce Type: replace-cross Abstract: LLM-based agents are increasingly deployed for expert decision support, yet human-AI teams in high-sta

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

ODMA: On-Demand Memory Allocation Strategy for LLM Serving on LPDDR-Class Accelerators

arXiv:2512.09427v3 Announce Type: replace-cross Abstract: Existing memory management techniques severely hinder efficient Large Language Model serving on accele

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Physics-driven human-like working memory outperforms digital networks in dynamic vision

arXiv:2512.15829v3 Announce Type: replace-cross Abstract: While the unsustainable energy cost of artificial intelligence necessitates physics-driven computing,

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

Deep Neural Networks as Discrete Dynamical Systems: Implications for Physics-Informed Learning

arXiv:2601.00473v2 Announce Type: replace-cross Abstract: We revisit the analogy between feed-forward deep neural networks (DNNs) and discrete dynamical systems

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection

arXiv:2601.09195v2 Announce Type: replace-cross Abstract: Supervised fine-tuning (SFT) is a fundamental post-training strategy to align Large Language Models (L

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago

DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset

arXiv:2601.10305v3 Announce Type: replace-cross Abstract: Vision-Language Pre-training (VLP) models have achieved remarkable success by leveraging large-scale i