Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,074 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
URSA: The Universal Research and Scientific Agent
arXiv:2506.22653v2 Announce Type: replace Abstract: Large language models (LLMs) have moved far beyond their initial form as simple chatbots, now carrying out c
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MedGemma Technical Report
arXiv:2507.05201v4 Announce Type: replace Abstract: Artificial intelligence (AI) has significant potential in healthcare applications, but its training and depl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Multiplayer Nash Preference Optimization
arXiv:2509.23102v3 Announce Type: replace Abstract: Reinforcement learning from human feedback (RLHF) has emerged as the standard paradigm for aligning large la
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
arXiv:2509.25454v4 Announce Type: replace Abstract: Although RLVR has become an essential component for developing advanced reasoning skills in language models,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Hypothesis-Driven Feature Manifold Analysis in LLMs via Supervised Multi-Dimensional Scaling
arXiv:2510.01025v2 Announce Type: replace Abstract: The linear representation hypothesis states that language models (LMs) encode concepts as directions in thei
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering
arXiv:2510.07432v2 Announce Type: replace Abstract: Large language models (LLMs) exhibit strong symbolic and compositional reasoning, yet they struggle with tim
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
DRIFT: Decompose, Retrieve, Illustrate, then Formalize Theorems
arXiv:2510.10815v4 Announce Type: replace Abstract: Automating the formalization of mathematical statements for theorem proving remains a major challenge for La
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Toward Virtuous Reinforcement Learning: A Critique and Roadmap
arXiv:2512.04246v2 Announce Type: replace Abstract: This paper critiques common patterns in machine ethics for Reinforcement Learning (RL) and argues for a virt
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
RL-VLA$^3$: A Flexible and Asynchronous Reinforcement Learning Framework for VLA Training
arXiv:2602.05765v2 Announce Type: replace Abstract: Reinforcement learning (RL) has emerged as a critical paradigm for post-training Vision-Language-Action (VLA
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Emergent Introspection in AI is Content-Agnostic
arXiv:2603.05414v2 Announce Type: replace Abstract: Introspection is a foundational cognitive ability, but its mechanism is not well understood. Recent work has
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling
arXiv:2603.21357v2 Announce Type: replace Abstract: LLM agents fail on the majority of real-world tasks -- GPT-4o succeeds on fewer than 15% of WebArena navigat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement
arXiv:2604.01591v2 Announce Type: replace Abstract: We introduce ThinkTwice, a simple two-phase framework that jointly optimizes LLMs to solve reasoning problem
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models
arXiv:2407.14971v3 Announce Type: replace-cross Abstract: Vision-Language Models (VLMs) rely heavily on pretrained vision encoders to support downstream tasks s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
TransAgent: Enhancing LLM-Based Code Translation via Fine-Grained Execution Alignment
arXiv:2409.19894v5 Announce Type: replace-cross Abstract: Code translation transforms code between programming languages while preserving functionality, which i
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Cobblestone: A Divide-and-Conquer Approach for Automating Formal Verification
arXiv:2410.19940v4 Announce Type: replace-cross Abstract: Formal verification using proof assistants, such as Coq, is an effective way of improving software qua
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
arXiv:2410.20791v3 Announce Type: replace-cross Abstract: The rapid expansion of foundation models (FMs), such as large language models (LLMs), has given rise t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models
arXiv:2411.05961v2 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) are central to Visual Question Answering (VQA) systems and are typically
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Retrieval Augmented Time Series Forecasting
arXiv:2411.08249v2 Announce Type: replace-cross Abstract: Retrieval-augmented generation (RAG) is a central component of modern LLM systems, particularly in sce
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ENTER: Event Based Interpretable Reasoning for VideoQA
arXiv:2501.14194v2 Announce Type: replace-cross Abstract: In this paper, we present ENTER, an interpretable Video Question Answering (VideoQA) system based on e
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
arXiv:2502.17421v3 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) can now process extremely long contexts, efficient inference over thes
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Hedging and Non-Affirmation: Quantifying LLM Alignment on Questions of Human Rights
arXiv:2502.19463v2 Announce Type: replace-cross Abstract: Hedging and non-affirmation are behaviors exhibited by large language models (LLMs) that limit the cle
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
NativQA Framework: Enabling LLMs and VLMs with Native, Local, and Everyday Knowledge
arXiv:2504.05995v3 Announce Type: replace-cross Abstract: The rapid progress of large language models (LLMs) raises concerns about cultural bias, fairness, and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Phonetic Perturbations Reveal Tokenizer-Rooted Safety Gaps in LLMs
arXiv:2505.14226v5 Announce Type: replace-cross Abstract: Safety-aligned LLMs remain vulnerable to digital phenomena like textese that introduce non-canonical p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Synthesis of discrete-continuous quantum circuits with multimodal diffusion models
arXiv:2506.01666v3 Announce Type: replace-cross Abstract: Efficiently compiling quantum operations remains a major bottleneck in scaling quantum computing. Toda
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
HeartcareGPT: A Unified Multimodal ECG Suite for Dual Signal-Image Modeling and Understanding
arXiv:2506.05831v4 Announce Type: replace-cross Abstract: Although electrocardiograms (ECG) play a dominant role in cardiovascular diagnosis and treatment, thei
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
A Survey of Continual Reinforcement Learning
arXiv:2506.21872v2 Announce Type: replace-cross Abstract: Reinforcement Learning (RL) is an important machine learning paradigm for solving sequential decision-
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Enhancing Hallucination Detection via Future Context
arXiv:2507.20546v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are widely used to generate plausible text on online platforms, without r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization
arXiv:2509.17183v2 Announce Type: replace-cross Abstract: Alignment plays a crucial role in Large Language Models (LLMs) in aligning with human preferences on a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
A State-Update Prompting Strategy for Efficient and Robust Multi-turn Dialogue
arXiv:2509.17766v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) struggle with information forgetting and inefficiency in long-horizon, mu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Dissecting Transformers: A CLEAR Perspective towards Green AI
arXiv:2510.02810v2 Announce Type: replace-cross Abstract: The rapid adoption of Large Language Models (LLMs) has raised significant environmental concerns. Unli
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Reveal-to-Revise: Explainable Bias-Aware Generative Modeling with Multimodal Attention
arXiv:2510.12957v3 Announce Type: replace-cross Abstract: We present an explainable, bias-aware generative framework that unifies cross-modal attention fusion,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Unlocking the Potential of Diffusion Language Models through Template Infilling
arXiv:2510.13870v2 Announce Type: replace-cross Abstract: Diffusion Language Models (DLMs) have emerged as a promising alternative to Autoregressive Language Mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Knowledge Reasoning Language Model: Unifying Knowledge and Language for Inductive Knowledge Graph Reasoning
arXiv:2510.13909v2 Announce Type: replace-cross Abstract: Inductive Knowledge Graph Reasoning (KGR) aims to discover facts in open-domain KGs containing unknown
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
RLAIF-SPA: Structured AI Feedback for Semantic-Prosodic Alignment in Speech Synthesis
arXiv:2510.14628v2 Announce Type: replace-cross Abstract: Recent advances in Text-To-Speech (TTS) synthesis have achieved near-human speech quality in neutral s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Fairness Evaluation and Inference Level Mitigation in LLMs
arXiv:2510.18914v3 Announce Type: replace-cross Abstract: Large language models often display undesirable behaviors embedded in their internal representations,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Routing-Based Continual Learning for Multimodal Large Language Models
arXiv:2511.01831v3 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) struggle with continual learning, often suffering from catast
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Developing and Evaluating a Large Language Model-Based Automated Feedback System Grounded in Evidence-Centered Design for Supporting Physics Problem Solving
arXiv:2512.10785v2 Announce Type: replace-cross Abstract: Generative AI offers new opportunities for individualized and adaptive learning, e.g., through large l
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Automatic Replication of LLM Mistakes in Medical Conversations
arXiv:2512.20983v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly evaluated in clinical settings using multi-dimensional r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation
arXiv:2512.23994v2 Announce Type: replace-cross Abstract: Text-to-audio-video (T2AV) generation is central to applications such as filmmaking and world modeling
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
arXiv:2601.02978v2 Announce Type: replace-cross Abstract: Recent work in Mechanistic Interpretability (MI) has enabled the identification and intervention of in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation
arXiv:2601.03054v4 Announce Type: replace-cross Abstract: Recent research on medical MLLMs has gradually shifted its focus from image-level understanding to fin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency
arXiv:2601.05905v2 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) are increasingly deployed in real-world settings, correctness alone is
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Frame of Reference: Addressing the Challenges of Common Ground Representation in Situational Dialogs
arXiv:2601.09365v2 Announce Type: replace-cross Abstract: Common ground plays a critical role in situated spoken dialogs, where interlocutors must establish and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching
arXiv:2601.11652v2 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) become increasingly accessible to end users, an ever-growing number of
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Why Can't I Open My Drawer? Mitigating Object-Driven Shortcuts in Zero-Shot Compositional Action Recognition
arXiv:2601.16211v2 Announce Type: replace-cross Abstract: Zero-Shot Compositional Action Recognition (ZS-CAR) requires recognizing novel verb-object combination
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities
arXiv:2602.00185v2 Announce Type: replace-cross Abstract: The integration of large language models (LLMs) into materials science offers a transformative opportu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Do Schwartz Higher-Order Values Help Sentence-Level Human Value Detection? A Study of Hierarchical Gating and Calibration
arXiv:2602.00913v3 Announce Type: replace-cross Abstract: Human value detection from single sentences is a sparse, imbalanced multi-label task. We study whether
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs
arXiv:2602.12705v4 Announce Type: replace-cross Abstract: We present MedXIAOHE, a medical vision-language foundation model designed to advance general-purpose m
DeepCamp AI