Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,089 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
CritBench: A Framework for Evaluating Cybersecurity Capabilities of Large Language Models in IEC 61850 Digital Substation Environments
arXiv:2604.06019v1 Announce Type: cross Abstract: The advancement of Large Language Models (LLMs) has raised concerns regarding their dual-use potential in cybe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
A Multi-Stage Validation Framework for Trustworthy Large-scale Clinical Information Extraction using Large Language Models
arXiv:2604.06028v1 Announce Type: cross Abstract: Large language models (LLMs) show promise for extracting clinically meaningful information from unstructured h
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Stories of Your Life as Others: A Round-Trip Evaluation of LLM-Generated Life Stories Conditioned on Rich Psychometric Profiles
arXiv:2604.06071v1 Announce Type: cross Abstract: Personality traits are richly encoded in natural language, and large language models (LLMs) trained on human t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning
arXiv:2604.06079v1 Announce Type: cross Abstract: Graphics Program Synthesis is pivotal for interpreting and editing visual data, effectively facilitating the r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LAG-XAI: A Lie-Inspired Affine Geometric Framework for Interpretable Paraphrasing in Transformer Latent Spaces
arXiv:2604.06086v1 Announce Type: cross Abstract: Modern Transformer-based language models achieve strong performance in natural language processing tasks, yet
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Social Dynamics as Critical Vulnerabilities that Undermine Objective Decision-Making in LLM Collectives
arXiv:2604.06091v1 Announce Type: cross Abstract: Large language model (LLM) agents are increasingly acting as human delegates in multi-agent environments, wher
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLM4CodeRE: Generative AI for Code Decompilation Analysis and Reverse Engineering
arXiv:2604.06095v1 Announce Type: cross Abstract: Code decompilation analysis is a fundamental yet challenging task in malware reverse engineering, particularly
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer
arXiv:2604.06129v1 Announce Type: cross Abstract: This paper introduces the Polynomial Mixer (PoM), a novel token mixing mechanism with linear complexity that s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Shot-Based Quantum Encoding: A Data-Loading Paradigm for Quantum Neural Networks
arXiv:2604.06135v1 Announce Type: cross Abstract: Efficient data loading remains a bottleneck for near-term quantum machine-learning. Existing schemes (angle, a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Generating Synthetic Doctor-Patient Conversations for Long-form Audio Summarization
arXiv:2604.06138v1 Announce Type: cross Abstract: Long-context audio reasoning is underserved in both training data and evaluation. Existing benchmarks target s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement
arXiv:2604.06155v1 Announce Type: cross Abstract: Whether Large Language Models (LLMs) develop coherent internal world models remains a core debate. While conve
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control
arXiv:2604.06156v1 Announce Type: cross Abstract: MLLMs have been successfully applied to multimodal embedding tasks, yet their generative reasoning capabilitie
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
In-Place Test-Time Training
arXiv:2604.06169v1 Announce Type: cross Abstract: The static "train then deploy" paradigm fundamentally limits Large Language Models (LLMs) from dynamically ad
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Advancing AI Research Assistants with Expert-Involved Learning
arXiv:2505.04638v5 Announce Type: replace Abstract: Large language models (LLMs) and large multimodal models (LMMs) promise to accelerate biomedical discovery,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Beyond Syntax: Action Semantics Learning for App Agents
arXiv:2506.17697v3 Announce Type: replace Abstract: The recent development of Large Language Models (LLMs) enables the rise of App agents that interpret user in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
URSA: The Universal Research and Scientific Agent
arXiv:2506.22653v2 Announce Type: replace Abstract: Large language models (LLMs) have moved far beyond their initial form as simple chatbots, now carrying out c
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MedGemma Technical Report
arXiv:2507.05201v4 Announce Type: replace Abstract: Artificial intelligence (AI) has significant potential in healthcare applications, but its training and depl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Multiplayer Nash Preference Optimization
arXiv:2509.23102v3 Announce Type: replace Abstract: Reinforcement learning from human feedback (RLHF) has emerged as the standard paradigm for aligning large la
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
arXiv:2509.25454v4 Announce Type: replace Abstract: Although RLVR has become an essential component for developing advanced reasoning skills in language models,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Hypothesis-Driven Feature Manifold Analysis in LLMs via Supervised Multi-Dimensional Scaling
arXiv:2510.01025v2 Announce Type: replace Abstract: The linear representation hypothesis states that language models (LMs) encode concepts as directions in thei
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering
arXiv:2510.07432v2 Announce Type: replace Abstract: Large language models (LLMs) exhibit strong symbolic and compositional reasoning, yet they struggle with tim
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
DRIFT: Decompose, Retrieve, Illustrate, then Formalize Theorems
arXiv:2510.10815v4 Announce Type: replace Abstract: Automating the formalization of mathematical statements for theorem proving remains a major challenge for La
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Toward Virtuous Reinforcement Learning: A Critique and Roadmap
arXiv:2512.04246v2 Announce Type: replace Abstract: This paper critiques common patterns in machine ethics for Reinforcement Learning (RL) and argues for a virt
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
RL-VLA$^3$: A Flexible and Asynchronous Reinforcement Learning Framework for VLA Training
arXiv:2602.05765v2 Announce Type: replace Abstract: Reinforcement learning (RL) has emerged as a critical paradigm for post-training Vision-Language-Action (VLA
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Emergent Introspection in AI is Content-Agnostic
arXiv:2603.05414v2 Announce Type: replace Abstract: Introspection is a foundational cognitive ability, but its mechanism is not well understood. Recent work has
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling
arXiv:2603.21357v2 Announce Type: replace Abstract: LLM agents fail on the majority of real-world tasks -- GPT-4o succeeds on fewer than 15% of WebArena navigat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement
arXiv:2604.01591v2 Announce Type: replace Abstract: We introduce ThinkTwice, a simple two-phase framework that jointly optimizes LLMs to solve reasoning problem
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models
arXiv:2407.14971v3 Announce Type: replace-cross Abstract: Vision-Language Models (VLMs) rely heavily on pretrained vision encoders to support downstream tasks s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
TransAgent: Enhancing LLM-Based Code Translation via Fine-Grained Execution Alignment
arXiv:2409.19894v5 Announce Type: replace-cross Abstract: Code translation transforms code between programming languages while preserving functionality, which i
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Cobblestone: A Divide-and-Conquer Approach for Automating Formal Verification
arXiv:2410.19940v4 Announce Type: replace-cross Abstract: Formal verification using proof assistants, such as Coq, is an effective way of improving software qua
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
arXiv:2410.20791v3 Announce Type: replace-cross Abstract: The rapid expansion of foundation models (FMs), such as large language models (LLMs), has given rise t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models
arXiv:2411.05961v2 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) are central to Visual Question Answering (VQA) systems and are typically
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Retrieval Augmented Time Series Forecasting
arXiv:2411.08249v2 Announce Type: replace-cross Abstract: Retrieval-augmented generation (RAG) is a central component of modern LLM systems, particularly in sce
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ENTER: Event Based Interpretable Reasoning for VideoQA
arXiv:2501.14194v2 Announce Type: replace-cross Abstract: In this paper, we present ENTER, an interpretable Video Question Answering (VideoQA) system based on e
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
arXiv:2502.17421v3 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) can now process extremely long contexts, efficient inference over thes
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Hedging and Non-Affirmation: Quantifying LLM Alignment on Questions of Human Rights
arXiv:2502.19463v2 Announce Type: replace-cross Abstract: Hedging and non-affirmation are behaviors exhibited by large language models (LLMs) that limit the cle
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
NativQA Framework: Enabling LLMs and VLMs with Native, Local, and Everyday Knowledge
arXiv:2504.05995v3 Announce Type: replace-cross Abstract: The rapid progress of large language models (LLMs) raises concerns about cultural bias, fairness, and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Phonetic Perturbations Reveal Tokenizer-Rooted Safety Gaps in LLMs
arXiv:2505.14226v5 Announce Type: replace-cross Abstract: Safety-aligned LLMs remain vulnerable to digital phenomena like textese that introduce non-canonical p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Synthesis of discrete-continuous quantum circuits with multimodal diffusion models
arXiv:2506.01666v3 Announce Type: replace-cross Abstract: Efficiently compiling quantum operations remains a major bottleneck in scaling quantum computing. Toda
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
HeartcareGPT: A Unified Multimodal ECG Suite for Dual Signal-Image Modeling and Understanding
arXiv:2506.05831v4 Announce Type: replace-cross Abstract: Although electrocardiograms (ECG) play a dominant role in cardiovascular diagnosis and treatment, thei
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
A Survey of Continual Reinforcement Learning
arXiv:2506.21872v2 Announce Type: replace-cross Abstract: Reinforcement Learning (RL) is an important machine learning paradigm for solving sequential decision-
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Enhancing Hallucination Detection via Future Context
arXiv:2507.20546v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are widely used to generate plausible text on online platforms, without r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization
arXiv:2509.17183v2 Announce Type: replace-cross Abstract: Alignment plays a crucial role in Large Language Models (LLMs) in aligning with human preferences on a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
A State-Update Prompting Strategy for Efficient and Robust Multi-turn Dialogue
arXiv:2509.17766v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) struggle with information forgetting and inefficiency in long-horizon, mu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Dissecting Transformers: A CLEAR Perspective towards Green AI
arXiv:2510.02810v2 Announce Type: replace-cross Abstract: The rapid adoption of Large Language Models (LLMs) has raised significant environmental concerns. Unli
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Reveal-to-Revise: Explainable Bias-Aware Generative Modeling with Multimodal Attention
arXiv:2510.12957v3 Announce Type: replace-cross Abstract: We present an explainable, bias-aware generative framework that unifies cross-modal attention fusion,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Unlocking the Potential of Diffusion Language Models through Template Infilling
arXiv:2510.13870v2 Announce Type: replace-cross Abstract: Diffusion Language Models (DLMs) have emerged as a promising alternative to Autoregressive Language Mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Knowledge Reasoning Language Model: Unifying Knowledge and Language for Inductive Knowledge Graph Reasoning
arXiv:2510.13909v2 Announce Type: replace-cross Abstract: Inductive Knowledge Graph Reasoning (KGR) aims to discover facts in open-domain KGs containing unknown
DeepCamp AI