Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

51,159 lessons
Skills in this topic
LLM Foundations (beginner): Explain how transformers generate text
Prompt Craft (beginner): Write zero-shot and few-shot prompts
LLM Engineering (intermediate): Call LLM APIs with function/tool use
Fine-tuning LLMs (advanced): Prepare fine-tuning datasets
Multimodal LLMs (advanced): Use GPT-4V / Claude Vision for image understanding
All Reads (29,688) | Articles (12,625) | Blog Posts (5,609) | Tutorials (2,350) | Research Papers (8,231) | News (873)
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
GUICrafter: Weakly-Supervised GUI Agent Leveraging Massive Unannotated Screenshots
arXiv:2606.29705v1 Announce Type: new Abstract: Data, as the fundamental substrate of modern intelligence, has greatly driven the development of current foundat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
DEEPMED Search: An Open-Source Agentic Platform for Medical Deep Research with Introspective Verification
arXiv:2606.29746v1 Announce Type: new Abstract: Navigating the deluge of heterogeneous medical data, from academic literature (PubMed) to clinical guidelines (W
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Rethinking Generative Reconstruction Attacks against Graph Neural Network Models
arXiv:2606.29748v1 Announce Type: new Abstract: The application of graph data in numerous disciplines raises the need for gathering and analyzing huge volumes o
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
CLQT: A Closed-Loop, Cost-Aware, Strategy-Consistent Benchmark for Diagnostic Evaluation of LLM Portfolio-Management Agents
arXiv:2606.29771v1 Announce Type: new Abstract: LLM agents are increasingly cast as autonomous portfolio managers, and benchmarks have moved from financial ques
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
The CRISTAL Method: Neurosymbolic analysis from AI-synthesized world models
arXiv:2606.29799v1 Announce Type: new Abstract: This project introduces the CRISTAL Method (Coherent Reliable Intentional Synthesis of Truthful Analysis Logic),
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Beyond Triplet Plausibility: Relation Set Completion in Knowledge Graphs
arXiv:2606.29860v2 Announce Type: new Abstract: Knowledge graphs (KGs) organize real-world knowledge as triplets and underpin many downstream applications. Due
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
AI Training Manager: Bounded Closed-Loop Control of Adaptive Training Recipes
arXiv:2606.29871v1 Announce Type: new Abstract: We present the AI Training Manager, a bounded LLM-based supervisory controller for adaptive machine learning tra
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
HippoSpark: An On-Demand Experience System for LLM Reasoning
arXiv:2606.29929v1 Announce Type: new Abstract: Distilling historical trajectories into reusable experience to enhance future problem-solving has become a focal
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
First-Order Temporal Logic Tensor Networks
arXiv:2606.29972v1 Announce Type: new Abstract: Most of the existing neuro-symbolic AI methods focus on the scenario of static knowledge where objects do not ch
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Exploration and Online Transfer with Behavioral Foundation Models
arXiv:2606.29980v2 Announce Type: new Abstract: Zero-shot Transfer in Reinforcement Learning (RL) aims to train an agent that can generate optimal policies for
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Be Faithful When Response: Returning Fluent and Grounded Answers for Vision-Language Models Reinforcement Learning
arXiv:2606.29984v1 Announce Type: new Abstract: Reinforcement Learning (RL) is an important paradigm for improving the reasoning capabilities of Vision-Language
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Structural Certification for Reliable Physical Design with Language Models
arXiv:2606.30107v1 Announce Type: new Abstract: An unreliable language model can be made to produce reliable physical designs if the authority to assert is move
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Open Problems in Constitutional Preference Reconstruction
arXiv:2606.30116v1 Announce Type: new Abstract: Pairwise preference data is widely used for training and evaluating language models (e.g., RLHF), but each datap
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Does Verbose Chain-of-Thought Really Help? In-Distribution Evidence that Content, Not Length, Matters
arXiv:2606.30128v1 Announce Type: new Abstract: Chain-of-thought (CoT) prompting improves LLM reasoning, but the source is contested: do the intermediate steps
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Relevance Is Not Permission: Warranted Attention for Value Contributions
arXiv:2606.30139v1 Announce Type: new Abstract: Relevance is not permission. Attention lets a model read key-value items related to the current query, but it do
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
FacePlex: Full-Duplex Joint Speech-Facial Motion Generation for Conversational Avatars
arXiv:2606.30145v1 Announce Type: new Abstract: Natural face-to-face conversation requires real-time speech generation together with synchronized facial motion.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Dynamo: Dynamic Skill-Tool Evolution for Vision-Language Agents
arXiv:2606.30185v1 Announce Type: new Abstract: Improving vision-language models (VLMs) on visual reasoning typically requires retraining or hand-designed promp
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
EvalSafetyGap: A Hybrid Survey and Conceptual Framework for LLM Evaluation-Safety Failures
arXiv:2606.30219v1 Announce Type: new Abstract: LLM evaluation and AI safety face a shared measurement problem: benchmark scores, reward-model signals, and repo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Inoculation Adapters: Improved Selective Generalization of Capabilities with Fewer Surprising Backdoors
arXiv:2606.30252v1 Announce Type: new Abstract: Inoculation prompting is a selective generalization technique used against Emergent Misalignment. We introduce i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
EMPATH: A Multilingual Auditor-Judge Benchmark for Safety Evaluation of Emotional-Support Chatbots
arXiv:2606.30256v1 Announce Type: new Abstract: Safety benchmarks often buy scalability by fixing the prompt, the language, and the turn structure. For emotiona
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
PromptGNN-sim: Deep Fusion and Alignment of GNN and LLMs for Text-Attributed Graph Learning
arXiv:2606.30291v1 Announce Type: new Abstract: Text-Attributed Graphs (TAGs) combine textual semantics with graph structure and are central to many graph learn
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
ManimAgent: Self-Evolving Multimodal Agents for Visual Education
arXiv:2606.30296v1 Announce Type: new Abstract: Multi-round reflection lets agents built on large language models recover from failures within a single task, bu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
BayesEvolve: Explicit Belief States for Autonomous Scientific Discovery
arXiv:2606.30335v1 Announce Type: new Abstract: Autonomous scientific discovery systems increasingly use large language models (LLMs) to propose new hypotheses,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Whose Side Is Your Agent On? Multi-Party Principal Loyalty in LLM Agents
arXiv:2606.30383v1 Announce Type: new Abstract: A rapidly growing class of LLM agents is multi-party: the agent acts for a principal (who briefs it, sends follo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
The FIL Hypothesis: Inductive Biases Help with Kernel Engineering
arXiv:2606.30442v1 Announce Type: new Abstract: The Bitter Lesson, which posits that general-purpose methods that scale with computation and data ultimately out
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Self-Evolving World Models for LLM Agent Planning
arXiv:2606.30639v1 Announce Type: new Abstract: World models offer a principled way to equip long-horizon LLM agents with foresight: predictions of action conse
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
$M^3 QuestionIng$: Multi-modal Multi-span Medical Question Answering
arXiv:2606.28329v1 Announce Type: cross Abstract: The growing adoption of AI in healthcare, particularly in preventive care, highlights the critical need for ac
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
High-Dimensional Concentration and Retrieval Instability in Embedding Spaces: Implications for Retrieval-Augmented Generation
arXiv:2606.28330v1 Announce Type: cross Abstract: Embedding-based retrieval systems rely on the assumption that geometric proximity in high-dimensional represent
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
When Medical Safety Alignment Fails: A Benchmark for Evaluating LLMs on High-Risk Medical Queries
arXiv:2606.28332v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly used for medical and health-related questions, yet their safety
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Insidious by Design: Implications of Large Language Model algorithmic bias for the Global South
arXiv:2606.28333v1 Announce Type: cross Abstract: The biases in Large Language Models' (LLMs) outputs remain inadequately theorised, particularly
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
HyBIRD: Hyperbolic Bridge Retrieval and Diagnosis for Methodology Inspiration Retrieval
arXiv:2606.28336v1 Announce Type: cross Abstract: Methodology Inspiration Retrieval (MIR) asks a system to retrieve prior papers whose methods can inspire a new
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
A Systems-Level Analysis of Sensitivity, Robustness, and Stability in Retrieval-Augmented Generation
arXiv:2606.28337v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) systems are often evaluated using final answer accuracy, even though thei
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
The Crowded Embedding Space: A Mean-Field Mechanism for Emergent Marginalization in Retrieval-Augmented Agents
arXiv:2606.28343v1 Announce Type: cross Abstract: Retrieval-augmented generative agents rely on retrieval for grounding, yet are typically evaluated on a query-
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
PIXELRAG: Web Screenshots Beat Text for Retrieval-Augmented Generation
arXiv:2606.28344v1 Announce Type: cross Abstract: Augmenting large language models (LLMs) with retrieved web text has become a dominant paradigm, yet the web is
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Auditing LLM-Governed Social Robots with Culture-Specific Moral Gradients
arXiv:2606.28345v1 Announce Type: cross Abstract: LLM-governed social robots increasingly decide who receives real-world assistance first. As prioritization nor
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
HMARS: A Hierarchical Multi-Agent Memory System for Long-Context Reasoning
arXiv:2606.28349v1 Announce Type: cross Abstract: Long-context reasoning requires models to access, retrieve, and integrate evidence scattered across documents,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
ReasonRec: A Reasoning-Augmented Multimodal Agent for Unified Recommendation
arXiv:2606.28357v1 Announce Type: cross Abstract: Recent advances in multimodal recommenders excel at feature fusion but remain opaque and inefficient decision-
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
How Do LLMs Cite? A Mechanistic Interpretation of Attribution in Retrieval-Augmented Generation
arXiv:2606.28358v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) aims to enhance the trustworthiness of Large Language Models (LLMs) by gr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Carolina Guide: A Multi-Agent RAG System with Institutional Guardrails for Academic Policy Assistance
arXiv:2606.28360v1 Announce Type: cross Abstract: University students often struggle to navigate complex academic policies, leading to advising bottlenecks and
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
ConCise: Training-Free Conclusion-Chain State Compression for Cost-Efficient Multi-Step RAG Services
arXiv:2606.28361v1 Announce Type: cross Abstract: Multi-step retrieval-augmented generation (RAG) has been widely deployed as LLM-powered web services for compl
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
LUMEN: Cost-Transparent Multi-Agent Pipeline for Automated Systematic Review and Meta-Analysis
arXiv:2606.28362v1 Announce Type: cross Abstract: Systematic reviews and meta-analyses (SR/MA) remain the gold standard for evidence synthesis, yet completing o
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
meta-pipe: An LLM-agent pipeline for end-to-end automated systematic review and meta-analysis
arXiv:2606.28363v1 Announce Type: cross Abstract: Objective: To describe the architecture and design rationale of meta-pipe, an open-source large language model
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
CAMI: Cost-Aware Agent-Guided Multi-Indexing for Semantic Retrieval
arXiv:2606.28365v1 Announce Type: cross Abstract: RAG ingestion pipelines frequently augment search corpus index with semantic enrichment indices (e.g., synthet
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Beyond the Reranker: Do RAG Retrieval Enhancements Help Once a Strong Reranker Is Present?
arXiv:2606.28367v1 Announce Type: cross Abstract: Retrieval-augmented generation (RAG) is routinely extended with methods meant to improve retrieval: query expa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Multimodal and Multiscale Spatial-Temporal Semantic Search and Recommendation with AI Foundation Models
arXiv:2606.28369v1 Announce Type: cross Abstract: Semantic search and recommendation of similar documents, such as news and reports about unusual environmental
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Conversational Query Engine for Mixed-Modality Heterogeneous Enterprise Data Sources
arXiv:2606.28370v1 Announce Type: cross Abstract: Enterprise business intelligence queries span structured warehouses and unstructured document repositories --
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
Model Merging to Evolution: Parameter Space Exploration for Expert Models
arXiv:2606.28373v1 Announce Type: cross Abstract: Model merging integrates the capabilities of multiple expert models to create strong models for multiple tasks
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4d ago
When Does Overlap Help? OSU-Mem and a Cell-Conditional Analysis of Trajectory Memory for LLM Agents
arXiv:2606.28376v1 Announce Type: cross Abstract: Long-horizon large language model (LLM) agents accumulate interaction trajectories that quickly exceed any pra