Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

51,322 lessons
Skills in this topic
LLM Foundations (beginner): Explain how transformers generate text
Prompt Craft (beginner): Write zero-shot and few-shot prompts
LLM Engineering (intermediate): Call LLM APIs with function/tool use
Fine-tuning LLMs (advanced): Prepare fine-tuning datasets
Multimodal LLMs (advanced): Use GPT-4V / Claude Vision for image understanding
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Fisher-Routed Mixture of Experts for Federated Class-Incremental Learning
arXiv:2606.28835v1 Announce Type: cross Abstract: Federated Learning (FL) emerged as a promising distributed machine learning paradigm. However, extending FL to
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
LAMP: Lean-based Agentic framework with MCP and Proof Repair
arXiv:2606.28841v1 Announce Type: cross Abstract: Large language models are increasingly capable of mathematical reasoning, but the proofs they generate are oft
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
The Heterogeneous Safety Impacts of Benign Multilingual Fine-Tuning
arXiv:2606.28843v1 Announce Type: cross Abstract: Fine-tuning a large language model is a ubiquitous method for enhancing its capability on a specific downstrea
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Exploring the Value of Diverse LLM Explanations in Introductory Programming
arXiv:2606.28882v1 Announce Type: cross Abstract: Large Language Models (LLMs) have shown the potential to generate code explanations that surpass those of peer
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Latent Bridges for Multi-Table Question Answering
arXiv:2606.28916v1 Announce Type: cross Abstract: We introduce GRAB, a constructor-encoder-bridge pipeline for table question answering. Our method lifts relati
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
DLR: Zero-Inference-Cost Latent Residuals for Low-Rank Pre-Training
arXiv:2606.28932v1 Announce Type: cross Abstract: Large language models have driven recent progress in language and multimodal AI, yet pre-training them at scal
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Evidence-Based Text-Conditioned 3D CT Synthesis for Ovarian Cancer
arXiv:2606.28980v1 Announce Type: cross Abstract: Ovarian cancer is frequently diagnosed at an advanced stage, making preoperative contrast-enhanced computed to
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Compositional Dynamics in Learning and Mechanics
arXiv:2606.28984v1 Announce Type: cross Abstract: We give a single compositional setting in which gradient-based learning and Hamiltonian-style mechanics appear
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Fine-Tuning General-Purpose Large Language Models for Agricultural Applications: A Reproducible Framework and Evaluation Protocol Based on Qwen3-8B
arXiv:2606.28992v1 Announce Type: cross Abstract: General-purpose large language models (LLMs) have demonstrated strong abilities in open-domain question answeri
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Arbitrary Reduction of Validation Error for AI Decision Tests using Homomorphic AI and Repetition Codes
arXiv:2606.28994v1 Announce Type: cross Abstract: This paper presents new results and breakthrough obtained with the HbHAI techniques (Hash-based Homomorphic Ar
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Reward-Free Code Alignment from Pretrained or Fine-Tuned LLM: Unpacking the Trade-offs for Code Generation
arXiv:2606.28998v1 Announce Type: cross Abstract: Large Language Model (LLM) alignment trains an LLM using preference data to produce outputs that better meet e
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
BERTomelo: Your Portuguese Encoder Best Friend
arXiv:2606.28999v1 Announce Type: cross Abstract: Encoders have become the state of the art for multiple NLP tasks, especially those requiring deep contextual u
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Efficient Spatio-Temporal Grounding with Multimodal Large Models via Second-Level Tracking and RL Verification
arXiv:2606.29023v1 Announce Type: cross Abstract: Spatio-temporal grounding in long videos requires precise temporal localization and robust object tracking con
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
How to Leverage Synthetic Speech for LLM-Based ASR Systems?
arXiv:2606.29031v1 Announce Type: cross Abstract: In regulated domains such as banking and healthcare, where privacy constraints make real speech costly to coll
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
The strength of clinical evidence is recoverable from language model representations but not from their stated grades
arXiv:2606.29034v1 Announce Type: cross Abstract: Large language models (LLMs) increasingly summarize clinical evidence, where a claim's weight depends on how s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Flow Matching in Feature Space for Stochastic World Modeling
arXiv:2606.29059v1 Announce Type: cross Abstract: World modeling requires forecasting uncertain futures while preserving information useful for downstream perce
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
A Comparative Study on Affective Cues in Text Embeddings Across Psychological Emotion Theories
arXiv:2606.29068v1 Announce Type: cross Abstract: Text encoders are known for their utility in natural language processing, as they are able to efficiently comp
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Diff-Based Code Corruption using LLMs for Large-Scale Bugfix Benchmarking
arXiv:2606.29088v2 Announce Type: cross Abstract: There are various benchmarks to evaluate bugfixing capabilities of Large Language Models. However, most widesp
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
AB-RAG: Adaptive Budgeted Retrieval-Augmented Generation for Reliable Question Answering
arXiv:2606.29090v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) has become the standard way to ground large language models in external k
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Statistically Indistinguishable, Operationally Distinct: A Formal Barrier for Tabular Foundation Models
arXiv:2606.29091v1 Announce Type: cross Abstract: Tabular foundation models cannot reason about data produced by running systems without access to the rules tha
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Unified Complex-valued Neural Network: A Magnitude-Phase Computational Model for Event-Driven Neuromorphic Learning
arXiv:2606.29099v1 Announce Type: cross Abstract: Artificial neural networks (ANN) provide accurate continuous-valued representation, whereas spiking neural net
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
LLM Semantic Signaling Game and Mechanism Design: Systematic Blindness, Awareness Shaping, and Mindset Dynamics
arXiv:2606.29113v1 Announce Type: cross Abstract: Large language models (LLMs) increasingly mediate strategic interactions through natural language, making sema
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
CMTFormer: Marrying Transformer with Hierarchical Information Interaction for RGB-Event Object Detection
arXiv:2606.29136v1 Announce Type: cross Abstract: Event cameras capture sparse brightness changes with high temporal resolution and high dynamic range, compensa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
On the Nonlinearity of Learning Rate Scaling for LLM Training
arXiv:2606.29158v1 Announce Type: cross Abstract: Learning-rate transfer can reduce the cost of training large language models: instead of sweeping learning rat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Invariant Reasoning Directions in Latent Trajectories of Language Models
arXiv:2606.29164v1 Announce Type: cross Abstract: Latent reasoning models perform multi-step inference directly in hidden-state space, yet the structure of thes
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
A Multi-Dataset Benchmark for Evaluating LLM Agents in Microservice Failure Diagnosis
arXiv:2606.29193v1 Announce Type: cross Abstract: LLM-based agents are reshaping microservice operations into AgentOps, where benchmarks are key to evaluating f
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
A Hybrid Framework for Song Lyric Annotation Based on Human-LLM Alignment
arXiv:2606.29273v1 Announce Type: cross Abstract: Emotion recognition of song lyrics is a challenging task since lyrics may not necessarily align with the overa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Manufactured Confidence: How Memory Consolidation Turns Hearsay into Confident Facts
arXiv:2606.29279v1 Announce Type: cross Abstract: LLM agents carry conclusions across steps and sessions in compressed memory, and memory products (e.g., mem0,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Deterministic Decisions for High-Stakes AI. A Zero-Egress Pipeline with the Deployability of RAG and the Accuracy of Machine Learning
arXiv:2606.29280v1 Announce Type: cross Abstract: We identify intervention bias as a previously unquantified failure mode of zero-shot large-language-model (LLM
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
AMR: Adaptive Modality Routing for Multimodal Polyglot Speaker Identification
arXiv:2606.29335v1 Announce Type: cross Abstract: Multimodal speaker identification systems face two key challenges in real-world deployment: missing modalities
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Fast Enough to Act: Spatio-Temporal Visual Token Merging for Low-Latency Robotic VLMs and VLAs
arXiv:2606.29350v1 Announce Type: cross Abstract: Vision-language models and vision-language action models endow the robot with unprecedented capabilities. Howe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Solver-Verified Formulation Generation and Selection for Multi-Warehouse Inventory Allocation Using Large Language Models
arXiv:2606.29366v1 Announce Type: cross Abstract: Balance-oriented multi-warehouse inventory allocation is a recurring decision problem in large-scale e-commerc
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
LC-ICL: Label-Guided Contrastive In-Context Learning for Robust Information Extraction
arXiv:2606.29407v1 Announce Type: cross Abstract: There has been increasing interest in exploring the capabilities of advanced large language models (LLMs) in t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
LLMography: Transforming Human-AI Conversations into Traceability, Oversight, and Auditability Indicators
arXiv:2606.29437v1 Announce Type: cross Abstract: The growing use of Large Language Models (LLMs) in education, software engineering, academic writing, and tech
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Closing the Activation-Cone Blind Spot: Response-Time Probing and Unified Defense
arXiv:2606.29441v1 Announce Type: cross Abstract: Inference-time safety methods for large language models have proliferated, yet no systematic comparison exists
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Bridging VideoQA and Video-Guided Agentic Tasks via Generalized Keyframe Extraction
arXiv:2606.29445v1 Announce Type: cross Abstract: Video understanding is a fundamental capability for multimodal intelligence, and recent Multimodal Large Langu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Interpretable Inverse Design of Metal-Organic Frameworks with Large Language Model Agents
arXiv:2606.29459v1 Announce Type: cross Abstract: Inverse design of metal-organic frameworks (MOFs) requires searching a combinatorially vast space where proper
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Rank-Aware Hyperbolic Alignment for Vision-Language Dataset Distillation
arXiv:2606.29464v1 Announce Type: cross Abstract: Vision-language dataset distillation (VLDD) compresses a large image-text paired dataset into a small set of s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
To Reason or to Fabricate: Reasoning Without Shortcuts via Hint-Anchored Pairwise Aggregation
arXiv:2606.29481v1 Announce Type: cross Abstract: While reinforcement learning (RL) significantly enhances LLM reasoning, its efficacy is severely undermined by
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Reported Confidence in LLMs Tracks Commitment More Than Correctness
arXiv:2606.29490v1 Announce Type: cross Abstract: Confidence is an estimate of the probability that a chosen answer is correct. Verbal confidence reports are wi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
The Verbose Context Problem in Medical Records
arXiv:2606.29503v1 Announce Type: cross Abstract: The verbose context problem occurs when structured concepts have token-inefficient textual representations. Th
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
SAKE: Software Architectural Knowledge Evaluation Benchmark for Large Language Models
arXiv:2606.29520v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used as assistants across the software development lifecycle, ye
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
MotionAtlas: Detailed Region Captioning for Motion-Centric Videos
arXiv:2606.29531v1 Announce Type: cross Abstract: We propose MotionAtlas, a system for detailed captioning of motion-centric videos, comprising (1) a dedicated
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
SemJoin: Semantic Join Optimization
arXiv:2606.29532v1 Announce Type: cross Abstract: Integrating unstructured data into relational database systems is increasingly important as demand grows for n
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Em-ergence of the em-dash: a population-level rise in em-dash frequency in medRxiv preprints at the dawn of the large-language-model era
arXiv:2606.29540v1 Announce Type: cross Abstract: Large language models (LLMs) can leave subtle stylistic traces in assisted text; one of the most cited is the
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Coverage-Driven KV Cache Eviction for Efficient and Improved Inference of LLM
arXiv:2606.29563v1 Announce Type: cross Abstract: Large language models (LLMs) excel at complex tasks like question answering and summarization, thanks to their
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
ScAle: Attention Head Scaling as a Minimal Adapter for Spatial Reasoning in Vision Language Models
arXiv:2606.29579v1 Announce Type: cross Abstract: Spatial reasoning remains a persistent challenge for many vision language models (VLMs), and improving it typi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
The Joint Effect of Quantization and Sampling Temperature on LLM Safety Alignment: A Factorial Analysis
arXiv:2606.29581v1 Announce Type: cross Abstract: Modern LLM deployments routinely compress models and raise sampling temperature to reduce cost, latency, or re