Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

25,179
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,690 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Goedel-Code-Prover: Hierarchical Proof Search for Open State-of-the-Art Code Verification
arXiv:2603.19329v1 Announce Type: cross Abstract: Large language models (LLMs) can generate plausible code but offer limited guarantees of correctness. Formally
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
POET: Power-Oriented Evolutionary Tuning for LLM-Based RTL PPA Optimization
arXiv:2603.19333v1 Announce Type: cross Abstract: Applying large language models (LLMs) to RTL code optimization for improved power, performance, and area (PPA)
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Do Post-Training Algorithms Actually Differ? A Controlled Study Across Model Scales Uncovers Scale-Dependent Ranking Inversions
arXiv:2603.19335v1 Announce Type: cross Abstract: Post-training alignment has produced dozens of competing algorithms -- DPO, SimPO, KTO, GRPO, and others -- ye
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Diffusion-Guided Semantic Consistency for Multimodal Heterogeneity
arXiv:2603.19337v1 Announce Type: cross Abstract: Federated learning (FL) is severely challenged by non-independent and identically distributed (non-IID) client
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Spectral Tempering for Embedding Compression in Dense Passage Retrieval
arXiv:2603.19339v1 Announce Type: cross Abstract: Dimensionality reduction is critical for deploying dense retrieval systems at scale, yet mainstream post-hoc m
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Beyond Weighted Summation: Learnable Nonlinear Aggregation Functions for Robust Artificial Neurons
arXiv:2603.19344v1 Announce Type: cross Abstract: Weighted summation has remained the default input aggregation mechanism in artificial neurons since the earlie
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Scalable Prompt Routing via Fine-Grained Latent Task Discovery
arXiv:2603.19415v1 Announce Type: cross Abstract: Prompt routing dynamically selects the most appropriate large language model from a pool of candidates for eac
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Investigating In-Context Privacy Learning by Integrating User-Facing Privacy Tools into Conversational Agents
arXiv:2603.19416v1 Announce Type: cross Abstract: Supporting users in protecting sensitive information when using conversational agents (CAs) is crucial, as use
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
The Autonomy Tax: Defense Training Breaks LLM Agents
arXiv:2603.19423v1 Announce Type: cross Abstract: Large language model (LLM) agents increasingly rely on external tools (file operations, API calls, database tr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Is Evaluation Awareness Just Format Sensitivity? Limitations of Probe-Based Evidence under Controlled Prompt Structure
arXiv:2603.19426v1 Announce Type: cross Abstract: Prior work uses linear probes on benchmark prompts as evidence of evaluation awareness in large language model
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
TrustFlow: Topic-Aware Vector Reputation Propagation for Multi-Agent Ecosystems
arXiv:2603.19452v1 Announce Type: cross Abstract: We introduce TrustFlow, a reputation propagation algorithm that assigns each software agent a multi-dimensiona
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Global Convergence of Multiplicative Updates for the Matrix Mechanism: A Collaborative Proof with Gemini 3
arXiv:2603.19465v1 Announce Type: cross Abstract: We analyze a fixed-point iteration $v \leftarrow \phi(v)$ arising in the optimization of a regularized nuclear
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
A Framework for Formalizing LLM Agent Security
arXiv:2603.19469v1 Announce Type: cross Abstract: Security in LLM agents is inherently contextual. For example, the same action taken by an agent may represent
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL
arXiv:2603.19470v1 Announce Type: cross Abstract: Off-policy problems such as policy staleness and training-inference mismatch, has become a major bottleneck fo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Linear Social Choice with Few Queries: A Moment-Based Approach
arXiv:2603.19510v1 Announce Type: cross Abstract: Most social choice rules assume access to full rankings, while current alignment practice -- despite aiming fo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Inducing Sustained Creativity and Diversity in Large Language Models
arXiv:2603.19519v1 Announce Type: cross Abstract: We address a not-widely-recognized subset of exploratory search, where a user sets out on a typically long "se
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Depictions of Depression in Generative AI Video Models: A Preliminary Study of OpenAI's Sora 2
arXiv:2603.19527v1 Announce Type: cross Abstract: Generative video models are increasingly capable of producing complex depictions of mental health experiences,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Plagiarism or Productivity? Students Moral Disengagement and Behavioral Intentions to Use ChatGPT in Academic Writing
arXiv:2603.19549v1 Announce Type: cross Abstract: This study examined how moral disengagement influences Filipino college students' intention to use ChatGPT in
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Optimal Scalar Quantization for Matrix Multiplication: Closed-Form Density and Phase Transition
arXiv:2603.19559v1 Announce Type: cross Abstract: We study entrywise scalar quantization of two matrices prior to multiplication. Given $A\in R^{m\times k}$ and
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
PFM-VEPAR: Prompting Foundation Models for RGB-Event Camera based Pedestrian Attribute Recognition
arXiv:2603.19565v1 Announce Type: cross Abstract: Event-based pedestrian attribute recognition (PAR) leverages motion cues to enhance RGB cameras in low-light a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
AI Psychosis: Does Conversational AI Amplify Delusion-Related Language?
arXiv:2603.19574v1 Announce Type: cross Abstract: Conversational AI systems are increasingly used for personal reflection and emotional disclosure, raising conc
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Evolving Embodied Intelligence: Graph Neural Network--Driven Co-Design of Morphology and Control in Soft Robotics
arXiv:2603.19582v1 Announce Type: cross Abstract: The intelligent behavior of robots does not emerge solely from control systems, but from the tight coupling be
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Skilled AI Agents for Embedded and IoT Systems Development
arXiv:2603.19583v1 Announce Type: cross Abstract: Large language models (LLMs) and agentic systems have shown promise for automated software development, but ap
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for Mobile Indoor Localization
arXiv:2603.19594v1 Announce Type: cross Abstract: Indoor localization has become increasingly essential for applications ranging from asset tracking to deliveri
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement
arXiv:2603.19608v1 Announce Type: cross Abstract: Fine-grained anomaly detection is crucial in industrial and medical applications, but labeled anomalies are of
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
CAF-Score: Calibrating CLAP with LALMs for Reference-free Audio Captioning Evaluation
arXiv:2603.19615v1 Announce Type: cross Abstract: While Large Audio-Language Models (LALMs) have advanced audio captioning, robust evaluation remains difficult.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management
arXiv:2603.19621v1 Announce Type: cross Abstract: Deep Reinforcement Learning (DRL) provides a general-purpose methodology for training inventory policies that
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
MetaCues: Enabling Critical Engagement with Generative AI for Information Seeking and Sensemaking
arXiv:2603.19634v1 Announce Type: cross Abstract: Generative AI (GenAI) search tools are increasingly used for information seeking, yet their design tends to en
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework
arXiv:2603.19643v1 Announce Type: cross Abstract: Despite the rapid advancement of Virtual Try-On (VTON) and Try-Off (VTOFF) technologies, existing VTON methods
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
PolicySim: An LLM-Based Agent Social Simulation Sandbox for Proactive Policy Optimization
arXiv:2603.19649v1 Announce Type: cross Abstract: Social platforms serve as central hubs for information exchange, where user behaviors and platform interventio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer Inference
arXiv:2603.19664v1 Announce Type: cross Abstract: The key-value (KV) cache is widely treated as essential state in transformer inference, and a large body of wo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
ATHENA: Adaptive Test-Time Steering for Improving Count Fidelity in Diffusion Models
arXiv:2603.19676v1 Announce Type: cross Abstract: Text-to-image diffusion models achieve high visual fidelity but surprisingly exhibit systematic failures in nu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems
arXiv:2603.19677v1 Announce Type: cross Abstract: Large language model (LLM)-based multi-agent systems (MAS) have demonstrated exceptional capabilities in solvi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
AIGQ: An End-to-End Hybrid Generative Architecture for E-commerce Query Recommendation
arXiv:2603.19710v1 Announce Type: cross Abstract: Pre-search query recommendation, widely known as HintQ on Taobao's homepage, plays a vital role in intent capt
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Enhancing Alignment for Unified Multimodal Models via Semantically-Grounded Supervision
arXiv:2603.19807v1 Announce Type: cross Abstract: Unified Multimodal Models (UMMs) have emerged as a promising paradigm that integrates multimodal understanding
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue
arXiv:2603.19849v1 Announce Type: cross Abstract: Do LLMs talk like us? This question intrigues a multitude of scholar and it is relevant in many fields, from e
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Failure Modes for Deep Learning-Based Online Mapping: How to Measure and Address Them
arXiv:2603.19852v1 Announce Type: cross Abstract: Deep learning-based online mapping has emerged as a cornerstone of autonomous driving, yet these models freque
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time
arXiv:2603.19880v1 Announce Type: cross Abstract: Test-Time Reinforcement Learning (TTRL) enables Large Language Models (LLMs) to enhance reasoning capabilities
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Integrating Meta-Features with Knowledge Graph Embeddings for Meta-Learning
arXiv:2603.19888v1 Announce Type: cross Abstract: The vast collection of machine learning records available on the web presents a significant opportunity for me
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
arXiv:2603.19918v1 Announce Type: cross Abstract: Generalized Category Discovery (GCD) seeks to uncover novel categories in unlabeled data while preserving reco
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Trojan's Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance
arXiv:2603.19974v1 Announce Type: cross Abstract: Autonomous coding agents are increasingly integrated into software development workflows, offering capabilitie
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Promoting Critical Thinking With Domain-Specific Generative AI Provocations
arXiv:2603.19975v1 Announce Type: cross Abstract: The evidence on the effects of generative AI (GenAI) on critical thinking is mixed, with studies suggesting bo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
X-World: Controllable Ego-Centric Multi-Camera World Models for Scalable End-to-End Driving
arXiv:2603.19979v1 Announce Type: cross Abstract: Scalable and reliable evaluation is increasingly critical in the end-to-end era of autonomous driving, where v
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States
arXiv:2603.19987v1 Announce Type: cross Abstract: Reinforcement learning (RL) has become a standard paradigm for post-training and aligning Large Language Model
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Physics-Informed Long-Range Coulomb Correction for Machine-learning Hamiltonians
arXiv:2603.20007v1 Announce Type: cross Abstract: Machine-learning electronic Hamiltonians achieve orders-of-magnitude speedups over density-functional theory,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
Detached Skip-Links and $R$-Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR
arXiv:2603.20020v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) excel at high-level reasoning yet fail on OCR tasks where fine-graine
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families
arXiv:2603.20042v1 Announce Type: cross Abstract: Large language models (LLMs) have driven substantial advances in speech language models (SpeechLMs), yielding
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1mo ago
The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries
arXiv:2603.20062v1 Announce Type: cross Abstract: When a traveler asks an AI search engine to recommend a hotel, which sources get cited -- and does query frami