Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

51,318
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
All Reads (29,839) Articles (12693)Blog Posts (5644)Tutorials (2396)Research Papers (8232)News (874)
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
meta-pipe: An LLM-agent pipeline for end-to-end automated systematic review and meta-analysis
arXiv:2606.28363v1 Announce Type: cross Abstract: Objective: To describe the architecture and design rationale of meta-pipe, an open-source large language model
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
CAMI: Cost-Aware Agent-Guided Multi-Indexing for Semantic Retrieval
arXiv:2606.28365v1 Announce Type: cross Abstract: RAG ingestion pipelines frequently augment search corpus index with semantic enrichment indices (e.g., synthet
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Beyond the Reranker: Do RAG Retrieval Enhancements Help Once a Strong Reranker Is Present?
arXiv:2606.28367v1 Announce Type: cross Abstract: Retrieval-augmented generation (RAG) is routinely extended with methods meant to improve retrieval: query expa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Multimodal and Multiscale Spatial-Temporal Semantic Search and Recommendation with AI Foundation Models
arXiv:2606.28369v1 Announce Type: cross Abstract: Semantic search and recommendation of similar documents, such as news and reports about unusual environmental
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Conversational Query Engine for Mixed-Modality Heterogeneous Enterprise Data Sources
arXiv:2606.28370v1 Announce Type: cross Abstract: Enterprise business intelligence queries span structured warehouses and unstructured document repositories --
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Model Merging to Evolution: Parameter Space Exploration for Expert Models
arXiv:2606.28373v1 Announce Type: cross Abstract: Model merging integrates the capabilities of multiple expert models to create strong models for multiple tasks
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
When Does Overlap Help? OSU-Mem and a Cell-Conditional Analysis of Trajectory Memory for LLM Agents
arXiv:2606.28376v1 Announce Type: cross Abstract: Long-horizon large language model (LLM) agents accumulate interaction trajectories that quickly exceed any pra
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Memory-Augmented LSTM Autoencoder for Unsupervised Activity Recognition with IMU Sensor Fusion
arXiv:2606.28377v1 Announce Type: cross Abstract: HAR using Inertial Measurement Unit (IMU) sensors is vital for healthcare monitoring and rehabilitation. Despi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Schema-First Retrieval: Embedding Catalogs for Natural Language Analytics
arXiv:2606.28387v1 Announce Type: cross Abstract: Enterprise text-to-SQL systems often fail before SQL is generated: the model receives the wrong schema context
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Few-class Fidelity: Evaluating Explanations of Real-conditions CNN classifiers with Optimized Perturbations
arXiv:2606.28391v1 Announce Type: cross Abstract: The wide use of Convolutional Neural Networks (CNN) in numerous domains and real-world classification applicat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
RADIANT-PET: Reasoning-Augmented PET/CT Lesion Segmentation with Large Language Models and Reinforcement Learning
arXiv:2606.28392v1 Announce Type: cross Abstract: Accurate lesion segmentation in PET/CT is critical for oncology, yet remains challenging because physiologic t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
CLOSER-VLN: Closed-Loop Self-Verified Retrieval-Augmented Reasoning for Aerial Vision-Language Navigation
arXiv:2606.28397v1 Announce Type: cross Abstract: Vision-language navigation (VLN) has recently advanced with large language and multimodal models, enabling age
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Evidence-Driven LLM Agent for C-to-Synthesizable-C Conversion and Verification
arXiv:2606.28409v1 Announce Type: cross Abstract: Software-compilable C programs routinely fail to complete the four-stage pipeline of a high-level synthesis (H
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
RSGPNet: Geometric Prompting for Remote Sensing Open-Vocabulary Semantic Segmentation
arXiv:2606.28410v1 Announce Type: cross Abstract: Open-vocabulary semantic segmentation (OVSS) enables text-guided segmentation of unseen objects, breaking fixe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
JuZhou 1.0 Technical Report: The First Edge-Native Text-to-Image Foundation Model Trained Entirely on China-Developed AI Accelerators
arXiv:2606.28421v1 Announce Type: cross Abstract: Text-to-image (T2I) diffusion models typically require substantial computational resources and cloud infrastru
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Tool Use Enables Undetectable Steganography in Multi-Agent LLM Systems
arXiv:2606.28425v1 Announce Type: cross Abstract: Increasingly autonomous agentic AI systems pose novel multi-agent risks, such as secret collusion via covert c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Building to the Test: Coding Agents Deliver What You Check, Not What You Requested
arXiv:2606.28430v1 Announce Type: cross Abstract: Benchmarks are widely used to evaluate task completion by Large Language Models (LLMs), but this approach has
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
When AI Reviews Its Own Code: Recursive Self-Training Collapse in Code LLMs
arXiv:2606.28438v1 Announce Type: cross Abstract: Recursive self-training can degrade neural generative models when generated data is reused without fresh human
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Learning to Distributedly Estimate under Partially Known Dynamics: A Covariance-Agnostic Neural Kalman Consensus Filter
arXiv:2606.28441v1 Announce Type: cross Abstract: Online latent state estimation constitutes a fundamental challenge within the artificial intelligence field, s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
LoRA-Tuned Large Language Models for Dementia Detection via Multi-View Speech-Derived Features
arXiv:2606.28445v1 Announce Type: cross Abstract: Early detection of dementia enables timely intervention, and reflecting cognitive impairment, spontaneous spee
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
SemFlowRAG: Directed Semantic Flow from Abstraction to Evidence for Complex Reasoning
arXiv:2606.28447v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) enhanced by Knowledge Graphs has shown promise in complex multi-hop reaso
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
LLM agents security duality: a comprehensive survey of self-security and empowered cybersecurity
arXiv:2606.28450v1 Announce Type: cross Abstract: Large language model (LLM) agents are rapidly being integrated into real-world systems. Their autonomy and too
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Event-Conditioned Diagnostics of Kinematic, Contact, and Object-Permanence Fields in Passive Object-State World Models
arXiv:2606.28455v1 Announce Type: cross Abstract: World models can predict future physical states, but prediction accuracy alone does not explain how physical i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Is Lying an Emergent Behaviour in LLMs? Evidence from Gaslighting AI agents in a Sustainability Game
arXiv:2606.28456v1 Announce Type: cross Abstract: LLMs agents are increasingly used in multi-agent settings, yet their behaviour in sustainability games remains
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
An Agentic AI Pipeline for Appliance-Level Energy Anomaly Detection and LLM-Driven Recommendations
arXiv:2606.28467v1 Announce Type: cross Abstract: Appliance-level energy monitoring in office buildings produces noisy alerts that non-expert facility managers
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Decomposing Memorization Reduction in Privacy-Preserving Fine-Tuning of SLMs for CSIRTs
arXiv:2606.28479v1 Announce Type: cross Abstract: CSIRTs increasingly fine tune language models on vulnerability scan records, but these records expose internal
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
HDDPM: Heteroscedastic Denoising Diffusion Probabilistic Model for Quantitative Low-Count Brain PET Recovery
arXiv:2606.28513v1 Announce Type: cross Abstract: Positron emission tomography (PET) seeks to balance diagnostic quality with ra-diation dose. Low-count PET noi
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
A Gravitational Interpretation of Fine-Tuning Reversion
arXiv:2606.28525v1 Announce Type: cross Abstract: Fine-tuning on harmless data can partially undo behaviors acquired earlier in training. Safety can erode under
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
The Speedup Paradox: Rethinking Inference Speed-Quality Trade-off in Embodied Tasks
arXiv:2606.28529v2 Announce Type: cross Abstract: Embodied foundation models have recently been widely used to improve robot generalization and task success rat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
KernelSight-LM: A Kernel-Level LLM Inference Simulator
arXiv:2606.28565v1 Announce Type: cross Abstract: As large language models (LLMs) move into production serving, practitioners must rapidly evaluate inference pe
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Geometric Measurements of the Axiom of Choice in Neural Proof Embeddings
arXiv:2606.28572v1 Announce Type: cross Abstract: The axiom of choice has divided the foundations of mathematics for over a century, but the distinction between
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Correct codes for the wrong reasons? validating LLMs as measurement instruments for theoretical constructs
arXiv:2606.28574v1 Announce Type: cross Abstract: When a large language model (LLM) codes a construct in text as a human annotator would, that agreement makes t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Animation2Code: Evaluating Temporal Visual Reasoning in Video-to-Code Generation
arXiv:2606.28593v1 Announce Type: cross Abstract: While recent vision-language models (VLMs) have achieved significant improvements on static visual-to-code tas
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Neuromorphic Energy-Aware Learning for Adaptive Deep Brain Stimulation
arXiv:2606.28600v1 Announce Type: cross Abstract: Neuromorphic and edge computing research has focused on reducing the inference cost of neural network controll
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Database Context Compression for Text-to-SQL on Real-World Large Databases
arXiv:2606.28601v1 Announce Type: cross Abstract: Recent progress in Text-to-SQL has been driven by stronger language models and prompting strategies, yet perfo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
What LLMs explain is not what they believe: Evaluating explanation sufficiency under models' own input beliefs
arXiv:2606.28615v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed in high-stakes domains, where free-text explanations su
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Constrained Tabular Diffusion for Finance
arXiv:2606.28674v1 Announce Type: cross Abstract: Generative models in finance face the dual challenge of producing realistic data while satisfying strict regul
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Capability Gates Are Not Authorization: Confused-Deputy Failures in LLM Agent Frameworks
arXiv:2606.28679v1 Announce Type: cross Abstract: Tool-using LLM agents increasingly read untrusted content while holding side-effecting tools such as payments,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
SEATauBench: Adapting Tool-Agent-User Evaluation Into Low-Resource Southeast Asian Languages
arXiv:2606.28715v1 Announce Type: cross Abstract: While AI development and evaluation for Southeast Asia (SEA) has grown rapidly, agent capabilities in regional
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
5ting at SemEval-2026 Task 8: Strong End-to-End Multi-Turn RAG via LLM-Based Reranking and Faithfulness Control
arXiv:2606.28737v1 Announce Type: cross Abstract: We introduce 5ting, our system for the SemEval2026 Task 8 (MTRAGEval), which evaluates multi-turn Retrieval Au
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Four Types of LLM Reliance and Their Predictors Among Undergraduate Writers: A Mixed-Methods Study at a Minority-Serving R1 University
arXiv:2606.28749v1 Announce Type: cross Abstract: Although most undergraduates now use large language models (LLMs), a form of generative artificial intelligenc
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Majority Vote Silences Minority Values: Annotator Disagreement at the Hate/Offensive Boundary in HateXplain
arXiv:2606.28772v1 Announce Type: cross Abstract: Hate speech annotation pipelines routinely collapse annotator disagreement into majority vote labels before tr
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Categorizing Mathematical Concepts with LLM Voting Ensembles in Mathswitch
arXiv:2606.28815v1 Announce Type: cross Abstract: Mathswitch is an open-source project that imports mathematical concept records from sources such as Wikidata,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
HARD-KV: Head-Adaptive Regularization for Decoding-time KV Compression
arXiv:2606.28831v1 Announce Type: cross Abstract: Long-context LLM inference faces a fundamental conflict: head-adaptive compression algorithms (e.g., Top-$p$ n
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Fisher-Routed Mixture of Experts for Federated Class-Incremental Learning
arXiv:2606.28835v1 Announce Type: cross Abstract: Federated Learning (FL) emerged as a promising distributed machine learning paradigm. However, extending FL to
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
LAMP: Lean-based Agentic framework with MCP and Proof Repair
arXiv:2606.28841v1 Announce Type: cross Abstract: Large language models are increasingly capable of mathematical reasoning, but the proofs they generate are oft
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
The Heterogeneous Safety Impacts of Benign Multilingual Fine-Tuning
arXiv:2606.28843v1 Announce Type: cross Abstract: Fine-tuning a large language model is a ubiquitous method for enhancing its capability on a specific downstrea
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5d ago
Exploring the Value of Diverse LLM Explanations in Introductory Programming
arXiv:2606.28882v1 Announce Type: cross Abstract: Large Language Models (LLMs) have shown the potential to generate code explanations that surpass those of peer