Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

50,984 lessons
Skills in this topic
LLM Foundations (beginner): Explain how transformers generate text
Prompt Craft (beginner): Write zero-shot and few-shot prompts
LLM Engineering (intermediate): Call LLM APIs with function/tool use
Fine-tuning LLMs (advanced): Prepare fine-tuning datasets
Multimodal LLMs (advanced): Use GPT-4V / Claude Vision for image understanding
All Reads (29,520) | Articles (12,561) | Blog Posts (5,574) | Tutorials (2,291) | Research Papers (8,224) | News (870)
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Learning Where to Look: A Reinforcement Learning Framework for Robust Micro-Ultrasound Prostate Cancer Detection
arXiv:2606.30951v1 Announce Type: cross Abstract: Micro-ultrasound ($\mu$US) is a new, emerging, and promising imaging modality for prostate cancer (PCa) detect
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Loc2Repair: A Framework for Evaluating the Impact of File-Level Issue Localization in Repo-Level LLM Repair
arXiv:2606.30963v1 Announce Type: cross Abstract: Repository-grounded automated repair is often reported as a single end-to-end capability, which hides distinct
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Wait, am I Being Fair? Characterizing Deductive Stereotyping and Mitigating It with Fair-GCG
arXiv:2606.30989v1 Announce Type: cross Abstract: Warning: This paper contains several toxic and offensive statements. While reasoning generally improves fairne
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
OTCache: Optimal Transport for Geometry-Aware Caching in Diffusion Models
arXiv:2606.31026v1 Announce Type: cross Abstract: We propose OTCache, a training-free framework for accelerating diffusion sampling via caching schedule predict
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
LLM-Driven Personalities for Decision Making in Emergency Simulations
arXiv:2606.31038v1 Announce Type: cross Abstract: For virtual humans to appear believable, they must exhibit agency and spatial awareness while interacting with
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Knowledge Distillation from Large Reasoning Models to Compact Student Models: A Case Study on the John O Bryan Mathematics Competition
arXiv:2606.31048v1 Announce Type: cross Abstract: This paper investigates knowledge distillation from a large reasoning model (DeepSeek-R1) to a compact student
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
ADAPT: Attention Dynamics Alignment with Preference Tuning for Faithful MLLMs
arXiv:2606.31054v1 Announce Type: cross Abstract: Multimodal Large Language Models (MLLMs) are critically hampered by hallucination, generating content inconsis
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
When Reranking Hurts: Uncertainty-Based Gating for Few-Shot Reranking
arXiv:2606.31087v1 Announce Type: cross Abstract: Few-shot selection typically assumes that reranking retrieved examples always improves performance. We challen
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
LLM-Powered Interactive Robotic Action Synthesis from Multimodal Speech, Gestures, and Music
arXiv:2606.31158v1 Announce Type: cross Abstract: The quest for intuitive and natural human-robot interaction (HRI) remains a significant challenge in robotics.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
ComplianceGate: Classifier-Gated Multi-Tier LLM Routing for Inference in Regulated Industries
arXiv:2606.31163v1 Announce Type: cross Abstract: Large language models deployed in regulated industries operate under two constraints: compliance enforcement a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Transformers as Bayesian In-Context Experimenters: Smoothness-Adaptive Efficient ATE Estimation
arXiv:2606.31184v1 Announce Type: cross Abstract: Adaptive experiments for average treatment effects (ATE) require randomized allocations balancing valid infere
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Gated Multi-Graph Fusion via Graph Attention Networks for Alzheimer's Disease Detection
arXiv:2606.31186v1 Announce Type: cross Abstract: Spontaneous speech is a vital non-invasive biomarker for Alzheimer's Disease (AD), yet many systems overlook n
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Can LLMs Imagine Moral Alternatives Beyond Binary Dilemmas?
arXiv:2606.31213v1 Announce Type: cross Abstract: As large language models (LLMs) are increasingly deployed as moral advisors and agents, they need to address d
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Probing Stylistic Appropriation using Large Language Models: An Evaluation Framework for Copyright Infringement under EU Law
arXiv:2606.31250v1 Announce Type: cross Abstract: Large language models (LLM) trained on web-scale corpora generate output that may infringe copyright, yet exis
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Learning from Failure: Inference-Time Self-Improvement for Computer-Use Agents
arXiv:2606.31270v1 Announce Type: cross Abstract: Computer-use agents, which leverage multimodal large language models (MLLMs) to operate computers and complete
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
CLIMB: Centroid-Based Hierarchical Memory for Online Continual Self-Supervised Learning
arXiv:2606.31275v1 Announce Type: cross Abstract: Online Continual Self-Supervised Learning (OCSSL) aims to learn representations from a continuous stream of un
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Minimizing Quantized Semantic Age of Information (QSAoI) in Foundation Model-Based Semantic Communications
arXiv:2606.31303v1 Announce Type: cross Abstract: The emerging techniques of semantic communications and edge computing in 6G networks necessitate a paradigm sh
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
CSO-LLM: Class Subspace Orthogonalization for Post-Training Backdoor Detection and Trigger Inversion in LLMs
arXiv:2606.31309v1 Announce Type: cross Abstract: While post-training backdoor detection and trigger inversion schemes have been developed for AIs used e.g. for
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Calibrating the Evaluator: Does Probability Calibration Mitigate Preference Coupling in LLM Agent Feedback Loops?
arXiv:2606.31371v1 Announce Type: cross Abstract: When large language model (LLM) agents adapt their behavior through evaluator feedback, systematic evaluator b
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Stage-Transition Dense Reward Modeling for Reinforcement Learning
arXiv:2606.31377v1 Announce Type: cross Abstract: Reinforcement learning for long-horizon robotic manipulation is often limited by sparse and delayed rewards, w
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Mixture-of-Control: State-Aware Fine-Tuning for Transformer-based Models
arXiv:2606.31397v1 Announce Type: cross Abstract: State-based fine-tuning has emerged as a compelling alternative to weight-based adaptation for transformers, u
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Visual Semantic Entropy: Do Vision Language Models Recognize Visual Ambiguity?
arXiv:2606.31407v1 Announce Type: cross Abstract: Vision-language models can produce confident answers on visually ambiguous inputs, resulting in biased predict
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
DA-Studio: An Agentic System for End-to-End Data Analysis
arXiv:2606.31423v1 Announce Type: cross Abstract: Real-world data analysis is a multi-step process over heterogeneous inputs rather than merely producing a fina
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
UniTac: A Unified Multimodal Model for Cross-Sensor Tactile Understanding and Generation
arXiv:2606.31451v1 Announce Type: cross Abstract: Unified multimodal models (UMMs) have shown great promise in integrating understanding and generation across d
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Team MKC at CLPsych 2026: Capturing and Characterizing Mental Health Changes through Social Media Timeline Dynamics
arXiv:2606.31464v1 Announce Type: cross Abstract: Recent advances in Large Language Models (LLMs) have motivated their adoption across a wide range of domains,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
FinPersona-Bench: A Benchmark for Longitudinal Psychometric Stability of Autonomous Financial Agents
arXiv:2606.31522v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed as autonomous financial agents initialized with explici
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
On the Convergence of Self-Improving Online LLM Alignment
arXiv:2606.31524v1 Announce Type: cross Abstract: The Self-Improving Alignment (SAIL) algorithm addresses distribution shift by reducing a bilevel formulation o
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Mitigating Positional Leakage in 3D Masked Autoencoders for Robust Representation Learning
arXiv:2606.31570v1 Announce Type: cross Abstract: Masked autoencoding has emerged as a prominent paradigm for self-supervised learning on 3D point clouds, achie
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
ZEBRA: Zero-Shot Entropy-Regularized Prompt Learning for Base-to-Novel Generalization in Audio-Language Models
arXiv:2606.31587v1 Announce Type: cross Abstract: Audio-Language Models (ALMs) achieve strong zero-shot performance by aligning audio with textual class descrip
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Evil Spectra: How Optimisers can Amplify or Suppress Emergent Misalignment
arXiv:2606.31591v1 Announce Type: cross Abstract: Emergent misalignment (EM) is a recently discovered phenomenon in LLMs where fine-tuning on a narrow misaligne
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Token-Sparse Medical Multimodal Reasoning via Dual-Stream Reinforcement Learning
arXiv:2606.31599v1 Announce Type: cross Abstract: Vision-language models (VLMs) combining reinforcement learning (RL) ignite remarkable progress in multimodal r
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Automating Cause-Effect Specification with Knowledge Graphs and Large Language Models
arXiv:2606.31614v1 Announce Type: cross Abstract: Engineering specifications such as interlocks, alarm rationalization tables, and cause-and-effect (C&E) matric
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
A Tutorial on Autonomous Fault-Tolerant Control Using Knowledge-Grounded LLM Agents
arXiv:2606.31635v1 Announce Type: cross Abstract: Fault recovery in process plants still relies heavily on plant operators, especially when faults fall outside
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
A Lifecycle and Application-Stack Survey of Large Language Model Vulnerabilities: Attacks, Risks, Defenses, and Open Problems
arXiv:2606.31639v1 Announce Type: cross Abstract: Large language models are no longer only text generators. They are increasingly embedded in retrieval pipeline
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
ShopX: A Foundation Model for Intent-to-Item Fulfillment in Agentic Shopping
arXiv:2606.31693v1 Announce Type: cross Abstract: The wave of AI-native applications is moving shopping beyond page- and feed-based browsing toward intent-drive
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Look But Don't Touch with Sparse Autoencoders for Unlearning in Diffusion Models
arXiv:2606.31699v1 Announce Type: cross Abstract: Sparse autoencoders (SAEs) have recently been proposed as interpretable tools for concept-level manipulation,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Cross-lingual Relation Extraction with Large Language Models: Zero-Shot, Few-Shot, and Fine-Tuned Evaluation on Romanian
arXiv:2606.31718v1 Announce Type: cross Abstract: Relation extraction (RE) for low-resource languages is typically constrained by the lack of annotated corpora.
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Seeing Is Not Sharing: Some Vision-Language Models Overestimate Common Ground in Asymmetric Dialogue
arXiv:2606.31719v1 Announce Type: cross Abstract: In collaborative dialogue, shared perception does not guarantee shared interpretation. Mutual understanding mu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
STEB: Style Text Embedding Benchmark
arXiv:2606.31741v1 Announce Type: cross Abstract: While semantic embeddings are rigorously evaluated on the Massive Text Embedding Benchmark, the evaluation of
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
FedXDS: Leveraging Model Attribution Methods to counteract Data Heterogeneity in Federated Learning
arXiv:2606.31742v1 Announce Type: cross Abstract: Explainable AI (XAI) methods have demonstrated significant success in recent years at identifying relevant fea
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
CHERRY: Compressed Hierarchical Experts with Recurrent Representational Yield
arXiv:2606.31796v1 Announce Type: cross Abstract: We study three complementary techniques for training compute-efficient language models. (1) Selective supervis
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Geometry-Preserving Orthonormal Initialization for Low-Rank Adaptation in RLVR
arXiv:2606.31813v1 Announce Type: cross Abstract: Low-rank adaptation (LoRA) and its variants enable parameter-efficient fine-tuning of large language models un
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning
arXiv:2606.31825v1 Announce Type: cross Abstract: Recent multimodal large language models have shown great promise in clinical image reasoning, but existing pos
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Real-Time Source-Free Object Detection
arXiv:2606.31834v1 Announce Type: cross Abstract: Real-world detectors for autonomous driving, surveillance, and robotics must handle domain-shifts under strict
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Z-1: Efficient Reinforcement Learning for Vision-Language-Action Models
arXiv:2606.31846v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models offer a promising framework for robotic manipulation by connecting languag
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Attend, Transform, or Silence: Operator-Level Visual Skipping for Efficient Multimodal LLM Inference
arXiv:2606.31903v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) increasingly process long visual-token sequences, increasing the over
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
GR2 Technical Report
arXiv:2606.31984v1 Announce Type: cross Abstract: Industrial recommendation systems serve billions of users through a multi-stage funnel -- retrieval, early-sta
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 1d ago
Amplifying Membership Signal Through Chained Regeneration
arXiv:2606.31991v1 Announce Type: cross Abstract: The tendency of large generative models to memorize training data makes sample verification critical for priva