Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

52,975

lessons

Skills in this topic

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 21,425 Reads 31,550

All Reads (31,550): Articles (13,448), Blog Posts (5,947), Tutorials (2,551), Research Papers (8,683), News (921)

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

One Framework for All: Cross-Modal Membership Inference for Generative Models

arXiv:2607.04339v1 Announce Type: cross Abstract: Large generative models across text-to-text, text-to-image, and image-to-text modalities have been shown to po

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

IRIS: An Intelligent Vision-Language System for Ocular Surface Diseases via Topic Tree and Scene-Driven VQA Generation

arXiv:2607.04344v1 Announce Type: cross Abstract: While Large Vision-Language Models (VLMs) demonstrate remarkable generic capabilities, their clinical reasonin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Auto-AEG: Scalable Data Construction for Open-Vocabulary Audio Event Grounding

arXiv:2607.04383v1 Announce Type: cross Abstract: Large Audio-Language Models (LALMs) reason fluently about sound yet struggle to localize precisely when events

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Full-Stack FP4: Stable LLM Pretraining with Quantized Projections, Optimizers, and Attention

arXiv:2607.04422v1 Announce Type: cross Abstract: Recent NVFP4 pretraining methods mainly target transformer linear layers, leaving optimizer states, optimizer

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Transferability Between Understanding and Generation in Unified Multimodal Models

arXiv:2607.04423v1 Announce Type: cross Abstract: Unified Multimodal Models (UMMs) integrate image understanding and generation within a single architecture, ye

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

dOPSD: On-Policy Self-Distillation for Diffusion Language Models

arXiv:2607.04428v1 Announce Type: cross Abstract: Diffusion large language models (dLLMs) generate text by iteratively denoising a masked sequence, offering a p

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

evalci: A Python Library for Statistically Rigorous Comparison of Language Model Evaluations

arXiv:2607.04429v1 Announce Type: cross Abstract: The dominant practice in language model evaluation is to report a single accuracy number per model and declare

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Covert Trait Propagation Is Representation Alignment: Mechanistic Evidence from Hidden-Channel Distillation

arXiv:2607.04432v1 Announce Type: cross Abstract: A student model trained on pure uniform noise can still inherit its teacher's digit-classification ability, pr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

A Retrieval-Augmented Framework for Detecting and Resolving Pragmatic Ambiguities in Natural Language Requirements

arXiv:2607.04436v1 Announce Type: cross Abstract: Natural language requirements (NLRs) are essential for bridging communication gaps among diverse stakeholders

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Regime-Conditional Stabilisation of LLM-Augmented Cooperative Multi-Agent Reinforcement Learning

arXiv:2607.04470v1 Announce Type: cross Abstract: Large Language Models (LLMs) offer a natural interface for translating human objectives into reward signals fo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

PulmoSight-XAI: An Explainable Multi-View Attention Ensemble with Gradient Boosting Meta-Learning for Multi-Label Chest X-Ray Classification

arXiv:2607.04478v1 Announce Type: cross Abstract: Automated chest X-ray classification remains challenging due to severe class imbalance, co-occurring pathologi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Transplanting, inverting, and preventing a misalignment persona: method-conditional emergent misalignment in Qwen2.5

arXiv:2607.04510v1 Announce Type: cross Abstract: Emergent misalignment (EM) -- the broad misbehaviour a language model acquires after fine-tuning on narrow har

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Failures and Successes to Learn a Core Conceptual Distinction from the Statistics of Language

arXiv:2607.04523v1 Announce Type: cross Abstract: Generic statements like "tigers are striped" and "cars have radios" communicate information that is, in genera

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Language Models Represent and Transform Concepts with Shared Geometry

arXiv:2607.04525v1 Announce Type: cross Abstract: How concepts are represented in neural networks is a fundamental question in machine learning. The dominant vi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Obey, Diverge, Collapse: Blind Obedience to Incorrect Instructions Drives Code LLMs to Irrecoverable Code Semantic Collapse

arXiv:2607.04537v1 Announce Type: cross Abstract: Code language models are now trusted collaborators in production workflows for debugging, refactoring, and ite

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Auto: The AGI Compiler

arXiv:2607.04542v1 Announce Type: cross Abstract: Every LLM agent run re-derives its behavior token by token on a frontier model: brilliant, expensive, slow, an

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Predicting Therapeutic Outcome via Aligning Patient-Specific Knowledge Graph and Gene-Level Perturbation Representations

arXiv:2607.04557v1 Announce Type: cross Abstract: Accurate prediction of patient-specific therapeutic response from pre-treatment transcriptomes is hindered by

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

A Few Teacher Steps Go a Long Way: Cost-Efficient On-Policy Data Augmentation for Agent Post-Training

arXiv:2607.04574v1 Announce Type: cross Abstract: For LLM agents, supervised fine-tuning is not only about teacher labels' quality, but also about which interac

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

LLM-Driven CI-CD Workflow Intelligence for Cyber Systems Engineering

arXiv:2607.04579v1 Announce Type: cross Abstract: CI/CD workflows have become executable operational policy: they decide what gets built, tested, released, and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Simple-to-Complex Structured Demonstrations for Vision-Language-Action Learning

arXiv:2607.04591v1 Announce Type: cross Abstract: Vision-Language-Action (VLA) models have demonstrated strong capabilities in robotic manipulation by integrati

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

TORINO: Token Reduction via Interpretable Concept Overlap in Vision-Language Models

arXiv:2607.04593v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) have demonstrated impressive capabilities across different tasks, but their comp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Hierarchical Evidence-Driven Reasoning for Long Document Understanding

arXiv:2607.04625v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) streamlines long-document understanding by leveraging retrieval mechanism

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Elastic Gang: Per-Token Membership Change for a Hard-Barriered LLM Inference Gang Co-Scheduled with OS Processes

arXiv:2607.04668v1 Announce Type: cross Abstract: On-device LLM decoding is a hard-barriered CPU-SIMD computation that wants every core for milliseconds per tok

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Do Vision-Language-Action Models Mean What They Say? On the Role of Faithfulness in Embodied Reasoning

arXiv:2607.04681v1 Announce Type: cross Abstract: Embodied Chain-of-Thought has emerged as a promising mechanism to enhance robot decision-making and interpreta

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

ToolFailBench: Diagnosing Tool-Use Failures in LLM Agents

arXiv:2607.04686v1 Announce Type: cross Abstract: Tool calling is central to modern language model agents, but aggregate benchmark scores often hide where tool

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

URSA: Chemistry-Aware Benchmark for Utilitarian Retrosynthesis Assessment

arXiv:2607.04688v1 Announce Type: cross Abstract: Synthesis planning aiming to find pathways of reactions for a target molecule is one of the most important and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

RSPO: Reward-Swap Policy Optimization for Multi-Turn LLM Agents

arXiv:2607.04713v1 Announce Type: cross Abstract: Reinforcement learning holds significant potential for training large language models (LLMs) to handle multi-t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Dashboard2Code: Evaluating Multimodal Models on Reconstructing Interactive Dashboards

arXiv:2607.04727v1 Announce Type: cross Abstract: Automatic data visualization generation has advanced rapidly with multi-modal large language models, yet exist

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Turning Off-Policy Tokens On-Policy: A Plug-in Approach for Improving LLM Alignment

arXiv:2607.04728v1 Announce Type: cross Abstract: Reinforcement learning (RL) post-training for large language models (LLMs) follows an efficient paradigm of "ro

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Evaluating the Effect of Linguistic Relatedness on Cross-Lingual Transfer in Large Multilingual Automatic Speech Recognition

arXiv:2607.04814v1 Announce Type: cross Abstract: Extending automatic speech recognition (ASR) to low-resource African languages is constrained by the prohibiti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Pretraining Curricula Enable Selective Fine-tuning

arXiv:2607.04846v1 Announce Type: cross Abstract: Transformers follow implicit curricula whereby some tasks are learned before others. However, how explicit pre

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Input Pathways Shape Few-Shot, Not Zero-Shot, Binding in Tiny Transformers: A Fully-Enumerable Study

arXiv:2607.04926v1 Announce Type: cross Abstract: How does the way information reaches a transformer -- as symbolic tokens, a clean per-factor "oracle" code, or

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

LLM for the development of FCM

arXiv:2607.04983v1 Announce Type: cross Abstract: This article is about the development of a fuzzy cognitive map using a local large language model. In the ligh

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

ImputeECG: Deep Learning Reconstruction of Complete 12-Lead Electrocardiograms from Incomplete Recordings for Cardiac Assessment

arXiv:2607.05009v1 Announce Type: cross Abstract: Complete digital 12-lead electrocardiograms (ECGs) are essential for AI-enabled cardiovascular assessment, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Your Agent's Memories Are Not Its Own: Forged Reasoning Attacks on LLM Agent Memory and Defenses

arXiv:2607.05029v1 Announce Type: cross Abstract: Persistent memory has enabled large language model (LLM) agents to store factual knowledge, prior decisions, r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

LLM-Based Test Oracles: Source-of-Authority Taxonomy -- A Systematic Literature Review

arXiv:2607.05031v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly used to produce test oracles, the part of a test that decides wh

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Beyond Independent Labels: Schwartz-Geometry Decoding for Human Value Detection

arXiv:2607.05052v1 Announce Type: cross Abstract: Human value detection is commonly formulated as sentence-level multi-label classification over the 19 refined

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Grokking Is Conditional and Fragile: A Fully-Tractable, Multi-Seed Study at 12K Parameters

arXiv:2607.05104v1 Announce Type: cross Abstract: Grokking -- the delayed onset of generalization long after a network has fit its training set -- is usually st

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Localized LoRA-MoE: Block-wise Low-Rank Experts With Adaptive Routing

arXiv:2607.05114v1 Announce Type: cross Abstract: Large Language Models (LLMs) and high-dimensional perception networks increasingly rely on parameter-efficient

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Unified Audio Intelligence Without Regressing on Text Intelligence

arXiv:2607.05196v1 Announce Type: cross Abstract: Audio intelligence involves understanding, reasoning about, and generating both audio and speech. In this work

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Noisy-Channel Minimum Bayes Risk Decoding

arXiv:2607.05198v1 Announce Type: cross Abstract: Minimum Bayes Risk (MBR) decoding yields more robust and higher-quality text generation than maximum a posteri

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

ProPS: Prompted Profile Synthesis for Natural Language-Conditioned Speaker Embedding Distributions

arXiv:2607.05276v1 Announce Type: cross Abstract: Speaker embeddings, or x-vectors, are widely used to represent speaker identity and speaker-related attributes

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

TREK: Distill to Explore, Reinforce to Refine

arXiv:2607.05339v1 Announce Type: cross Abstract: Group Relative Policy Optimization (GRPO) is effective when the current policy already samples useful reasonin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Selective Disclosure Watermarking for Large Language Models

arXiv:2607.05353v1 Announce Type: cross Abstract: Watermarking methods embed imperceptible and verifiable signals into text generated by large language models (

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

SPEARBench: A Benchmark for Naturalness Evaluation in Streaming Speech-to-Speech Language Models

arXiv:2607.05365v1 Announce Type: cross Abstract: Streaming speech-to-speech language models aim to answer spoken queries directly with synthetic speech. Howeve

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

AI's Blind Spots: Geographic Knowledge and Diversity Deficit in Generated Urban Scenario

arXiv:2506.16898v2 Announce Type: replace Abstract: Diffusion-based text-to-image models are increasingly used for urban analysis and scenario generation, but t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

A Technical Survey of Reinforcement Learning Techniques for Large Language Models

arXiv:2507.04136v2 Announce Type: replace Abstract: This survey offers a comprehensive foundation on the integration of RL with language models, highlighting pr

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3d ago

Activation-Deactivation: A General Framework for Robust Post-hoc Explainable AI

arXiv:2510.01038v2 Announce Type: replace Abstract: Perturbation-based explainability methods face criticism due to their reliance on out-of-distribution mutant