Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Microsoft Research
🧠 Large Language Models
⚡ AI Lesson
1d ago
Memora: A Harmonic Memory Representation Balancing Abstraction and Specificity
AI agents can't remember past conversations. They must constantly reload or retrieve context, which grows less efficient as tasks get longer and more complex. M
![I made a quiz that tells you which LLM you align with most, based on personality and values research across 15 models [R]](https://preview.redd.it/yx86ia6rr6ah1.png?width=140&height=80&auto=webp&s=50a7c238e71f794f9908533538785f72e88913a9)
Reddit r/MachineLearning
🧠 Large Language Models
⚡ AI Lesson
1d ago
I made a quiz that tells you which LLM you align with most, based on personality and values research across 15 models [R]
<img src="https://preview.redd.it/yx86ia6rr6ah1.png?width=140&height=80&auto=webp&s=50a7c238e71f794f9908533538785f72e88913a9" alt="I made a quiz tha
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1d ago
AI-Model Network: Concept, Current State and Future
arXiv:2606.27382v1 Announce Type: new Abstract: While the primary function of computers lies in computation and processing, the core value of the Internet is ro
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1d ago
Odyssey: Constructing Verifiable Local Truth-Preserving Foundation Models
arXiv:2606.27593v1 Announce Type: new Abstract: We introduce a categorical framework called ODYSSEY for constructing verifiable, local truth-preserving foundati
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1d ago
DysLexLens: A Low-Resource LLM Framework for Analysing Dyslexic Learners Insights from Online Forums
arXiv:2606.27619v1 Announce Type: new Abstract: Dyslexic learners increasingly use artificial intelligence (AI) tools to support reading, writing, organisation,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1d ago
MER-R1: Multimodal Emotion Reasoning via Slow-Fast Thinking Synergy
arXiv:2606.27652v1 Announce Type: new Abstract: We find that explicit reasoning does not necessarily translate into better multimodal emotion recognition (MER)
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1d ago
ToE: A Hierarchical and Explainable Claim Verification Framework with Dynamic Multi-source Evidence Retrieval and Aggregation
arXiv:2606.27736v1 Announce Type: new Abstract: The rapid spread of fake news poses increasing threats to information ecosystems, especially as AI-generated mis
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1d ago
Grounded Iterative Language Planning: How Parameterized World Models Reduce Hallucination Propagation in LLM Agents
arXiv:2606.27806v1 Announce Type: new Abstract: World models for language agents come in two useful forms. An agent-based world model calls an LLM API and reaso
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1d ago
NormAct: A Benchmark for Hidden Social Norm Compliance in Embodied Planning
arXiv:2606.27826v1 Announce Type: new Abstract: Multimodal large language models (MLLMs) are increasingly deployed as embodied planners in egocentric environmen
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1d ago
RelBall: Relation Ball with Quaternion Rotation for Knowledge Graph Completion
arXiv:2606.27967v1 Announce Type: new Abstract: Real-world knowledge graphs are often incomplete, lacking many valid facts. Knowledge Graph Completion (KGC) aim

Interconnects
🧠 Large Language Models
⚡ AI Lesson
2d ago
Latest open artifacts (#22): Zyphra, Cohere, and Poolside are expanding the breadth of the ecosystem
An assessment of the open ecosystem and the motivations behind releasing models
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Refusal Lives Downstream of Persona in Chat Models
arXiv:2606.26161v1 Announce Type: new Abstract: Linear directions in activation space have been identified for both refusal and persona traits in instruction-tu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
AlgoEvolve: LLM-driven Meta-evolution of Algorithmic Trading Programs
arXiv:2606.26173v1 Announce Type: new Abstract: Recent work shows that Large Language Models (LLMs) can act as semantic mutation operators for the evolutionary
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Agentic Analysis for Agentic Infrastructure: An LLM-Powered Pipeline for Comparative Governance of DAO and Corporate AI Protocols
arXiv:2606.26203v1 Announce Type: new Abstract: As AI agent protocols proliferate, the governance structures shaping their interoperability standards remain emp
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
COrigami: An AI Pipeline for Co-Designing Flat-Foldable Visually Recognisable Origami
arXiv:2606.26299v1 Announce Type: new Abstract: While generative AI has achieved remarkable success in solving problems with verifiable solutions, generating ph
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
How Do Tool-Augmented LLM Agents Perform on Real-World Energy Analytics Tasks?
arXiv:2606.26346v1 Announce Type: new Abstract: Agentic benchmarks have emerged across general-purpose and domain-specific settings, including finance, coding,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
What We are Missing in Multimodal LLM Evaluation?
arXiv:2606.26348v1 Announce Type: new Abstract: Multimodal large language models (MLLMs) can process diverse inputs, e.g., text, images, audio, and video, and g
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
4d ago
Accelerating Returns and the Qualitative Engine for Science
arXiv:2606.26359v1 Announce Type: new Abstract: Ray Kurzweil described a thesis of accelerating returns, which is the most influential narratives in discussions
DeepMind Blog
🧠 Large Language Models
⚡ AI Lesson
6d ago
Introducing computer use in Gemini 3.5 Flash
MarkTechPost
🧠 Large Language Models
⚡ AI Lesson
1w ago
Mistral OCR 4 Brings Citation-Ready Structured Output to RAG, Agentic, and Enterprise Search Pipelines
Mistral AI released OCR 4 on June 23, 2026, moving from clean text extraction to structured document output. Each block returns a bounding box, a typed classifi
MarkTechPost
🧠 Large Language Models
⚡ AI Lesson
1w ago
Sakana AI Launches Sakana Fugu: An Orchestration Model That Routes Tasks Across a Swappable Pool of Frontier LLMs
Fugu and Fugu Ultra route tasks across a swappable model pool, leading most coding, reasoning, and agentic benchmarks. The post Sakana AI Launches Sakana Fugu:
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
LLM Doesn't Know What It Doesn't Know: Detecting Epistemic Blind Spots via Cross-Model Attribution Divergence on Clinical Tabular Data
arXiv:2606.19509v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly applied to structured clinical data, yet whether they can recogniz
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
REVEAL++: Differentiable Phenotypic Grouping for Vision-Language Retinal Modeling of Alzheimer's Disease Risk
arXiv:2606.19522v1 Announce Type: new Abstract: The retina offers a noninvasive window into neurodegenerative disease, capturing subtle structural patterns asso
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Emergent Alignment
arXiv:2606.19527v1 Announce Type: new Abstract: Can Large Language Models (LLMs) discern when their own outputs are misaligned with human ethics? And can they s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
ITNet: A Learnable Integral Transform That Subsumes Convolution, Attention, and Recurrence
arXiv:2606.19538v1 Announce Type: new Abstract: Convolutional networks, recurrent networks, and transformers each encode different inductive biases -- locality,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Uncertainty Decomposition for Clarification Seeking in LLM Agents
arXiv:2606.19559v1 Announce Type: new Abstract: Recent position papers argue that the classical aleatoric/epistemic uncertainty framework is insufficient for in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Analyzing the Narration Gap in LLM-Solver Loops
arXiv:2606.19588v1 Announce Type: new Abstract: Formal tools such as SAT and SMT solvers are increasingly embedded in language model reasoning pipelines when a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Configurable Clinical Information Extraction with Agentic RAG: What Works, What Breaks, and Why
arXiv:2606.19602v1 Announce Type: new Abstract: Patient contexts span hundreds of heterogeneous documents and thousands of structured data points, yet the docum
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Which Pairs to Compare for LLM Post-Training?
arXiv:2606.19607v1 Announce Type: new Abstract: Preference-based post-training has become a central paradigm for aligning language models. A common data-collect
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Toten: Knowledge-Based Ontological Tokenization Of Physical Quantities And Technical Notation In Brazilian Portuguese
arXiv:2606.19626v1 Announce Type: new Abstract: Byte-Pair Encoding tokenization is statistically efficient for vocabulary compression, but semantically blind to
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation
arXiv:2606.19651v1 Announce Type: new Abstract: Three-dimensional (3D) brain MRI is central to clinical neurology and neuro-oncology, where generative models co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents
arXiv:2606.19704v1 Announce Type: new Abstract: Agent benchmarks are growing fast, but no single benchmark touches more than four or five of the dimensions that
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
GLARE: A Natural Language Interface for Querying Global Explanations
arXiv:2606.19735v1 Announce Type: new Abstract: While global explanations are crucial for understanding vision models across datasets, classes, and decision con
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Grounded Inference: Principles for Deterministically Encapsulated Generative Models
arXiv:2606.19753v1 Announce Type: new Abstract: The incorporation of generative models into traditional computational systems presents both enormous opportunity
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Beyond Entropy: Learning from Token-Level Distributional Deviations for LLM Reasoning
arXiv:2606.19771v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has significantly advanced Large Language Model (LLM) reas
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
ORAgentBench: Can LLM Agents Solve Challenging Operations Research Tasks End to End?
arXiv:2606.19787v1 Announce Type: new Abstract: Large language models are increasingly deployed as autonomous agents for multi-step tasks in executable environm
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
CombEval: A Framework for Evaluating Combinatorial Counting in Large Language Models
arXiv:2606.19788v1 Announce Type: new Abstract: We present CombEval, a dynamic benchmark for evaluating combinatorial counting in large language models. CombEva
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Human-on-the-Loop Orchestration for AI-Assisted Legal Discovery
arXiv:2606.19812v1 Announce Type: new Abstract: Autonomous Large Language Model (LLM) agents are increasingly deployed in electronic discovery (e-discovery), wh
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
A Systematic Evaluation of Black-Box Uncertainty Estimation Methods for Large Language Models
arXiv:2606.19868v1 Announce Type: new Abstract: Although large language models (LLMs) have shown strong capabilities across a wide range of tasks, their outputs
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
MetaResearcher: Scaling Deep Research via Self-Reflective Reinforcement Learning in Adversarial Virtual Environments
arXiv:2606.19893v1 Announce Type: new Abstract: Deep research agents have demonstrated remarkable capabilities in autonomous information gathering and synthesis
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Multi-Agent Transactive Memory
arXiv:2606.19911v1 Announce Type: new Abstract: The decentralized deployment of LLM agents with diverse capabilities across diverse tasks motivates infrastructu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Advancing DialNav through Automatic Embodied Dialog Augmentation
arXiv:2606.19948v1 Announce Type: new Abstract: For embodied agents capable of physical interaction, the capability to create and understand dialog is crucial t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Process-Verified Reinforcement Learning for Theorem Proving via Lean
arXiv:2606.20068v1 Announce Type: new Abstract: While reinforcement learning from verifiable rewards (RLVR) typically has relied on a single binary verification
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Residual-Space Evolutionary Optimization via Flow-based Generative Models
arXiv:2606.20084v1 Announce Type: new Abstract: Data editing with generative methods typically requires differentiable objectives and gradient-based search. How
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
ScaffoldAgent: Utility-Guided Dynamic Outline Optimization for Open-Ended Deep Research
arXiv:2606.20122v1 Announce Type: new Abstract: Open-ended deep research (OEDR) requires systems to acquire knowledge through multi-round retrieval and generate
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Apparent Psychological Profiles of Large Language Models are Largely a Measurement Artifact
arXiv:2606.20205v1 Announce Type: new Abstract: Psychological instruments designed for humans are increasingly used to assign large language models (LLMs) stabl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
QMFOL: Benchmarking Large Language Model Reasoning via Quantifiable Monadic First-Order Logic Test Case Generation
arXiv:2606.20227v1 Announce Type: new Abstract: Large Language Models (LLMs) have made significant progress in reasoning, particularly in deductive reasoning, w
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1w ago
Thermodynamic Measure of Intelligence
arXiv:2606.20231v1 Announce Type: new Abstract: Can intelligence be measured? We propose that intelligence can be defined as the lawful amplification of rare bu
DeepCamp AI