Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,288 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget
arXiv:2604.01195v1 Announce Type: cross Abstract: Search agents, which integrate language models (LMs) with web search, are becoming crucial for answering compl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Code Comprehension then Auditing for Unsupervised LLM Evaluation
arXiv:2410.03131v4 Announce Type: replace Abstract: Large Language Models (LLMs) for unsupervised code correctness evaluation have recently gained attention bec
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG
arXiv:2501.09136v4 Announce Type: replace Abstract: Large Language Models (LLMs) have advanced artificial intelligence by enabling human-like text generation an
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment
arXiv:2503.02976v3 Announce Type: replace Abstract: Large language models (LLMs), initially developed for generative AI, are now evolving into agentic AI system
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering
arXiv:2505.12189v3 Announce Type: replace Abstract: Large language models (LLMs) exhibit reasoning biases, often conflating content plausibility with formal log
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
LocationReasoner: Evaluating LLMs on Real-World Site Selection Reasoning
arXiv:2506.13841v3 Announce Type: replace Abstract: Recent advances in large language models (LLMs), particularly those enhanced through reinforced post-trainin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
HiMA-Ecom: Enabling Joint Training of Hierarchical Multi-Agent E-commerce Assistants
arXiv:2506.19846v2 Announce Type: replace Abstract: Hierarchical multi-agent systems based on large language models (LLMs) have become a common paradigm for bui
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Auto-Formulating Dynamic Programming Problems with Large Language Models
arXiv:2507.11737v2 Announce Type: replace Abstract: Dynamic programming (DP) is a fundamental method in operations research, but formulating DP models has tradi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Retrieval-of-Thought: Efficient Reasoning via Reusing Thoughts
arXiv:2509.21743v2 Announce Type: replace Abstract: Large reasoning models improve accuracy by producing long reasoning traces, but this inflates latency and co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents
arXiv:2509.25302v2 Announce Type: replace Abstract: The prevalent deployment of Large Language Model agents such as OpenClaw unlocks potential in real-world app
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming
arXiv:2510.18314v2 Announce Type: replace Abstract: As large language model (LLM) agents increasingly automate complex web tasks, they boost productivity while
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
EHRStruct: A Comprehensive Benchmark Framework for Evaluating Large Language Models on Structured Electronic Health Record Tasks
arXiv:2511.08206v4 Announce Type: replace Abstract: Structured Electronic Health Record (EHR) data stores patient information in relational tables and plays a c
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
3w ago
DR-LoRA: Dynamic Rank LoRA for Fine-Tuning Mixture-of-Experts Models
arXiv:2601.04823v4 Announce Type: replace Abstract: Mixture-of-Experts (MoE) has become a prominent paradigm for scaling Large Language Models (LLMs). Parameter
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Distilling the Thought, Watermarking the Answer: A Principle Semantic Guided Watermark for Large Reasoning Models
arXiv:2601.05144v2 Announce Type: replace Abstract: Reasoning Large Language Models (RLLMs) excelling in complex tasks present unique challenges for digital wat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Finite-State Controllers for (Hidden-Model) POMDPs using Deep Reinforcement Learning
arXiv:2602.08734v2 Announce Type: replace Abstract: Solving partially observable Markov decision processes (POMDPs) requires computing policies under imperfect
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Meta-Learning and Meta-Reinforcement Learning -- Tracing the Path towards DeepMind's Adaptive Agent
arXiv:2602.19837v2 Announce Type: replace Abstract: Humans are highly effective at utilizing prior knowledge to adapt to novel tasks, a capability that standard
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Epistemic Filtering and Collective Hallucination: A Jury Theorem for Confidence-Calibrated Agents
arXiv:2602.22413v2 Announce Type: replace Abstract: We investigate the collective accuracy of heterogeneous agents who learn to estimate their own reliability o
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
When Agents Persuade: Rhetoric Generation and Mitigation in LLMs
arXiv:2603.04636v2 Announce Type: replace Abstract: Despite their wide-ranging benefits, LLM-based agents deployed in open environments can be exploited to prod
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
How Blind and Low-Vision Individuals Prefer Large Vision-Language Model-Generated Scene Descriptions
arXiv:2502.14883v3 Announce Type: replace-cross Abstract: For individuals with blindness or low vision (BLV), navigating complex environments can pose serious r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Neural Conditional Transport Maps
arXiv:2505.15808v2 Announce Type: replace-cross Abstract: We present a neural framework for learning conditional optimal transport (OT) maps between probability
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
But what is your honest answer? Aiding LLM-judges with honest alternatives using steering vectors
arXiv:2505.17760v3 Announce Type: replace-cross Abstract: LLM-as-a-judge is widely used as a scalable substitute for human evaluation, yet current approaches re
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Graceful Forgetting in Generative Language Models
arXiv:2505.19715v2 Announce Type: replace-cross Abstract: Recently, the pretrain-finetune paradigm has become a cornerstone in various deep learning areas. Whil
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
How Does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective
arXiv:2505.21505v3 Announce Type: replace-cross Abstract: Multilingual Alignment is an effective and representative paradigm to enhance LLMs' multilingual capab
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
"Is This Really a Human Peer Supporter?": Misalignments Between Peer Supporters and Experts in LLM-Supported Interactions
arXiv:2506.09354v2 Announce Type: replace-cross Abstract: Mental health is a growing global concern, prompting interest in AI-driven solutions to expand access
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
MemeMind: A Large-Scale Multimodal Dataset with Chain-of-Thought Reasoning for Harmful Meme Detection
arXiv:2506.18919v4 Announce Type: replace-cross Abstract: As a multimodal medium combining images and text, memes frequently convey implicit harmful content thr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
arXiv:2508.07629v4 Announce Type: replace-cross Abstract: We present Klear-Reasoner, a model with long reasoning capabilities that demonstrates careful delibera
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
FedKLPR: KL-Guided Pruning-Aware Federated Learning for Person Re-Identification
arXiv:2508.17431v2 Announce Type: replace-cross Abstract: Person re-identification (re-ID) is a fundamental task in intelligent surveillance and public safety.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Polychromic Objectives for Reinforcement Learning
arXiv:2509.25424v4 Announce Type: replace-cross Abstract: Reinforcement learning fine-tuning (RLFT) is a dominant paradigm for improving pretrained policies for
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Are Large Vision-Language Models Ready to Guide Blind and Low-Vision Individuals?
arXiv:2510.00766v2 Announce Type: replace-cross Abstract: Large Vision-Language Models (LVLMs) demonstrate a promising direction for assisting individuals with
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
TempoControl: Temporal Attention Guidance for Text-to-Video Models
arXiv:2510.02226v3 Announce Type: replace-cross Abstract: Recent advances in generative video models have enabled the creation of high-quality videos based on n
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Incoherence in Goal-Conditioned Autoregressive Models
arXiv:2510.06545v2 Announce Type: replace-cross Abstract: We investigate mathematically the notion of incoherence: a structural issue with reinforcement learnin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
E-Scores for (In)Correctness Assessment of Generative Model Outputs
arXiv:2510.25770v2 Announce Type: replace-cross Abstract: While generative models, especially large language models (LLMs), are ubiquitous in today's world, pri
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Benchmarking Educational LLMs with Analytics: A Case Study on Gender Bias in Feedback
arXiv:2511.08225v2 Announce Type: replace-cross Abstract: As teachers increasingly turn to GenAI in their educational practice, we need robust methods to benchm
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
DuoTok: Source-Aware Dual-Track Tokenization for Multi-Track Music Language Modeling
arXiv:2511.20224v2 Announce Type: replace-cross Abstract: Audio tokenization bridges continuous waveforms and multi-track music language models. In dual-track m
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Structured Prompts Improve Evaluation of Language Models
arXiv:2511.20836v3 Announce Type: replace-cross Abstract: As language models (LMs) are increasingly adopted across domains, high-quality benchmarking frameworks
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
OmniFusion: Simultaneous Multilingual Multimodal Translations via Modular Fusion
arXiv:2512.00234v2 Announce Type: replace-cross Abstract: There has been significant progress in open-source text-only translation large language models (LLMs)
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Lumos: Let there be Language Model System Certification
arXiv:2512.02966v2 Announce Type: replace-cross Abstract: We introduce the first principled framework, Lumos, for specifying and formally certifying Language Mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Bypassing Prompt Injection Detectors through Evasive Injections
arXiv:2602.00750v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly used in interactive and retrieval-augmented systems, but
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
On the Non-Identifiability of Steering Vectors in Large Language Models
arXiv:2602.06801v4 Announce Type: replace-cross Abstract: Activation steering methods are widely used to control large language model (LLM) behavior and are oft
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
FIRE: Frobenius-Isometry Reinitialization for Balancing the Stability-Plasticity Tradeoff
arXiv:2602.08040v3 Announce Type: replace-cross Abstract: Deep neural networks trained on nonstationary data must balance stability (i.e., retaining prior knowl
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Evaluating LLM-Generated ACSL Annotations for Formal Verification
arXiv:2602.13851v2 Announce Type: replace-cross Abstract: Formal specifications are crucial for building verifiable and dependable software systems, yet generat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Chat-Based Support Alone May Not Be Enough: Comparing Conversational and Embedded LLM Feedback for Mathematical Proof Learning
arXiv:2602.18807v2 Announce Type: replace-cross Abstract: We evaluate GPTutor, an LLM-powered tutoring system for an undergraduate discrete mathematics course.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
OPERA: Online Data Pruning for Efficient Retrieval Model Adaptation
arXiv:2603.17205v2 Announce Type: replace-cross Abstract: Domain-specific finetuning is essential for dense retrievers, yet not all training pairs contribute eq
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
3w ago
Open Source Project of the Day (Part 27): Awesome AI Coding - A One-Stop AI Programming Resource Navigator
Introduction "AI coding tools and resources are scattered everywhere. A topically organized, searchable, contributable list can save enormous amounts of search
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
3w ago
How I Built Cryptographic Signing for Every AI Agent Tool Call
Your AI agent just mass-deleted a production database. Can you prove exactly what it did? When? W
Dev.to AI
🧠 Large Language Models
⚡ AI Lesson
3w ago
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
The AI landscape is experiencing unprecedented growth and transformation. This post delves into the key developments shaping the future of artificial intelligen
LangChain Blog
🧠 Large Language Models
⚡ AI Lesson
3w ago
March 2026: LangChain Newsletter
It feels like spring has sprung here, and so has a new NVIDIA integration, ticket sales for Interrupt 2026, and announcing LangSmith Fleet (formerly Agent Build

Forbes Innovation
🧠 Large Language Models
⚡ AI Lesson
3w ago
The $6 Trillion Question: What AI Can And Can’t Do For Climate Finance
Artificial intelligence has an important role to play in improving climate finance flows, and the climate finance world has a role to play in shaping AI govern
DeepCamp AI