Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,754

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,450 Reads 5,304

Showing 5,304 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Structured Agent Distillation for Large Language Model

arXiv:2505.13820v4 Announce Type: replace-cross Abstract: Large language models (LLMs) exhibit strong capabilities as decision-making agents by interleaving rea

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

VLM-SAFE: Vision-Language Model-Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving

arXiv:2505.16377v2 Announce Type: replace-cross Abstract: Autonomous driving policy learning with reinforcement learning (RL) is fundamentally limited by low sa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification

arXiv:2506.04450v5 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are increasingly adopted across domains such as education, healthcare, an

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights

arXiv:2506.17337v4 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) have shown promise in automating image diagnosis and interpretation in c

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Multi-Sample Prompting and Actor-Critic Prompt Optimization for Diverse Synthetic Data Generation

arXiv:2506.21138v2 Announce Type: replace-cross Abstract: High-quality labeled datasets are fundamental for training and evaluating machine learning models, yet

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models

arXiv:2508.02343v2 Announce Type: replace-cross Abstract: Quantization significantly accelerates inference in large language models (LLMs) by replacing original

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting

arXiv:2508.13773v3 Announce Type: replace-cross Abstract: Despite advances in the Transformer architecture, their effectiveness for long-term time series foreca

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation

arXiv:2509.16952v2 Announce Type: replace-cross Abstract: The growing volume of academic papers has made it increasingly difficult for researchers to efficientl

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Multi-View Attention Multiple-Instance Learning Enhanced by LLM Reasoning for Cognitive Distortion Detection

arXiv:2509.17292v2 Announce Type: replace-cross Abstract: Cognitive distortions have been closely linked to mental health disorders, yet their automatic detecti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Advancing Few-Shot Pediatric Arrhythmia Classification with a Novel Contrastive Loss and Multimodal Learning

arXiv:2509.19315v2 Announce Type: replace-cross Abstract: Arrhythmias are a major cause of sudden cardiac death in children, making automated rhythm classificat

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Dual-Space Smoothness for Robust and Balanced LLM Unlearning

arXiv:2509.23362v2 Announce Type: replace-cross Abstract: As large language models evolve, Machine Unlearning has emerged to address growing concerns around use

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models

arXiv:2509.25848v3 Announce Type: replace-cross Abstract: Reasoning has emerged as a pivotal capability in Large Language Models (LLMs). Through Reinforcement L

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

arXiv:2510.04618v3 Announce Type: replace-cross Abstract: Large language model (LLM) applications such as agents and domain-specific reasoning increasingly rely

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling

arXiv:2510.05825v2 Announce Type: replace-cross Abstract: Inference-Time Scaling (ITS) improves language models by allocating more computation at generation tim

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation

arXiv:2510.08553v2 Announce Type: replace-cross Abstract: Vision-and-Language Navigation (VLN) requires agents to follow natural language instructions through e

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CLMN: Concept based Language Models via Neural Symbolic Reasoning

arXiv:2510.10063v2 Announce Type: replace-cross Abstract: Deep learning has advanced NLP, but interpretability remains limited, especially in healthcare and fin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Schema for In-Context Learning

arXiv:2510.13905v3 Announce Type: replace-cross Abstract: In-Context Learning (ICL) enables transformer-based language models to adapt to new tasks by condition

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings

arXiv:2510.15681v3 Announce Type: replace-cross Abstract: Translating human-written mathematical theorems and proofs from natural language (NL) into formal lang

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models

arXiv:2510.20351v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly exposed to data contamination, i.e., performance gains d

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning

arXiv:2510.25311v2 Announce Type: replace-cross Abstract: Reinforcement Learning algorithms are primarily focused on learning a policy that maximizes expected r

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks

arXiv:2511.10465v2 Announce Type: replace-cross Abstract: While prompt optimization has emerged as a critical technique for enhancing language model performance

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation

arXiv:2511.11483v4 Announce Type: replace-cross Abstract: Recent text-to-image (T2I) models have made remarkable progress in generating visually realistic and s

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Scaling Spatial Intelligence with Multimodal Foundation Models

arXiv:2511.13719v4 Announce Type: replace-cross Abstract: Despite remarkable progress, multimodal foundation models still exhibit surprising deficiencies in spa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Object-Centric World Models for Causality-Aware Reinforcement Learning

arXiv:2511.14262v3 Announce Type: replace-cross Abstract: World models have been developed to support sample-efficient deep reinforcement learning agents. Howev

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning

arXiv:2511.15090v2 Announce Type: replace-cross Abstract: Scientific documents contain complex multimodal structures, which makes evidence localization and scie

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search

arXiv:2511.16681v2 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) systems have become a dominant approach to augment large language

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

UniGame: Turning a Unified Multimodal Model Into Its Own Adversary

arXiv:2511.19413v3 Announce Type: replace-cross Abstract: Unified Multimodal Models (UMMs) have shown impressive performance in both understanding and generatio

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings

arXiv:2511.21428v2 Announce Type: replace-cross Abstract: We present a novel unsupervised framework to unlock vast unlabeled human demonstration data from conti

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Single-Round Scalable Analytic Federated Learning

arXiv:2512.03336v2 Announce Type: replace-cross Abstract: Federated Learning (FL) is plagued by two key challenges: high communication overhead and performance

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

A Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control

arXiv:2512.04653v2 Announce Type: replace-cross Abstract: Multi-agent reinforcement learning (MARL) has emerged as a promising paradigm for adaptive traffic sig

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Multilingual Medical Reasoning for Question Answering with Large Language Models

arXiv:2512.05658v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) with reasoning capabilities have recently demonstrated strong potential i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models

arXiv:2512.08503v2 Announce Type: replace-cross Abstract: Multi-modal large reasoning models (MLRMs) pose significant privacy risks by inferring precise geograp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models

arXiv:2512.10932v2 Announce Type: replace-cross Abstract: Early children's developmental trajectories set up a natural goal for sample-efficient pretraining of

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Does Tone Change the Answer? Evaluating Prompt Politeness Effects on Modern LLMs: GPT, Gemini, and LLaMA

arXiv:2512.12812v2 Announce Type: replace-cross Abstract: Prompt engineering has emerged as a critical factor influencing large language model (LLM) performance

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Measuring all the noises of LLM Evals

arXiv:2512.21326v2 Announce Type: replace-cross Abstract: Separating signal from noise is central to experiments. Applying well-established statistical methods

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

JMedEthicBench: A Multi-Turn Conversational Benchmark for Evaluating Medical Safety in Japanese Large Language Models

arXiv:2601.01627v2 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) are increasingly deployed in healthcare field, it becomes essential to

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Symphonym: Universal Phonetic Embeddings for Cross-Script Name Matching

arXiv:2601.06932v4 Announce Type: replace-cross Abstract: Matching place names across writing systems is a persistent obstacle to the integration of multilingua

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Sparse-RL: Breaking the Memory Wall in LLM Reinforcement Learning via Stable Sparse Rollouts

arXiv:2601.10079v2 Announce Type: replace-cross Abstract: Reinforcement Learning (RL) has become essential for eliciting complex reasoning capabilities in Large

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

LLMs versus the Halting Problem: Revisiting Program Termination Prediction

arXiv:2601.18987v4 Announce Type: replace-cross Abstract: Determining whether a program terminates is a central problem in computer science. Turing's foundation

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Does My Chatbot Have an Agenda? Understanding Human and AI Agency in Human-Human-like Chatbot Interaction

arXiv:2601.22452v2 Announce Type: replace-cross Abstract: As AI chatbots shift from tools to companions, critical questions arise: who controls the conversation

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

TextBFGS: A Case-Based Reasoning Approach to Code Optimization via Error-Operator Retrieval

arXiv:2602.00059v2 Announce Type: replace-cross Abstract: Iterative code generation with Large Language Models (LLMs) can be viewed as an optimization process g

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Can Small Language Models Handle Context-Summarized Multi-Turn Customer-Service QA? A Synthetic Data-Driven Comparative Evaluation

arXiv:2602.00665v2 Announce Type: replace-cross Abstract: Customer-service question answering (QA) systems increasingly rely on conversational language understa

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Unveiling Implicit Advantage Symmetry: Why GRPO Struggles with Exploration and Difficulty Adaptation

arXiv:2602.05548v3 Announce Type: replace-cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR), particularly GRPO, has become the standard for

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

A Theoretical Analysis of Test-Driven LLM Code Generation

arXiv:2602.06098v2 Announce Type: replace-cross Abstract: Coding assistants are increasingly utilized in test-driven software development, yet the theoretical m

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CLEAR: A Knowledge-Centric Vessel Trajectory Analysis Platform

arXiv:2602.08482v2 Announce Type: replace-cross Abstract: Vessel trajectory data from the Automatic Identification System (AIS) is used widely in maritime analy

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CoPE-VideoLM: Leveraging Codec Primitives For Efficient Video Language Modeling

arXiv:2602.13191v2 Announce Type: replace-cross Abstract: Video Language Models (VideoLMs) enable AI systems to understand temporal dynamics in videos. To fit w

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

MALLVI: A Multi-Agent Framework for Integrated Generalized Robotics Manipulation

arXiv:2602.16898v4 Announce Type: replace-cross Abstract: Task planning for robotic manipulation with large language models (LLMs) is an emerging area. Prior ap

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CCCaption: Dual-Reward Reinforcement Learning for Complete and Correct Image Captioning

arXiv:2602.21655v2 Announce Type: replace-cross Abstract: Image captioning remains a fundamental task for vision language understanding, yet ground-truth superv