Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,754
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,304 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Structured Agent Distillation for Large Language Model
arXiv:2505.13820v4 Announce Type: replace-cross Abstract: Large language models (LLMs) exhibit strong capabilities as decision-making agents by interleaving rea
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
VLM-SAFE: Vision-Language Model-Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving
arXiv:2505.16377v2 Announce Type: replace-cross Abstract: Autonomous driving policy learning with reinforcement learning (RL) is fundamentally limited by low sa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification
arXiv:2506.04450v5 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are increasingly adopted across domains such as education, healthcare, an
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights
arXiv:2506.17337v4 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) have shown promise in automating image diagnosis and interpretation in c
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Multi-Sample Prompting and Actor-Critic Prompt Optimization for Diverse Synthetic Data Generation
arXiv:2506.21138v2 Announce Type: replace-cross Abstract: High-quality labeled datasets are fundamental for training and evaluating machine learning models, yet
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models
arXiv:2508.02343v2 Announce Type: replace-cross Abstract: Quantization significantly accelerates inference in large language models (LLMs) by replacing original
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting
arXiv:2508.13773v3 Announce Type: replace-cross Abstract: Despite advances in the Transformer architecture, their effectiveness for long-term time series foreca
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation
arXiv:2509.16952v2 Announce Type: replace-cross Abstract: The growing volume of academic papers has made it increasingly difficult for researchers to efficientl
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Multi-View Attention Multiple-Instance Learning Enhanced by LLM Reasoning for Cognitive Distortion Detection
arXiv:2509.17292v2 Announce Type: replace-cross Abstract: Cognitive distortions have been closely linked to mental health disorders, yet their automatic detecti
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Advancing Few-Shot Pediatric Arrhythmia Classification with a Novel Contrastive Loss and Multimodal Learning
arXiv:2509.19315v2 Announce Type: replace-cross Abstract: Arrhythmias are a major cause of sudden cardiac death in children, making automated rhythm classificat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Dual-Space Smoothness for Robust and Balanced LLM Unlearning
arXiv:2509.23362v2 Announce Type: replace-cross Abstract: As large language models evolve, Machine Unlearning has emerged to address growing concerns around use
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models
arXiv:2509.25848v3 Announce Type: replace-cross Abstract: Reasoning has emerged as a pivotal capability in Large Language Models (LLMs). Through Reinforcement L
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
arXiv:2510.04618v3 Announce Type: replace-cross Abstract: Large language model (LLM) applications such as agents and domain-specific reasoning increasingly rely
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling
arXiv:2510.05825v2 Announce Type: replace-cross Abstract: Inference-Time Scaling (ITS) improves language models by allocating more computation at generation tim
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation
arXiv:2510.08553v2 Announce Type: replace-cross Abstract: Vision-and-Language Navigation (VLN) requires agents to follow natural language instructions through e
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CLMN: Concept based Language Models via Neural Symbolic Reasoning
arXiv:2510.10063v2 Announce Type: replace-cross Abstract: Deep learning has advanced NLP, but interpretability remains limited, especially in healthcare and fin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Schema for In-Context Learning
arXiv:2510.13905v3 Announce Type: replace-cross Abstract: In-Context Learning (ICL) enables transformer-based language models to adapt to new tasks by condition
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings
arXiv:2510.15681v3 Announce Type: replace-cross Abstract: Translating human-written mathematical theorems and proofs from natural language (NL) into formal lang
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models
arXiv:2510.20351v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly exposed to data contamination, i.e., performance gains d
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
arXiv:2510.25311v2 Announce Type: replace-cross Abstract: Reinforcement Learning algorithms are primarily focused on learning a policy that maximizes expected r
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
arXiv:2511.10465v2 Announce Type: replace-cross Abstract: While prompt optimization has emerged as a critical technique for enhancing language model performance
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation
arXiv:2511.11483v4 Announce Type: replace-cross Abstract: Recent text-to-image (T2I) models have made remarkable progress in generating visually realistic and s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Scaling Spatial Intelligence with Multimodal Foundation Models
arXiv:2511.13719v4 Announce Type: replace-cross Abstract: Despite remarkable progress, multimodal foundation models still exhibit surprising deficiencies in spa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Object-Centric World Models for Causality-Aware Reinforcement Learning
arXiv:2511.14262v3 Announce Type: replace-cross Abstract: World models have been developed to support sample-efficient deep reinforcement learning agents. Howev
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning
arXiv:2511.15090v2 Announce Type: replace-cross Abstract: Scientific documents contain complex multimodal structures, which makes evidence localization and scie
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search
arXiv:2511.16681v2 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) systems have become a dominant approach to augment large language
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
UniGame: Turning a Unified Multimodal Model Into Its Own Adversary
arXiv:2511.19413v3 Announce Type: replace-cross Abstract: Unified Multimodal Models (UMMs) have shown impressive performance in both understanding and generatio
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings
arXiv:2511.21428v2 Announce Type: replace-cross Abstract: We present a novel unsupervised framework to unlock vast unlabeled human demonstration data from conti
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Single-Round Scalable Analytic Federated Learning
arXiv:2512.03336v2 Announce Type: replace-cross Abstract: Federated Learning (FL) is plagued by two key challenges: high communication overhead and performance
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
A Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control
arXiv:2512.04653v2 Announce Type: replace-cross Abstract: Multi-agent reinforcement learning (MARL) has emerged as a promising paradigm for adaptive traffic sig
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Multilingual Medical Reasoning for Question Answering with Large Language Models
arXiv:2512.05658v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) with reasoning capabilities have recently demonstrated strong potential i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models
arXiv:2512.08503v2 Announce Type: replace-cross Abstract: Multi-modal large reasoning models (MLRMs) pose significant privacy risks by inferring precise geograp
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models
arXiv:2512.10932v2 Announce Type: replace-cross Abstract: Early children's developmental trajectories set up a natural goal for sample-efficient pretraining of
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Does Tone Change the Answer? Evaluating Prompt Politeness Effects on Modern LLMs: GPT, Gemini, and LLaMA
arXiv:2512.12812v2 Announce Type: replace-cross Abstract: Prompt engineering has emerged as a critical factor influencing large language model (LLM) performance
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Measuring all the noises of LLM Evals
arXiv:2512.21326v2 Announce Type: replace-cross Abstract: Separating signal from noise is central to experiments. Applying well-established statistical methods
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
JMedEthicBench: A Multi-Turn Conversational Benchmark for Evaluating Medical Safety in Japanese Large Language Models
arXiv:2601.01627v2 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) are increasingly deployed in healthcare field, it becomes essential to
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Symphonym: Universal Phonetic Embeddings for Cross-Script Name Matching
arXiv:2601.06932v4 Announce Type: replace-cross Abstract: Matching place names across writing systems is a persistent obstacle to the integration of multilingua
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Sparse-RL: Breaking the Memory Wall in LLM Reinforcement Learning via Stable Sparse Rollouts
arXiv:2601.10079v2 Announce Type: replace-cross Abstract: Reinforcement Learning (RL) has become essential for eliciting complex reasoning capabilities in Large
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
LLMs versus the Halting Problem: Revisiting Program Termination Prediction
arXiv:2601.18987v4 Announce Type: replace-cross Abstract: Determining whether a program terminates is a central problem in computer science. Turing's foundation
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Does My Chatbot Have an Agenda? Understanding Human and AI Agency in Human-Human-like Chatbot Interaction
arXiv:2601.22452v2 Announce Type: replace-cross Abstract: As AI chatbots shift from tools to companions, critical questions arise: who controls the conversation
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
TextBFGS: A Case-Based Reasoning Approach to Code Optimization via Error-Operator Retrieval
arXiv:2602.00059v2 Announce Type: replace-cross Abstract: Iterative code generation with Large Language Models (LLMs) can be viewed as an optimization process g
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Can Small Language Models Handle Context-Summarized Multi-Turn Customer-Service QA? A Synthetic Data-Driven Comparative Evaluation
arXiv:2602.00665v2 Announce Type: replace-cross Abstract: Customer-service question answering (QA) systems increasingly rely on conversational language understa
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
Unveiling Implicit Advantage Symmetry: Why GRPO Struggles with Exploration and Difficulty Adaptation
arXiv:2602.05548v3 Announce Type: replace-cross Abstract: Reinforcement Learning with Verifiable Rewards (RLVR), particularly GRPO, has become the standard for
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
A Theoretical Analysis of Test-Driven LLM Code Generation
arXiv:2602.06098v2 Announce Type: replace-cross Abstract: Coding assistants are increasingly utilized in test-driven software development, yet the theoretical m
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CLEAR: A Knowledge-Centric Vessel Trajectory Analysis Platform
arXiv:2602.08482v2 Announce Type: replace-cross Abstract: Vessel trajectory data from the Automatic Identification System (AIS) is used widely in maritime analy
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CoPE-VideoLM: Leveraging Codec Primitives For Efficient Video Language Modeling
arXiv:2602.13191v2 Announce Type: replace-cross Abstract: Video Language Models (VideoLMs) enable AI systems to understand temporal dynamics in videos. To fit w
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
MALLVI: A Multi-Agent Framework for Integrated Generalized Robotics Manipulation
arXiv:2602.16898v4 Announce Type: replace-cross Abstract: Task planning for robotic manipulation with large language models (LLMs) is an emerging area. Prior ap
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago
CCCaption: Dual-Reward Reinforcement Learning for Complete and Correct Image Captioning
arXiv:2602.21655v2 Announce Type: replace-cross Abstract: Image captioning remains a fundamental task for vision language understanding, yet ground-truth superv