Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,216 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Diagnosing and Repairing Unsafe Channels in Vision-Language Models via Causal Discovery and Dual-Modal Safety Subspace Projection
arXiv:2603.27240v1 Announce Type: cross Abstract: Large Vision-Language Models (LVLMs) have achieved impressive performance across multimodal understanding and
Zero-shot Vision-Language Reranking for Cross-View Geolocalization
arXiv:2603.27251v1 Announce Type: cross Abstract: Cross-view geolocalization (CVGL) systems, while effective at retrieving a list of relevant candidates (high R
Amalgam: Hybrid LLM-PGM Synthesis Algorithm for Accuracy and Realism
arXiv:2603.27254v1 Announce Type: cross Abstract: To generate synthetic datasets, e.g., in domains such as healthcare, the literature proposes approaches of two
From Foundation ECG Models to NISQ Learners: Distilling ECGFounder into a VQC Student
arXiv:2603.27269v1 Announce Type: cross Abstract: Foundation models have recently improved electrocardiogram (ECG) representation learning, but their deployment
Codebase-Memory: Tree-Sitter-Based Knowledge Graphs for LLM Code Exploration via MCP
arXiv:2603.27277v1 Announce Type: cross Abstract: Large Language Model (LLM) coding agents typically explore codebases through repeated file-reading and grep-se
GUIDE: Guided Updates for In-context Decision Evolution in LLM-Driven Spacecraft Operations
arXiv:2603.27306v1 Announce Type: cross Abstract: Large language models (LLMs) have been proposed as supervisory agents for spacecraft operations, but existing
Culturally Adaptive Explainable LLM Assessment for Multilingual Information Disorder: A Human-in-the-Loop Approach
arXiv:2603.27356v1 Announce Type: cross Abstract: Recognizing information disorder is difficult because judgments about manipulation depend on cultural and ling
Conditional Factuality Controlled LLMs with Generalization Certificates via Conformal Sampling
arXiv:2603.27403v1 Announce Type: cross Abstract: Large language models (LLMs) need reliable test-time control of hallucinations. Existing conformal methods for
The Geometry of Harmful Intent: Training-Free Anomaly Detection via Angular Deviation in LLM Residual Streams
arXiv:2603.27412v1 Announce Type: cross Abstract: We present LatentBiopsy, a training-free method for detecting harmful prompts by analysing the geometry of res
CarbonEdge: Carbon-Aware Deep Learning Inference Framework for Sustainable Edge Computing
arXiv:2603.27420v1 Announce Type: cross Abstract: Deep learning applications at the network edge lead to a significant growth in AI-related carbon emissions, pr
Improving Attributed Long-form Question Answering with Intent Awareness
arXiv:2603.27435v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly being used to generate comprehensive, knowledge-intensive report
Multi-Agent Dialectical Refinement for Enhanced Argument Classification
arXiv:2603.27451v1 Announce Type: cross Abstract: Argument Mining (AM) is a foundational technology for automated writing evaluation, yet traditional supervised
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
arXiv:2603.27460v1 Announce Type: cross Abstract: Foundation models have demonstrated remarkable success across diverse domains and tasks, primarily due to the
TurboAngle: Near-Lossless KV Cache Compression via Uniform Angle Quantization
arXiv:2603.27467v1 Announce Type: cross Abstract: We compress KV cache entries by quantizing angles in the Fast Walsh-Hadamard domain, where a random diagonal r
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models
arXiv:2603.27481v1 Announce Type: cross Abstract: Multimodal Continual Instruction Tuning aims to continually enhance Large Vision Language Models (LVLMs) by le
Difference Feedback: Generating Multimodal Process-Level Supervision for VLM Reinforcement Learning
arXiv:2603.27482v1 Announce Type: cross Abstract: Vision-language models (VLMs) are increasingly aligned via Group Relative Policy Optimization (GRPO)-style tr
Copilot-Assisted Second-Thought Framework for Brain-to-Robot Hand Motion Decoding
arXiv:2603.27492v1 Announce Type: cross Abstract: Motor kinematics prediction (MKP) from electroencephalography (EEG) is an important research area for developi
Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs
arXiv:2603.27494v1 Announce Type: cross Abstract: To enhance the perception and reasoning capabilities of multimodal large language models in complex visual sce
Toward Reliable Evaluation of LLM-Based Financial Multi-Agent Systems: Taxonomy, Coordination Primacy, and Cost Awareness
arXiv:2603.27539v1 Announce Type: cross Abstract: Multi-agent systems based on large language models (LLMs) for financial trading have grown rapidly since 2023,
STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding
arXiv:2603.27593v1 Announce Type: cross Abstract: Recent progress in video large language models (Video-LLMs) has enabled strong offline reasoning over long and
Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling
arXiv:2603.27624v1 Announce Type: cross Abstract: Mixture-of-Experts is a promising approach for edge AI with low-batch inference. Yet, on-device deployments of
Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents
arXiv:2603.27626v1 Announce Type: cross Abstract: I propose Umwelt engineering, the deliberate design of the linguistic cognitive environment, as a third la
EvA: An Evidence-First Audio Understanding Paradigm for LALMs
arXiv:2603.27667v1 Announce Type: cross Abstract: Large Audio Language Models (LALMs) still struggle in complex acoustic scenes because they often fail to prese
LVRPO: Language-Visual Alignment with GRPO for Multimodal Understanding and Generation
arXiv:2603.27693v1 Announce Type: cross Abstract: Unified multimodal pretraining has emerged as a promising paradigm for jointly modeling language and vision wi
RAP: Retrieve, Adapt, and Prompt-Fit for Training-Free Few-Shot Medical Image Segmentation
arXiv:2603.27705v1 Announce Type: cross Abstract: Few-shot medical image segmentation (FSMIS) has achieved notable progress, yet most existing methods mainly re
KVSculpt: KV Cache Compression as Distillation
arXiv:2603.27819v1 Announce Type: cross Abstract: KV cache compression is critical for efficient long-context LLM inference. Approaches that reduce the per-pair
Kernel Dynamics under Path Entropy Maximization
arXiv:2603.27880v1 Announce Type: cross Abstract: We propose a variational framework in which the kernel function k : X × X → R, interpreted as the foundationa
ITQ3_S: High-Fidelity 3-bit LLM Inference via Interleaved Ternary Quantization with Rotation-Domain Smoothing
arXiv:2603.27914v1 Announce Type: cross Abstract: We present ITQ3_S (Interleaved Ternary Quantization – Specialized), a novel 3-bit weight quantizati
Adversarial Attacks on Multimodal Large Language Models: A Comprehensive Survey
arXiv:2603.27918v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) integrate information from multiple modalities such as text, images,
JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding
arXiv:2603.27942v1 Announce Type: cross Abstract: Japanese scene text poses challenges that multilingual benchmarks often fail to capture, including mixed scrip
CDH-Bench: A Commonsense-Driven Hallucination Benchmark for Evaluating Visual Fidelity in Vision-Language Models
arXiv:2603.27982v1 Announce Type: cross Abstract: Vision-language models (VLMs) achieve strong performance on many benchmarks, yet a basic reliability question
FedFG: Privacy-Preserving and Robust Federated Learning via Flow-Matching Generation
arXiv:2603.27986v1 Announce Type: cross Abstract: Federated learning (FL) enables distributed clients to collaboratively train a global model using local privat
Beyond Dataset Distillation: Lossless Dataset Concentration via Diffusion-Assisted Distribution Alignment
arXiv:2603.27987v1 Announce Type: cross Abstract: The high cost and accessibility problem associated with large datasets hinder the development of large-scale v
ViviDoc: Generating Interactive Documents through Human-Agent Collaboration
arXiv:2603.27991v1 Announce Type: cross Abstract: Interactive documents help readers engage with complex ideas through dynamic visualization, interactive animat
Kill-Chain Canaries: Stage-Level Tracking of Prompt Injection Across Attack Surfaces and Model Safety Tiers
arXiv:2603.28013v1 Announce Type: cross Abstract: We present a stage-decomposed analysis of prompt injection attacks against five frontier LLM agents. Prior wor
Synonymix: Unified Group Personas for Generative Simulations
arXiv:2603.28066v1 Announce Type: cross Abstract: Generative agent simulations operate at two scales: individual personas for character interaction, and populat
MolmoPoint: Better Pointing for VLMs with Grounding Tokens
arXiv:2603.28069v1 Announce Type: cross Abstract: Grounding has become a fundamental capability of vision-language models (VLMs). Most existing VLMs point by ge
MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions
arXiv:2603.28086v1 Announce Type: cross Abstract: Voice design from natural language aims to generate speaker timbres directly from free-form textual descriptio
Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models
arXiv:2603.28103v1 Announce Type: cross Abstract: Parliamentary proceedings represent a rich yet challenging resource for computational analysis, particularly w
Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search with EEG Data
arXiv:2603.28122v1 Announce Type: cross Abstract: Integrating quantum circuits into deep learning pipelines remains challenging due to heuristic design limitati
Does Claude's Constitution Have a Culture?
arXiv:2603.28123v1 Announce Type: cross Abstract: Constitutional AI (CAI) aligns language models with explicitly stated normative principles, offering a transpa
MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios
arXiv:2603.28130v1 Announce Type: cross Abstract: We introduce Multilingual Document Parsing Benchmark, the first benchmark for multilingual digital and photogr
RecycleLoRA: Rank-Revealing QR-Based Dual-LoRA Subspace Adaptation for Domain Generalized Semantic Segmentation
arXiv:2603.28142v1 Announce Type: cross Abstract: Domain Generalized Semantic Segmentation (DGSS) aims to maintain robust performance across unseen target domai
Evaluating Privilege Usage of Agents on Real-World Tools
arXiv:2603.28166v1 Announce Type: cross Abstract: Equipping LLM agents with real-world tools can substantially improve productivity. However, granting agents au
ERPO: Token-Level Entropy-Regulated Policy Optimization for Large Reasoning Models
arXiv:2603.28204v1 Announce Type: cross Abstract: Reinforcement learning from verifiable rewards (RLVR) has significantly advanced the reasoning capabilities of
DiffAttn: Diffusion-Based Drivers' Visual Attention Prediction with LLM-Enhanced Semantic Reasoning
arXiv:2603.28251v1 Announce Type: cross Abstract: Drivers' visual attention provides critical cues for anticipating latent hazards and directly shapes decision-
Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries
arXiv:2603.28258v1 Announce Type: cross Abstract: Categorical perception (CP), enhanced discriminability at category boundaries, is among the most studied p
Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights
arXiv:2603.28263v1 Announce Type: cross Abstract: Large Language Models (LLMs) remain heavily centered on English, with limited performance in low-resource lang
DeepCamp AI