Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,074 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Region-R1: Reinforcing Query-Side Region Cropping for Multi-Modal Re-Ranking
arXiv:2604.05268v1 Announce Type: cross Abstract: Multi-modal retrieval-augmented generation (MM-RAG) relies heavily on re-rankers to surface the most relevant
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Broken by Default: A Formal Verification Study of Security Vulnerabilities in AI-Generated Code
arXiv:2604.05292v1 Announce Type: cross Abstract: AI coding assistants are now used to generate production code in security-sensitive domains, yet the exploitab
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLMs Should Express Uncertainty Explicitly
arXiv:2604.05306v1 Announce Type: cross Abstract: Large language models are increasingly used in settings where uncertainty must drive decisions such as abstent
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
DQA: Diagnostic Question Answering for IT Support
arXiv:2604.05350v1 Announce Type: cross Abstract: Enterprise IT support interactions are fundamentally diagnostic: effective resolution requires iterative evide
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
OGA-AID: Clinician-in-the-loop AI Report Drafting Assistant for Multimodal Observational Gait Analysis in Post-Stroke Rehabilitation
arXiv:2604.05360v1 Announce Type: cross Abstract: Gait analysis is essential in post-stroke rehabilitation but remains time-intensive and cognitively demanding,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
VideoStir: Understanding Long Videos via Spatio-Temporally Structured and Intent-Aware RAG
arXiv:2604.05418v1 Announce Type: cross Abstract: Scaling multimodal large language models (MLLMs) to long videos is constrained by limited context windows. Whi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads
arXiv:2604.05426v1 Announce Type: cross Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Your LLM Agent Can Leak Your Data: Data Exfiltration via Backdoored Tool Use
arXiv:2604.05432v1 Announce Type: cross Abstract: Tool-use large language model (LLM) agents are increasingly deployed to support sensitive workflows, relying o
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling
arXiv:2604.05445v1 Announce Type: cross Abstract: Vision-language reward modeling faces a dilemma: generative approaches are interpretable but slow, while discr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLM Evaluation as Tensor Completion: Low Rank Structure and Semiparametric Efficiency
arXiv:2604.05460v1 Announce Type: cross Abstract: Large language model (LLM) evaluation platforms increasingly rely on pairwise human judgments. These data are
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
On the Role of Fault Localization Context for LLM-Based Program Repair
arXiv:2604.05481v1 Announce Type: cross Abstract: Fault Localization (FL) is a key component of Large Language Model (LLM)-based Automated Program Repair (APR),
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Unifying VLM-Guided Flow Matching and Spectral Anomaly Detection for Interpretable Veterinary Diagnosis
arXiv:2604.05482v1 Announce Type: cross Abstract: Automatic diagnosis of canine pneumothorax is challenged by data scarcity and the need for trustworthy models.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Turbulence-like 5/3 spectral scaling in contextual representations of language as a complex system
arXiv:2604.05536v1 Announce Type: cross Abstract: Natural language is a complex system that exhibits robust statistical regularities. Here, we represent text as
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
FastDiSS: Few-step Match Many-step Diffusion Language Model on Sequence-to-Sequence Generation--Full Version
arXiv:2604.05551v1 Announce Type: cross Abstract: Self-conditioning has been central to the success of continuous diffusion language models, as it allows models
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue
arXiv:2604.05552v1 Announce Type: cross Abstract: Large Language Models demonstrate outstanding performance in many language tasks but still face fundamental ch
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
AI-Driven Modular Services for Accessible Multilingual Education in Immersive Extended Reality Settings: Integrating Speech Processing, Translation, and Sign Language Rendering
arXiv:2604.05591v1 Announce Type: cross Abstract: This work introduces a modular platform that brings together six AI services, automatic speech recognition via
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Analogical Reasoning as a Doctor: A Foundation Model for Gastrointestinal Endoscopy Diagnosis
arXiv:2604.05649v1 Announce Type: cross Abstract: Gastrointestinal diseases impose a growing global health burden, and endoscopy is a primary tool for early dia
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Multiscale Physics-Informed Neural Network for Complex Fluid Flows with Long-Range Dependencies
arXiv:2604.05652v1 Announce Type: cross Abstract: Fluid flows are governed by the nonlinear Navier-Stokes equations, which can manifest multiscale dynamics even
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals
arXiv:2604.05655v1 Announce Type: cross Abstract: This work characterizes large language models' chain-of-thought generation as a structured trajectory through
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Rectified Schrödinger Bridge Matching for Few-Step Visual Navigation
arXiv:2604.05673v1 Announce Type: cross Abstract: Visual navigation is a core challenge in Embodied AI, requiring autonomous agents to translate high-dimensiona
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
From Incomplete Architecture to Quantified Risk: Multimodal LLM-Driven Security Assessment for Cyber-Physical Systems
arXiv:2604.05674v1 Announce Type: cross Abstract: Cyber-physical systems often contend with incomplete architectural documentation or outdated information resul
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion
arXiv:2604.05688v1 Announce Type: cross Abstract: Key-Value (KV) cache memory and bandwidth increasingly dominate large language model inference cost in long-co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Hackers or Hallucinators? A Comprehensive Analysis of LLM-Based Automated Penetration Testing
arXiv:2604.05719v1 Announce Type: cross Abstract: The rapid advancement of Large Language Models (LLMs) has created new opportunities for Automated Penetration
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
CAKE: Cloud Architecture Knowledge Evaluation of Large Language Models
arXiv:2604.05755v1 Announce Type: cross Abstract: In today's software architecture, large language models (LLMs) serve as software architecture co-pilots. Howev
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
What Models Know, How Well They Know It: Knowledge-Weighted Fine-Tuning for Learning When to Say "I Don't Know"
arXiv:2604.05779v1 Announce Type: cross Abstract: While large language models (LLMs) demonstrate strong capabilities across diverse user queries, they still suf
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
EEG-MFTNet: An Enhanced EEGNet Architecture with Multi-Scale Temporal Convolutions and Transformer Fusion for Cross-Session Motor Imagery Decoding
arXiv:2604.05843v1 Announce Type: cross Abstract: Brain-computer interfaces (BCIs) enable direct communication between the brain and external devices, providing
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Neural Network Pruning via QUBO Optimization
arXiv:2604.05856v1 Announce Type: cross Abstract: Neural network pruning can be formulated as a combinatorial optimization problem, yet most existing approaches
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Swiss-Bench 003: Evaluating LLM Reliability and Adversarial Security for Swiss Regulatory Contexts
arXiv:2604.05872v1 Announce Type: cross Abstract: The deployment of large language models (LLMs) in Swiss financial and regulatory contexts demands empirical ev
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Selective Aggregation of Attention Maps Improves Diffusion-Based Visual Interpretation
arXiv:2604.05906v1 Announce Type: cross Abstract: Numerous studies on text-to-image (T2I) generative models have utilized cross-attention maps to boost applicat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
"I See What You Did There": Can Large Vision-Language Models Understand Multimodal Puns?
arXiv:2604.05930v1 Announce Type: cross Abstract: Puns are a common form of rhetorical wordplay that exploits polysemy and phonetic similarity to create humor.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning
arXiv:2604.05931v1 Announce Type: cross Abstract: Zero-shot unsupervised reinforcement learning (URL) offers a promising direction for building generalist agent
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Does Pass Rate Tell the Whole Story? Evaluating Design Constraint Compliance in LLM-based Issue Resolution
arXiv:2604.05955v1 Announce Type: cross Abstract: Repository-level issue resolution benchmarks have become a standard testbed for evaluating LLM-based agents, y
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
The Model Agreed, But Didn't Learn: Diagnosing Surface Compliance in Large Language Models
arXiv:2604.05995v1 Announce Type: cross Abstract: Large Language Models (LLMs) internalize vast world knowledge as parametric memory, yet inevitably inherit the
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
CritBench: A Framework for Evaluating Cybersecurity Capabilities of Large Language Models in IEC 61850 Digital Substation Environments
arXiv:2604.06019v1 Announce Type: cross Abstract: The advancement of Large Language Models (LLMs) has raised concerns regarding their dual-use potential in cybe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
A Multi-Stage Validation Framework for Trustworthy Large-scale Clinical Information Extraction using Large Language Models
arXiv:2604.06028v1 Announce Type: cross Abstract: Large language models (LLMs) show promise for extracting clinically meaningful information from unstructured h
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Stories of Your Life as Others: A Round-Trip Evaluation of LLM-Generated Life Stories Conditioned on Rich Psychometric Profiles
arXiv:2604.06071v1 Announce Type: cross Abstract: Personality traits are richly encoded in natural language, and large language models (LLMs) trained on human t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning
arXiv:2604.06079v1 Announce Type: cross Abstract: Graphics Program Synthesis is pivotal for interpreting and editing visual data, effectively facilitating the r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LAG-XAI: A Lie-Inspired Affine Geometric Framework for Interpretable Paraphrasing in Transformer Latent Spaces
arXiv:2604.06086v1 Announce Type: cross Abstract: Modern Transformer-based language models achieve strong performance in natural language processing tasks, yet
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Social Dynamics as Critical Vulnerabilities that Undermine Objective Decision-Making in LLM Collectives
arXiv:2604.06091v1 Announce Type: cross Abstract: Large language model (LLM) agents are increasingly acting as human delegates in multi-agent environments, wher
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLM4CodeRE: Generative AI for Code Decompilation Analysis and Reverse Engineering
arXiv:2604.06095v1 Announce Type: cross Abstract: Code decompilation analysis is a fundamental yet challenging task in malware reverse engineering, particularly
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer
arXiv:2604.06129v1 Announce Type: cross Abstract: This paper introduces the Polynomial Mixer (PoM), a novel token mixing mechanism with linear complexity that s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Shot-Based Quantum Encoding: A Data-Loading Paradigm for Quantum Neural Networks
arXiv:2604.06135v1 Announce Type: cross Abstract: Efficient data loading remains a bottleneck for near-term quantum machine-learning. Existing schemes (angle, a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Generating Synthetic Doctor-Patient Conversations for Long-form Audio Summarization
arXiv:2604.06138v1 Announce Type: cross Abstract: Long-context audio reasoning is underserved in both training data and evaluation. Existing benchmarks target s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement
arXiv:2604.06155v1 Announce Type: cross Abstract: Whether Large Language Models (LLMs) develop coherent internal world models remains a core debate. While conve
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control
arXiv:2604.06156v1 Announce Type: cross Abstract: MLLMs have been successfully applied to multimodal embedding tasks, yet their generative reasoning capabilitie
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
In-Place Test-Time Training
arXiv:2604.06169v1 Announce Type: cross Abstract: The static "train then deploy" paradigm fundamentally limits Large Language Models (LLMs) from dynamically ad
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Advancing AI Research Assistants with Expert-Involved Learning
arXiv:2505.04638v5 Announce Type: replace Abstract: Large language models (LLMs) and large multimodal models (LMMs) promise to accelerate biomedical discovery,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Beyond Syntax: Action Semantics Learning for App Agents
arXiv:2506.17697v3 Announce Type: replace Abstract: The recent development of Large Language Models (LLMs) enables the rise of App agents that interpret user in
DeepCamp AI