Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,089 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks
arXiv:2604.05100v1 Announce Type: cross Abstract: Instructed code editing, where an LLM modifies existing code based on a natural language instruction, accounts
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner
arXiv:2604.05112v1 Announce Type: cross Abstract: Recent progress in in-context reinforcement learning (ICRL) has demonstrated its potential for training genera
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Watch Before You Answer: Learning from Visually Grounded Post-Training
arXiv:2604.05117v1 Announce Type: cross Abstract: It is critical for vision-language models (VLMs) to comprehensively understand visual, temporal, and textual c
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Offline RL for Adaptive Policy Retrieval in Prior Authorization
arXiv:2604.05125v1 Announce Type: cross Abstract: Prior authorization (PA) requires interpretation of complex and fragmented coverage policies, yet existing ret
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning
arXiv:2604.05134v1 Announce Type: cross Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning evolve
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
EffiPair: Improving the Efficiency of LLM-generated Code with Relative Contrastive Feedback
arXiv:2604.05137v1 Announce Type: cross Abstract: Large language models (LLMs) often generate code that is functionally correct but inefficient in runtime and m
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Compiled AI: Deterministic Code Generation for LLM-Based Workflow Automation
arXiv:2604.05150v1 Announce Type: cross Abstract: We study compiled AI, a paradigm in which large language models generate executable code artifacts during a co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Planning to Explore: Curiosity-Driven Planning for LLM Test Generation
arXiv:2604.05159v1 Announce Type: cross Abstract: The use of LLMs for code generation has naturally extended to code testing and evaluation. As codebases grow i
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Not All Turns Are Equally Hard: Adaptive Thinking Budgets For Efficient Multi-Turn Reasoning
arXiv:2604.05164v1 Announce Type: cross Abstract: As LLM reasoning performance plateau, improving inference-time compute efficiency is crucial to mitigate overt
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
From Use to Oversight: How Mental Models Influence User Behavior and Output in AI Writing Assistants
arXiv:2604.05166v1 Announce Type: cross Abstract: AI-based writing assistants are ubiquitous, yet little is known about how users' mental models shape their use
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters for Diffusion Models
arXiv:2604.05183v1 Announce Type: cross Abstract: In a rapidly growing field of model training there is a constant practical interest in parameter-efficient fin
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Improving Clinical Trial Recruitment using Clinical Narratives and Large Language Models
arXiv:2604.05190v1 Announce Type: cross Abstract: Screening patients for enrollment is a well-known, labor-intensive bottleneck that leads to under-enrollment a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts
arXiv:2604.05242v1 Announce Type: cross Abstract: Multi-bit watermarking has emerged as a promising solution for embedding imperceptible binary messages into La
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Exemplar Retrieval Without Overhypothesis Induction: Limits of Distributional Sequence Learning in Early Word Learning
arXiv:2604.05243v1 Announce Type: cross Abstract: Background: Children do not simply learn that balls are round and blocks are square. They learn that shape is
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Extending Tabular Denoising Diffusion Probabilistic Models for Time-Series Data Generation
arXiv:2604.05257v1 Announce Type: cross Abstract: Diffusion models are increasingly being utilised to create synthetic tabular and time series data for privacy-
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Region-R1: Reinforcing Query-Side Region Cropping for Multi-Modal Re-Ranking
arXiv:2604.05268v1 Announce Type: cross Abstract: Multi-modal retrieval-augmented generation (MM-RAG) relies heavily on re-rankers to surface the most relevant
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Broken by Default: A Formal Verification Study of Security Vulnerabilities in AI-Generated Code
arXiv:2604.05292v1 Announce Type: cross Abstract: AI coding assistants are now used to generate production code in security-sensitive domains, yet the exploitab
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLMs Should Express Uncertainty Explicitly
arXiv:2604.05306v1 Announce Type: cross Abstract: Large language models are increasingly used in settings where uncertainty must drive decisions such as abstent
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
DQA: Diagnostic Question Answering for IT Support
arXiv:2604.05350v1 Announce Type: cross Abstract: Enterprise IT support interactions are fundamentally diagnostic: effective resolution requires iterative evide
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
OGA-AID: Clinician-in-the-loop AI Report Drafting Assistant for Multimodal Observational Gait Analysis in Post-Stroke Rehabilitation
arXiv:2604.05360v1 Announce Type: cross Abstract: Gait analysis is essential in post-stroke rehabilitation but remains time-intensive and cognitively demanding,
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
VideoStir: Understanding Long Videos via Spatio-Temporally Structured and Intent-Aware RAG
arXiv:2604.05418v1 Announce Type: cross Abstract: Scaling multimodal large language models (MLLMs) to long videos is constrained by limited context windows. Whi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads
arXiv:2604.05426v1 Announce Type: cross Abstract: Low-Rank Adaptation (LoRA) is now the dominant method for parameter-efficient fine-tuning of large language mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Your LLM Agent Can Leak Your Data: Data Exfiltration via Backdoored Tool Use
arXiv:2604.05432v1 Announce Type: cross Abstract: Tool-use large language model (LLM) agents are increasingly deployed to support sensitive workflows, relying o
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling
arXiv:2604.05445v1 Announce Type: cross Abstract: Vision-language reward modeling faces a dilemma: generative approaches are interpretable but slow, while discr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLM Evaluation as Tensor Completion: Low Rank Structure and Semiparametric Efficiency
arXiv:2604.05460v1 Announce Type: cross Abstract: Large language model (LLM) evaluation platforms increasingly rely on pairwise human judgments. These data are
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
On the Role of Fault Localization Context for LLM-Based Program Repair
arXiv:2604.05481v1 Announce Type: cross Abstract: Fault Localization (FL) is a key component of Large Language Model (LLM)-based Automated Program Repair (APR),
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Unifying VLM-Guided Flow Matching and Spectral Anomaly Detection for Interpretable Veterinary Diagnosis
arXiv:2604.05482v1 Announce Type: cross Abstract: Automatic diagnosis of canine pneumothorax is challenged by data scarcity and the need for trustworthy models.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Turbulence-like 5/3 spectral scaling in contextual representations of language as a complex system
arXiv:2604.05536v1 Announce Type: cross Abstract: Natural language is a complex system that exhibits robust statistical regularities. Here, we represent text as
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
FastDiSS: Few-step Match Many-step Diffusion Language Model on Sequence-to-Sequence Generation--Full Version
arXiv:2604.05551v1 Announce Type: cross Abstract: Self-conditioning has been central to the success of continuous diffusion language models, as it allows models
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue
arXiv:2604.05552v1 Announce Type: cross Abstract: Large Language Models demonstrate outstanding performance in many language tasks but still face fundamental ch
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
AI-Driven Modular Services for Accessible Multilingual Education in Immersive Extended Reality Settings: Integrating Speech Processing, Translation, and Sign Language Rendering
arXiv:2604.05591v1 Announce Type: cross Abstract: This work introduces a modular platform that brings together six AI services, automatic speech recognition via
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Analogical Reasoning as a Doctor: A Foundation Model for Gastrointestinal Endoscopy Diagnosis
arXiv:2604.05649v1 Announce Type: cross Abstract: Gastrointestinal diseases impose a growing global health burden, and endoscopy is a primary tool for early dia
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Multiscale Physics-Informed Neural Network for Complex Fluid Flows with Long-Range Dependencies
arXiv:2604.05652v1 Announce Type: cross Abstract: Fluid flows are governed by the nonlinear Navier-Stokes equations, which can manifest multiscale dynamics even
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
LLM Reasoning as Trajectories: Step-Specific Representation Geometry and Correctness Signals
arXiv:2604.05655v1 Announce Type: cross Abstract: This work characterizes large language models' chain-of-thought generation as a structured trajectory through
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Rectified Schr\"odinger Bridge Matching for Few-Step Visual Navigation
arXiv:2604.05673v1 Announce Type: cross Abstract: Visual navigation is a core challenge in Embodied AI, requiring autonomous agents to translate high-dimensiona
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
From Incomplete Architecture to Quantified Risk: Multimodal LLM-Driven Security Assessment for Cyber-Physical Systems
arXiv:2604.05674v1 Announce Type: cross Abstract: Cyber-physical systems often contend with incomplete architectural documentation or outdated information resul
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion
arXiv:2604.05688v1 Announce Type: cross Abstract: Key-Value (KV) cache memory and bandwidth increasingly dominate large language model inference cost in long-co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Hackers or Hallucinators? A Comprehensive Analysis of LLM-Based Automated Penetration Testing
arXiv:2604.05719v1 Announce Type: cross Abstract: The rapid advancement of Large Language Models (LLMs) has created new opportunities for Automated Penetration
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
CAKE: Cloud Architecture Knowledge Evaluation of Large Language Models
arXiv:2604.05755v1 Announce Type: cross Abstract: In today's software architecture, large language models (LLMs) serve as software architecture co-pilots. Howev
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
What Models Know, How Well They Know It: Knowledge-Weighted Fine-Tuning for Learning When to Say "I Don't Know"
arXiv:2604.05779v1 Announce Type: cross Abstract: While large language models (LLMs) demonstrate strong capabilities across diverse user queries, they still suf
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
EEG-MFTNet: An Enhanced EEGNet Architecture with Multi-Scale Temporal Convolutions and Transformer Fusion for Cross-Session Motor Imagery Decoding
arXiv:2604.05843v1 Announce Type: cross Abstract: Brain-computer interfaces (BCIs) enable direct communication between the brain and external devices, providing
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Neural Network Pruning via QUBO Optimization
arXiv:2604.05856v1 Announce Type: cross Abstract: Neural network pruning can be formulated as a combinatorial optimization problem, yet most existing approaches
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Swiss-Bench 003: Evaluating LLM Reliability and Adversarial Security for Swiss Regulatory Contexts
arXiv:2604.05872v1 Announce Type: cross Abstract: The deployment of large language models (LLMs) in Swiss financial and regulatory contexts demands empirical ev
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Selective Aggregation of Attention Maps Improves Diffusion-Based Visual Interpretation
arXiv:2604.05906v1 Announce Type: cross Abstract: Numerous studies on text-to-image (T2I) generative models have utilized cross-attention maps to boost applicat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
"I See What You Did There": Can Large Vision-Language Models Understand Multimodal Puns?
arXiv:2604.05930v1 Announce Type: cross Abstract: Puns are a common form of rhetorical wordplay that exploits polysemy and phonetic similarity to create humor.
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning
arXiv:2604.05931v1 Announce Type: cross Abstract: Zero-shot unsupervised reinforcement learning (URL) offers a promising direction for building generalist agent
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
Does Pass Rate Tell the Whole Story? Evaluating Design Constraint Compliance in LLM-based Issue Resolution
arXiv:2604.05955v1 Announce Type: cross Abstract: Repository-level issue resolution benchmarks have become a standard testbed for evaluating LLM-based agents, y
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
2w ago
The Model Agreed, But Didn't Learn: Diagnosing Surface Compliance in Large Language Models
arXiv:2604.05995v1 Announce Type: cross Abstract: Large Language Models (LLMs) internalize vast world knowledge as parametric memory, yet inevitably inherit the
DeepCamp AI