Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,473 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Geometric Mixture-of-Experts with Curvature-Guided Adaptive Routing for Graph Representation Learning
arXiv:2603.22317v1 Announce Type: cross Abstract: Graph-structured data typically exhibits complex topological heterogeneity, making it difficult to model accur
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs
arXiv:2603.22321v1 Announce Type: cross Abstract: The recent advancements introduced by Large Language Models (LLMs) have transformed how Artificial Intelligenc
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
AEGIS: An Operational Infrastructure for Post-Market Governance of Adaptive Medical AI Under US and EU Regulations
arXiv:2603.22322v1 Announce Type: cross Abstract: Machine learning systems deployed in medical devices require governance frameworks that ensure safety while en
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
A Multi-Task Targeted Learning Framework for Lithium-Ion Battery State-of-Health and Remaining Useful Life
arXiv:2603.22323v1 Announce Type: cross Abstract: Accurately predicting the state-of-health (SOH) and remaining useful life (RUL) of lithium-ion batteries is cr
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
DAQ: Delta-Aware Quantization for Post-Training LLM Weight Compression
arXiv:2603.22324v1 Announce Type: cross Abstract: We introduce Delta-Aware Quantization (DAQ), a data-free post-training quantization framework that preserves t
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI
arXiv:2603.22327v1 Announce Type: cross Abstract: Systematic literature reviews are essential for synthesizing scientific evidence but are costly, difficult to
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Trained Persistent Memory for Frozen Decoder-Only LLMs
arXiv:2603.22329v1 Announce Type: cross Abstract: Decoder-only language models are stateless: hidden representations are discarded after every forward pass and
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Large Language Models for Missing Data Imputation: Understanding Behavior, Hallucination Effects, and Control Mechanisms
arXiv:2603.22332v1 Announce Type: cross Abstract: Data imputation is a cornerstone technique for handling missing values in real-world datasets, which are often
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Graph Signal Processing Meets Mamba2: Adaptive Filter Bank via Delta Modulation
arXiv:2603.22333v1 Announce Type: cross Abstract: State-space models (SSMs) offer efficient alternatives to attention with linear-time recurrence. Mamba2, a rec
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation
arXiv:2603.22335v1 Announce Type: cross Abstract: Direct Preference Optimization (DPO) guides large language models (LLMs) to generate recommendations aligned w
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Graphs RAG at Scale: Beyond Retrieval-Augmented Generation With Labeled Property Graphs and Resource Description Framework for Complex and Unknown Search Spaces
arXiv:2603.22340v1 Announce Type: cross Abstract: Recent advances in Retrieval-Augmented Generation (RAG) have revolutionized knowledge-intensive tasks, yet tra
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search
arXiv:2603.22341v1 Announce Type: cross Abstract: While prior red-teaming efforts have focused on eliciting harmful text outputs from large language models (LLM
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement
arXiv:2603.22352v1 Announce Type: cross Abstract: Recent progress in reinforcement learning with verifiable rewards (RLVR) offers a practical path to self-impro
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Early Discoveries of Algorithmist I: Promise of Provable Algorithm Synthesis at Scale
arXiv:2603.22363v1 Announce Type: cross Abstract: Designing algorithms with provable guarantees that also work well in practice remains difficult, requiring bot
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
MCLR: Improving Conditional Modeling in Visual Generative Models via Inter-Class Likelihood-Ratio Maximization and Establishing the Equivalence between Classifier-Free Guidance and Alignment Objectives
arXiv:2603.22364v1 Announce Type: cross Abstract: Diffusion models have achieved state-of-the-art performance in generative modeling, but their success often re
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Q-AGNN: Quantum-Enhanced Attentive Graph Neural Network for Intrusion Detection
arXiv:2603.22365v1 Announce Type: cross Abstract: With the rapid growth of interconnected devices, accurately detecting malicious activities in network traffic
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Modeling Quantum Federated Autoencoder for Anomaly Detection in IoT Networks
arXiv:2603.22366v1 Announce Type: cross Abstract: We propose a Quantum Federated Autoencoder for Anomaly Detection, a framework that leverages quantum federated
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window
arXiv:2603.22367v1 Announce Type: cross Abstract: Large Language Models (LLMs) deployed as autonomous agents commonly use Retrieval-Augmented Generation (RAG),
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
FAAR: Format-Aware Adaptive Rounding for NVFP4
arXiv:2603.22370v1 Announce Type: cross Abstract: Deploying large language models (LLMs) on edge devices requires extremely low-bit quantization. Ultra-low prec
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Three Creates All: You Only Sample 3 Steps
arXiv:2603.22375v1 Announce Type: cross Abstract: Diffusion models deliver high-fidelity generation but remain slow at inference time due to many sequential net
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Agents with Cloud Computing Access
arXiv:2603.22376v1 Announce Type: cross Abstract: Recent advances in AI agents for software engineering and scientific discovery have demonstrated remarkable ca
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Instruction-Tuned, but Not More Verifiable Instruction-Following: A Cross-Task Diagnosis for LoRA Adapters
arXiv:2603.22379v1 Announce Type: cross Abstract: Adapters are often selected and deployed based on nominal labels (e.g., instruction-tuned), which implicitly s
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Learning When to Act: Interval-Aware Reinforcement Learning with Predictive Temporal Structure
arXiv:2603.22384v1 Announce Type: cross Abstract: Autonomous agents operating in continuous environments must decide not only what to do, but when to act. We in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Latent Style-based Quantum Wasserstein GAN for Drug Design
arXiv:2603.22399v1 Announce Type: cross Abstract: The development of new drugs is a tedious, time-consuming, and expensive process, for which the average costs
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
CaP-X: A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation
arXiv:2603.22435v1 Announce Type: cross Abstract: "Code-as-Policy" considers how executable code can complement data-intensive Vision-Language-Action (VLA) meth
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs
arXiv:2603.22446v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has significantly improved reasoning in large language m
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
LLM-guided headline rewriting for clickability enhancement without clickbait
arXiv:2603.22459v1 Announce Type: cross Abstract: Enhancing reader engagement while preserving informational fidelity is a central challenge in controllable tex
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Stability-Preserving Online Adaptation of Neural Closed-loop Maps
arXiv:2603.22469v1 Announce Type: cross Abstract: The growing complexity of modern control tasks calls for controllers that can react online as objectives and d
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures
arXiv:2603.22473v1 Announce Type: cross Abstract: Hybrid language models combining attention with state space models (SSMs) or linear attention offer improved e
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy Games
arXiv:2603.22479v1 Announce Type: cross Abstract: Defining a constructive process to build general capabilities for language models in an automatic manner is co
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Tiny Inference-Time Scaling with Latent Verifiers
arXiv:2603.22492v1 Announce Type: cross Abstract: Inference-time scaling has emerged as an effective way to improve generative models at test time by using a ve
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Do Large Language Models Reduce Research Novelty? Evidence from Information Systems Journals
arXiv:2603.22510v1 Announce Type: cross Abstract: Large language models such as ChatGPT have increased scholarly output, but whether this productivity boost pro
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface
arXiv:2603.22519v1 Announce Type: cross Abstract: Textual Large Language Models (LLMs) provide a simple and familiar interface: a string of text is used for bot
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs
arXiv:2603.22528v1 Announce Type: cross Abstract: Large Language Models (LLMs) combined with Retrieval-Augmented Generation (RAG) and knowledge graphs offer new
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
arXiv:2603.22529v1 Announce Type: cross Abstract: Multimodal AI agents are increasingly automating complex real-world workflows that involve online web executio
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving
arXiv:2603.22577v1 Announce Type: cross Abstract: Large Language Models (LLMs) have demonstrated potential in code generation, yet they struggle with the multi-
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?
arXiv:2603.22582v1 Announce Type: cross Abstract: Chain-of-thought (CoT) reasoning has been proposed as a transparency mechanism for large language models in sa
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
flexvec: SQL Vector Retrieval with Programmatic Embedding Modulation
arXiv:2603.22587v1 Announce Type: cross Abstract: As AI agents become the primary consumers of retrieval APIs, there is an opportunity to expose more of the ret
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Language Models Can Explain Visual Features via Steering
arXiv:2603.22593v1 Announce Type: cross Abstract: Sparse Autoencoders uncover thousands of features in vision models, yet explaining these features without requ
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Do Consumers Accept AIs as Moral Compliance Agents?
arXiv:2603.22617v1 Announce Type: cross Abstract: Consumers are generally resistant to Artificial Intelligence (AI) involvement in moral decision-making, percei
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
To Agree or To Be Right? The Grounding-Sycophancy Tradeoff in Medical Vision-Language Models
arXiv:2603.22623v1 Announce Type: cross Abstract: Vision-language models (VLMs) adapted to the medical domain have shown strong performance on visual question a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
Learning to Trust: How Humans Mentally Recalibrate AI Confidence Signals
arXiv:2603.22634v1 Announce Type: cross Abstract: Productive human-AI collaboration requires appropriate reliance, yet contemporary AI systems are often miscali
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research
arXiv:2603.22648v1 Announce Type: cross Abstract: There are different goals for literature research, from understanding an unfamiliar topic to generate hypothes
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
PopResume: Causal Fairness Evaluation of LLM/VLM Resume Screeners with Population-Representative Dataset
arXiv:2603.22714v1 Announce Type: cross Abstract: We present PopResume, a population-representative resume dataset for causal fairness auditing of LLM- and VLM-
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
KALAVAI: Predicting When Independent Specialist Fusion Works -- A Quantitative Model for Post-Hoc Cooperative LLM Training
arXiv:2603.22755v1 Announce Type: cross Abstract: Independently trained domain specialists can be fused post-hoc into a single model that outperforms any indivi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona
arXiv:2603.22765v1 Announce Type: cross Abstract: Data scarcity remains a persistent challenge in low-resource domains. While existing data augmentation methods
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
From Overload to Convergence: Supporting Multi-Issue Human-AI Negotiation with Bayesian Visualization
arXiv:2603.22766v1 Announce Type: cross Abstract: As AI systems increasingly mediate negotiations, understanding how the number of negotiated issues impacts hum
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
1mo ago
From Arithmetic to Logic: The Resilience of Logic and Lookup-Based Neural Networks Under Parameter Bit-Flips
arXiv:2603.22770v1 Announce Type: cross Abstract: The deployment of deep neural networks (DNNs) in safety-critical edge environments necessitates robustness aga
DeepCamp AI