Core AI
Large Language Models
Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI
Skills in this topic
5 skills — Sign in to track your progress
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding
Showing 5,133 reads from curated sources
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning
arXiv:2604.00790v1 Announce Type: new Abstract: While large language models (LLMs) have demonstrated strong performance on complex reasoning tasks such as compe
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models
arXiv:2604.00890v1 Announce Type: new Abstract: Geometric Problem Solving (GPS) remains at the heart of enhancing mathematical reasoning in large language model
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Experience as a Compass: Multi-agent RAG with Evolving Orchestration and Agent Prompts
arXiv:2604.00901v1 Announce Type: new Abstract: Multi-agent Retrieval-Augmented Generation (RAG), wherein each agent takes on a specific role, supports hard que
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
PsychAgent: An Experience-Driven Lifelong Learning Agent for Self-Evolving Psychological Counselor
arXiv:2604.00931v1 Announce Type: new Abstract: Existing methods for AI psychological counselors predominantly rely on supervised fine-tuning using static dialo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
OmniMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory
arXiv:2604.01007v1 Announce Type: new Abstract: AI agents increasingly operate over extended time horizons, yet their ability to retain, organize, and recall mu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Adversarial Moral Stress Testing of Large Language Models
arXiv:2604.01108v1 Announce Type: new Abstract: Evaluating the ethical robustness of large language models (LLMs) deployed in software systems remains challengi
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Detecting Multi-Agent Collusion Through Multi-Agent Interpretability
arXiv:2604.01151v1 Announce Type: new Abstract: As LLM agents are increasingly deployed in multi-agent systems, they introduce risks of covert coordination that
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Therefore I am. I Think
arXiv:2604.01202v1 Announce Type: new Abstract: We consider the question: when a large language reasoning model makes a choice, did it think first and then deci
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Two-Stage Optimizer-Aware Online Data Selection for Large Language Models
arXiv:2604.00001v1 Announce Type: cross Abstract: Gradient-based data selection offers a principled framework for estimating sample utility in large language mo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Benchmark for Assessing Olfactory Perception of Large Language Models
arXiv:2604.00002v1 Announce Type: cross Abstract: Here we introduce the Olfactory Perception (OP) benchmark, designed to assess the capability of large language
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
A Reliability Evaluation of Hybrid Deterministic-LLM Based Approaches for Academic Course Registration PDF Information Extraction
arXiv:2604.00003v1 Announce Type: cross Abstract: This study evaluates the reliability of information extraction approaches from KRS documents using three strat
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
LinearARD: Linear-Memory Attention Distillation for RoPE Restoration
arXiv:2604.00004v1 Announce Type: cross Abstract: The extension of context windows in Large Language Models is typically facilitated by scaling positional encod
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Dynin-Omni: Omnimodal Unified Large Diffusion Language Model
arXiv:2604.00007v1 Announce Type: cross Abstract: We present Dynin-Omni, the first masked-diffusion-based omnimodal foundation model that unifies text, image, a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
How Trustworthy Are LLM-as-Judge Ratings for Interpretive Responses? Implications for Qualitative Research Workflows
arXiv:2604.00008v1 Announce Type: cross Abstract: As qualitative researchers show growing interest in using automated tools to support interpretive analysis, a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Eyla: Toward an Identity-Anchored LLM Architecture with Integrated Biological Priors -- Vision, Implementation Attempt, and Lessons from AI-Assisted Development
arXiv:2604.00009v1 Announce Type: cross Abstract: We present the design rationale, implementation attempt, and failure analysis of Eyla, a proposed identity-anc
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Can LLMs Perceive Time? An Empirical Investigation
arXiv:2604.00010v1 Announce Type: cross Abstract: Large language models cannot estimate how long their own tasks take. We investigate this limitation through fo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Quantifying Gender Bias in Large Language Models: When ChatGPT Becomes a Hiring Manager
arXiv:2604.00011v1 Announce Type: cross Abstract: The growing prominence of large language models (LLMs) in daily life has heightened concerns that LLMs exhibit
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Finding and Reactivating Post-Trained LLMs' Hidden Safety Mechanisms
arXiv:2604.00012v1 Announce Type: cross Abstract: Despite the impressive performance of general-purpose large language models (LLMs), they often require fine-tu
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
MSA-Thinker: Discrimination-Calibration Reasoning with Hint-Guided Reinforcement Learning for Multimodal Sentiment Analysis
arXiv:2604.00013v1 Announce Type: cross Abstract: Multimodal sentiment analysis aims to understand human emotions by integrating textual, auditory, and visual m
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Are they human? Detecting large language models by probing human memory constraints
arXiv:2604.00016v1 Announce Type: cross Abstract: The validity of online behavioral research relies on study participants being human rather than machine. In th
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Think Twice Before You Write -- an Entropy-based Decoding Strategy to Enhance LLM Reasoning
arXiv:2604.00018v1 Announce Type: cross Abstract: Decoding strategies play a central role in shaping the reasoning ability of large language models (LLMs). Trad
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
The Chronicles of RiDiC: Generating Datasets with Controlled Popularity Distribution for Long-form Factuality Evaluation
arXiv:2604.00019v1 Announce Type: cross Abstract: We present a configurable pipeline for generating multilingual sets of entities with specified characteristics
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
How Do Language Models Process Ethical Instructions? Deliberation, Consistency, and Other-Recognition Across Four Models
arXiv:2604.00021v1 Announce Type: cross Abstract: Alignment safety research assumes that ethical instructions improve model behavior, but how language models in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Criterion Validity of LLM-as-Judge for Business Outcomes in Conversational Commerce
arXiv:2604.00022v1 Announce Type: cross Abstract: Multi-dimensional rubric-based dialogue evaluation is widely used to assess conversational AI, yet its criteri
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
WHBench: Evaluating Frontier LLMs with Expert-in-the-Loop Validation on Women's Health Topics
arXiv:2604.00024v1 Announce Type: cross Abstract: Large language models are increasingly used for medical guidance, but women's health remains under-evaluated i
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Brevity Constraints Reverse Performance Hierarchies in Language Models
arXiv:2604.00025v1 Announce Type: cross Abstract: Standard evaluation protocols reveal a counterintuitive phenomenon: on 7.7% of benchmark problems spanning fiv
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
"Who Am I, and Who Else Is Here?" Behavioral Differentiation Without Role Assignment in Multi-Agent LLM Systems
arXiv:2604.00026v1 Announce Type: cross Abstract: When multiple large language models interact in a shared conversation, do they develop differentiated social r
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
When and Where: A Model Hippocampal Network Unifies Formation of Time Cells and Place Cells
arXiv:2604.00036v1 Announce Type: cross Abstract: Hippocampal place and time cells encode spatial and temporal aspects of experience. Both have the same neural
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Task-Centric Personalized Federated Fine-Tuning of Language Models
arXiv:2604.00050v1 Announce Type: cross Abstract: Federated Learning (FL) has emerged as a promising technique for training language models on distributed and p
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
The Energy Footprint of LLM-Based Environmental Analysis: LLMs and Domain Products
arXiv:2604.00053v1 Announce Type: cross Abstract: As large language models (LLMs) are increasingly used in domain-specific applications, including climate chang
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
GenoBERT: A Language Model for Accurate Genotype Imputation
arXiv:2604.00058v1 Announce Type: cross Abstract: Genotype imputation enables dense variant coverage for genome-wide association and risk-prediction studies, ye
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Temporal Memory for Resource-Constrained Agents: Continual Learning via Stochastic Compress-Add-Smooth
arXiv:2604.00067v1 Announce Type: cross Abstract: An agent that operates sequentially must incorporate new experience without forgetting old experience, under a
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Brain MR Image Synthesis with Multi-contrast Self-attention GAN
arXiv:2604.00070v1 Announce Type: cross Abstract: Accurate and complete multi-modal Magnetic Resonance Imaging (MRI) is essential for neuro-oncological assessme
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Learning to Play Blackjack: A Curriculum Learning Perspective
arXiv:2604.00076v1 Announce Type: cross Abstract: Reinforcement Learning (RL) agents often struggle with efficiency and performance in complex environments. We
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Hierarchical Pre-Training of Vision Encoders with Large Language Models
arXiv:2604.00086v1 Announce Type: cross Abstract: The field of computer vision has experienced significant advancements through scalable vision encoders and mul
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Oblivion: Self-Adaptive Agentic Memory Control through Decay-Driven Activation
arXiv:2604.00131v1 Announce Type: cross Abstract: Human memory adapts through selective forgetting: experiences become less accessible over time but can be reac
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Explainable AI for Blind and Low-Vision Users: Navigating Trust, Modality, and Interpretability in the Agentic Era
arXiv:2604.00187v1 Announce Type: cross Abstract: Explainable Artificial Intelligence (XAI) is critical for ensuring trust and accountability, yet its developme
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
QUEST: A robust attention formulation using query-modulated spherical attention
arXiv:2604.00199v1 Announce Type: cross Abstract: The Transformer model architecture has become one of the most widely used in deep learning and the attention m
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Diversity-Aware Reverse Kullback-Leibler Divergence for Large Language Model Distillation
arXiv:2604.00223v1 Announce Type: cross Abstract: Reverse Kullback-Leibler (RKL) divergence has recently emerged as the preferred objective for large language m
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
MAC-Attention: a Match-Amend-Complete Scheme for Fast and Accurate Attention Computation
arXiv:2604.00235v1 Announce Type: cross Abstract: Long-context decoding in LLMs is IO-bound: each token re-reads an ever-growing KV cache. Prior accelerations c
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
REM-CTX: Automated Peer Review via Reinforcement Learning with Auxiliary Context
arXiv:2604.00248v1 Announce Type: cross Abstract: Most automated peer review systems rely on textual manuscript content alone, leaving visual elements such as f
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Hierarchical Apprenticeship Learning from Imperfect Demonstrations with Evolving Rewards
arXiv:2604.00258v1 Announce Type: cross Abstract: While apprenticeship learning has shown promise for inducing effective pedagogical policies directly from stud
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
LLM Essay Scoring Under Holistic and Analytic Rubrics: Prompt Effects and Bias
arXiv:2604.00259v1 Announce Type: cross Abstract: Despite growing interest in using Large Language Models (LLMs) for educational assessment, it remains unclear
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Hybrid Energy-Based Models for Physical AI: Provably Stable Identification of Port-Hamiltonian Dynamics
arXiv:2604.00277v1 Announce Type: cross Abstract: Energy-based models (EBMs) implement inference as gradient descent on a learned Lyapunov function, yielding in
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
The Geometry of Compromise: Unlocking Generative Capabilities via Controllable Modality Alignment
arXiv:2604.00279v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) such as CLIP learn a shared embedding space for images and text, yet their repre
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Asymmetric Actor-Critic for Multi-turn LLM Agents
arXiv:2604.00304v1 Announce Type: cross Abstract: Large language models (LLMs) exhibit strong reasoning and conversational abilities, but ensuring reliable beha
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Robust Multimodal Safety via Conditional Decoding
arXiv:2604.00310v1 Announce Type: cross Abstract: Multimodal large-language models (MLLMs) often experience degraded safety alignment when harmful queries explo
ArXiv cs.AI
🧠 Large Language Models
📄 Paper
⚡ AI Lesson
3w ago
Prompt-Guided Prefiltering for VLM Image Compression
arXiv:2604.00314v1 Announce Type: cross Abstract: The rapid progress of large Vision-Language Models (VLMs) has enabled a wide range of applications, such as im
DeepCamp AI