Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,759 lessons

Skills in this topic

5 skills

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,450 Reads 5,309

Showing 5,309 reads from curated sources

The Next Web AI 🧠 Large Language Models ⚡ AI Lesson 3w ago

Bluesky’s new Attie app uses AI to give you full control over your social feed

The standalone app, built on the AT Protocol and powered by Anthropic’s Claude, was unveiled at the ATmosphere conference by Jay Graber, who stepped back from B

Forbes Innovation 🧠 Large Language Models ⚡ AI Lesson 3w ago

The AI Factory: What It Is And Why Every CEO Should Care

AI factories are emerging as the model for building, deploying and improving AI at scale, and they could become a major source of competitive advantage for comp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional Environments

arXiv:2603.25747v1 Announce Type: new Abstract: The rapid evolution of Large Multimodal Models (LMMs) has enabled agents to perform complex digital and physical

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

AutoB2G: A Large Language Model-Driven Agentic Framework For Automated Building-Grid Co-Simulation

arXiv:2603.26005v1 Announce Type: new Abstract: The growing availability of building operational data motivates the use of reinforcement learning (RL), which ca

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

GUIDE: Resolving Domain Bias in GUI Agents through Real-Time Web Video Retrieval and Plug-and-Play Annotation

arXiv:2603.26266v1 Announce Type: new Abstract: Large vision-language models have endowed GUI agents with strong general capabilities for interface understandin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

AIRA_2: Overcoming Bottlenecks in AI Research Agents

arXiv:2603.26499v1 Announce Type: new Abstract: Existing research has identified three structural performance bottlenecks in AI research agents: (1) synchronous

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

CADSmith: Multi-Agent CAD Generation with Programmatic Geometric Validation

arXiv:2603.26512v1 Announce Type: new Abstract: Existing methods for text-to-CAD generation either operate in a single pass with no geometric verification or re

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Stabilizing Rubric Integration Training via Decoupled Advantage Normalization

arXiv:2603.26535v1 Announce Type: new Abstract: We propose Process-Aware Policy Optimization (PAPO), a method that integrates process-level evaluation into Grou

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models

arXiv:2603.25750v1 Announce Type: cross Abstract: As the paradigm of AI shifts from text-based LLMs to Speech Language Models (SLMs), there is a growing demand

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Consistency Amplifies: How Behavioral Variance Shapes Agent Accuracy

arXiv:2603.25764v1 Announce Type: cross Abstract: As LLM-based agents are deployed in production systems, understanding their behavioral consistency (whether th

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models

arXiv:2603.25766v1 Announce Type: cross Abstract: The integration of Vision-Language-Action (VLA) models into autonomous driving systems offers a unified framew

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

UCAgent: An End-to-End Agent for Block-Level Functional Verification

arXiv:2603.25768v1 Announce Type: cross Abstract: Functional verification remains a critical bottleneck in modern IC development cycles, accounting for approxim

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

IncreRTL: Traceability-Guided Incremental RTL Generation under Requirement Evolution

arXiv:2603.25769v1 Announce Type: cross Abstract: Large language models (LLMs) have shown promise in generating RTL code from natural-language descriptions, but

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

ReCUBE: Evaluating Repository-Level Context Utilization in Code Generation

arXiv:2603.25770v1 Announce Type: cross Abstract: Large Language Models (LLMs) have recently emerged as capable coding assistants that operate over large codeba

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Empowering Epidemic Response: The Role of Reinforcement Learning in Infectious Disease Control

arXiv:2603.25771v1 Announce Type: cross Abstract: Reinforcement learning (RL), owing to its adaptability to various dynamic systems in many real-world scenarios

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Beyond identifiability: Learning causal representations with few environments and finite samples

arXiv:2603.25796v1 Announce Type: cross Abstract: We provide explicit, finite-sample guarantees for learning causal representations from data with a sublinear n

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

MAGNET: Autonomous Expert Model Generation via Decentralized Autoresearch and BitNet Training

arXiv:2603.25813v1 Announce Type: cross Abstract: We present MAGNET (Model Autonomously Growing Network), a decentralized system for autonomous generation, trai

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

arXiv:2603.25823v1 Announce Type: cross Abstract: Beneath the stunning visual fidelity of modern AIGC models lies a "logical desert", where systems fail tasks t

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

A Compression Perspective on Simplicity Bias

arXiv:2603.25839v1 Announce Type: cross Abstract: Deep neural networks exhibit a simplicity bias, a well-documented tendency to favor simple functions over comp

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

GazeQwen: Lightweight Gaze-Conditioned LLM Modulation for Streaming Video Understanding

arXiv:2603.25841v1 Announce Type: cross Abstract: Current multimodal large language models (MLLMs) cannot effectively utilize eye-gaze information for video und

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Why Safety Probes Catch Liars But Miss Fanatics

arXiv:2603.25861v1 Announce Type: cross Abstract: Activation-based probes have emerged as a promising approach for detecting deceptively aligned AI systems by i

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks

arXiv:2603.25864v1 Announce Type: cross Abstract: Graphical User Interface (GUI) agents have the potential to assist users in interacting with complex software

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

On Integrating Resilience and Human Oversight into LLM-Assisted Modeling Workflows for Digital Twins

arXiv:2603.25898v1 Announce Type: cross Abstract: LLM-assisted modeling holds the potential to rapidly build executable Digital Twins of complex systems from on

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Good Scores, Bad Data: A Metric for Multimodal Coherence

arXiv:2603.25924v1 Announce Type: cross Abstract: Multimodal AI systems are evaluated by downstream task accuracy, but high accuracy does not mean the underlyin

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

DiReCT: Disentangled Regularization of Contrastive Trajectories for Physics-Refined Video Generation

arXiv:2603.25931v1 Announce Type: cross Abstract: Flow-matching video generators produce temporally coherent, high-fidelity outputs yet routinely violate elemen

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Reinforcing Structured Chain-of-Thought for Video Understanding

arXiv:2603.25942v1 Announce Type: cross Abstract: Multi-modal Large Language Models (MLLMs) show promise in video understanding. However, their reasoning often

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models

arXiv:2603.25960v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly deployed in medical settings, yet their sensitivity to prompt fo

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Policy-Guided World Model Planning for Language-Conditioned Visual Navigation

arXiv:2603.25981v1 Announce Type: cross Abstract: Navigating to a visually specified goal given natural language instructions remains a fundamental challenge in

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants

arXiv:2603.26008v1 Announce Type: cross Abstract: While powerful in image-conditioned generation, multimodal large language models (MLLMs) can display uneven pe

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

H-Node Attack and Defense in Large Language Models

arXiv:2603.26045v1 Announce Type: cross Abstract: We present H-Node Adversarial Noise Cancellation (H-Node ANC), a mechanistic framework that identifies, exploi

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

MuDD: A Multimodal Deception Detection Dataset and GSR-Guided Progressive Distillation for Non-Contact Deception Detection

arXiv:2603.26064v1 Announce Type: cross Abstract: Non-contact automatic deception detection remains challenging because visual and auditory deception cues often

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

When Identities Collapse: A Stress-Test Benchmark for Multi-Subject Personalization

arXiv:2603.26078v1 Announce Type: cross Abstract: Subject-driven text-to-image diffusion models have achieved remarkable success in preserving single identities

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Selective Deficits in LLM Mental Self-Modeling in a Behavior-Based Test of Theory of Mind

arXiv:2603.26089v1 Announce Type: cross Abstract: The ability to represent oneself and others as agents with knowledge, intentions, and belief states that guide

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning

arXiv:2603.26098v1 Announce Type: cross Abstract: While self-supervised learning (SSL) has revolutionized audio representation, the excessive parameterization a

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

"Oops! ChatGPT is Temporarily Unavailable!": A Diary Study on Knowledge Workers' Experiences of LLM Withdrawal

arXiv:2603.26099v1 Announce Type: cross Abstract: LLMs have become deeply embedded in knowledge work, raising concerns about growing dependency and the potentia

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SkinGPT-X: A Self-Evolving Collaborative Multi-Agent System for Transparent and Trustworthy Dermatological Diagnosis

arXiv:2603.26122v1 Announce Type: cross Abstract: While recent advancements in Large Language Models have significantly advanced dermatological diagnosis, monol

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Finding Distributed Object-Centric Properties in Self-Supervised Transformers

arXiv:2603.26127v1 Announce Type: cross Abstract: Self-supervised Vision Transformers (ViTs) like DINO show an emergent ability to discover objects, typically o

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

SWE-PRBench: Benchmarking AI Code Review Quality Against Pull Request Feedback

arXiv:2603.26130v1 Announce Type: cross Abstract: We introduce SWE-PRBench, a benchmark of 350 pull requests with human-annotated ground truth for evaluating AI

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Sparse Auto-Encoders and Holism about Large Language Models

arXiv:2603.26207v1 Announce Type: cross Abstract: Does Large Language Model (LLM) technology suggest a meta-semantic picture i.e. a picture of how words and com

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Towards GUI Agents: Vision-Language Diffusion Models for GUI Grounding

arXiv:2603.26211v1 Announce Type: cross Abstract: Autoregressive (AR) vision-language models (VLMs) have long dominated multimodal understanding, reasoning, and

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Clawed and Dangerous: Can We Trust Open Agentic Systems?

arXiv:2603.26221v1 Announce Type: cross Abstract: Open agentic systems combine LLM-based planning with external capabilities, persistent memory, and privileged

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Automating Domain-Driven Design: Experience with a Prompting Framework

arXiv:2603.26244v1 Announce Type: cross Abstract: Domain-driven design (DDD) is a powerful design technique for architecting complex software systems. This pape

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Physics-Informed Neural Networks and Sequence Encoder: Application to heating and early cooling of thermo-stamping process

arXiv:2603.26245v1 Announce Type: cross Abstract: In a previous work (Elaarabi et al., 2025b), the Sequence Encoder for online dynamical system identification (

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

ARTA: Adaptive Mixed-Resolution Token Allocation for Efficient Dense Feature Extraction

arXiv:2603.26258v1 Announce Type: cross Abstract: We present ARTA, a mixed-resolution coarse-to-fine vision transformer for efficient dense feature extraction.

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Working Notes on Late Interaction Dynamics: Analyzing Targeted Behaviors of Late Interaction Models

arXiv:2603.26259v1 Announce Type: cross Abstract: While Late Interaction models exhibit strong retrieval performance, many of their underlying dynamics remain u

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Knowdit: Agentic Smart Contract Vulnerability Detection with Auditing Knowledge Summarization

arXiv:2603.26270v1 Announce Type: cross Abstract: Smart contracts govern billions of dollars in decentralized finance (DeFi), yet automated vulnerability detect

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

PhysVid: Physics Aware Local Conditioning for Generative Video Models

arXiv:2603.26285v1 Announce Type: cross Abstract: Generative video models achieve high visual fidelity but often violate basic physical principles, limiting rel

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 3w ago

Preference-Aligned LoRA Merging: Preserving Subspace Coverage and Addressing Directional Anisotropy

arXiv:2603.26299v1 Announce Type: cross Abstract: Merging multiple Low-Rank Adaptation (LoRA) modules is promising for constructing general-purpose systems, yet