Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

24,697 lessons
Skills in this topic
LLM Foundations (beginner): Explain how transformers generate text
Prompt Craft (beginner): Write zero-shot and few-shot prompts
LLM Engineering (intermediate): Call LLM APIs with function/tool use
Fine-tuning LLMs (advanced): Prepare fine-tuning datasets
Multimodal LLMs (advanced): Use GPT-4V / Claude Vision for image understanding

Showing 5,255 reads from curated sources

ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement
arXiv:2604.01591v2 Announce Type: replace Abstract: We introduce ThinkTwice, a simple two-phase framework that jointly optimizes LLMs to solve reasoning problem
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models
arXiv:2407.14971v3 Announce Type: replace-cross Abstract: Vision-Language Models (VLMs) rely heavily on pretrained vision encoders to support downstream tasks s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
TransAgent: Enhancing LLM-Based Code Translation via Fine-Grained Execution Alignment
arXiv:2409.19894v5 Announce Type: replace-cross Abstract: Code translation transforms code between programming languages while preserving functionality, which i
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Cobblestone: A Divide-and-Conquer Approach for Automating Formal Verification
arXiv:2410.19940v4 Announce Type: replace-cross Abstract: Formal verification using proof assistants, such as Coq, is an effective way of improving software qua
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
arXiv:2410.20791v3 Announce Type: replace-cross Abstract: The rapid expansion of foundation models (FMs), such as large language models (LLMs), has given rise t
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models
arXiv:2411.05961v2 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) are central to Visual Question Answering (VQA) systems and are typically
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Retrieval Augmented Time Series Forecasting
arXiv:2411.08249v2 Announce Type: replace-cross Abstract: Retrieval-augmented generation (RAG) is a central component of modern LLM systems, particularly in sce
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
ENTER: Event Based Interpretable Reasoning for VideoQA
arXiv:2501.14194v2 Announce Type: replace-cross Abstract: In this paper, we present ENTER, an interpretable Video Question Answering (VideoQA) system based on e
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
arXiv:2502.17421v3 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) can now process extremely long contexts, efficient inference over thes
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Hedging and Non-Affirmation: Quantifying LLM Alignment on Questions of Human Rights
arXiv:2502.19463v2 Announce Type: replace-cross Abstract: Hedging and non-affirmation are behaviors exhibited by large language models (LLMs) that limit the cle
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
NativQA Framework: Enabling LLMs and VLMs with Native, Local, and Everyday Knowledge
arXiv:2504.05995v3 Announce Type: replace-cross Abstract: The rapid progress of large language models (LLMs) raises concerns about cultural bias, fairness, and
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Phonetic Perturbations Reveal Tokenizer-Rooted Safety Gaps in LLMs
arXiv:2505.14226v5 Announce Type: replace-cross Abstract: Safety-aligned LLMs remain vulnerable to digital phenomena like textese that introduce non-canonical p
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Synthesis of discrete-continuous quantum circuits with multimodal diffusion models
arXiv:2506.01666v3 Announce Type: replace-cross Abstract: Efficiently compiling quantum operations remains a major bottleneck in scaling quantum computing. Toda
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
HeartcareGPT: A Unified Multimodal ECG Suite for Dual Signal-Image Modeling and Understanding
arXiv:2506.05831v4 Announce Type: replace-cross Abstract: Although electrocardiograms (ECG) play a dominant role in cardiovascular diagnosis and treatment, thei
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
A Survey of Continual Reinforcement Learning
arXiv:2506.21872v2 Announce Type: replace-cross Abstract: Reinforcement Learning (RL) is an important machine learning paradigm for solving sequential decision-
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Enhancing Hallucination Detection via Future Context
arXiv:2507.20546v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are widely used to generate plausible text on online platforms, without r
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization
arXiv:2509.17183v2 Announce Type: replace-cross Abstract: Alignment plays a crucial role in Large Language Models (LLMs) in aligning with human preferences on a
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
A State-Update Prompting Strategy for Efficient and Robust Multi-turn Dialogue
arXiv:2509.17766v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) struggle with information forgetting and inefficiency in long-horizon, mu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Dissecting Transformers: A CLEAR Perspective towards Green AI
arXiv:2510.02810v2 Announce Type: replace-cross Abstract: The rapid adoption of Large Language Models (LLMs) has raised significant environmental concerns. Unli
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Reveal-to-Revise: Explainable Bias-Aware Generative Modeling with Multimodal Attention
arXiv:2510.12957v3 Announce Type: replace-cross Abstract: We present an explainable, bias-aware generative framework that unifies cross-modal attention fusion,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Unlocking the Potential of Diffusion Language Models through Template Infilling
arXiv:2510.13870v2 Announce Type: replace-cross Abstract: Diffusion Language Models (DLMs) have emerged as a promising alternative to Autoregressive Language Mo
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Knowledge Reasoning Language Model: Unifying Knowledge and Language for Inductive Knowledge Graph Reasoning
arXiv:2510.13909v2 Announce Type: replace-cross Abstract: Inductive Knowledge Graph Reasoning (KGR) aims to discover facts in open-domain KGs containing unknown
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
RLAIF-SPA: Structured AI Feedback for Semantic-Prosodic Alignment in Speech Synthesis
arXiv:2510.14628v2 Announce Type: replace-cross Abstract: Recent advances in Text-To-Speech (TTS) synthesis have achieved near-human speech quality in neutral s
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Fairness Evaluation and Inference Level Mitigation in LLMs
arXiv:2510.18914v3 Announce Type: replace-cross Abstract: Large language models often display undesirable behaviors embedded in their internal representations,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Routing-Based Continual Learning for Multimodal Large Language Models
arXiv:2511.01831v3 Announce Type: replace-cross Abstract: Multimodal Large Language Models (MLLMs) struggle with continual learning, often suffering from catast
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Developing and Evaluating a Large Language Model-Based Automated Feedback System Grounded in Evidence-Centered Design for Supporting Physics Problem Solving
arXiv:2512.10785v2 Announce Type: replace-cross Abstract: Generative AI offers new opportunities for individualized and adaptive learning, e.g., through large l
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Automatic Replication of LLM Mistakes in Medical Conversations
arXiv:2512.20983v2 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly evaluated in clinical settings using multi-dimensional r
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation
arXiv:2512.23994v2 Announce Type: replace-cross Abstract: Text-to-audio-video (T2AV) generation is central to applications such as filmmaking and world modeling
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
arXiv:2601.02978v2 Announce Type: replace-cross Abstract: Recent work in Mechanistic Interpretability (MI) has enabled the identification and intervention of in
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation
arXiv:2601.03054v4 Announce Type: replace-cross Abstract: Recent research on medical MLLMs has gradually shifted its focus from image-level understanding to fin
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency
arXiv:2601.05905v2 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) are increasingly deployed in real-world settings, correctness alone is
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Frame of Reference: Addressing the Challenges of Common Ground Representation in Situational Dialogs
arXiv:2601.09365v2 Announce Type: replace-cross Abstract: Common ground plays a critical role in situated spoken dialogs, where interlocutors must establish and
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching
arXiv:2601.11652v2 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) become increasingly accessible to end users, an ever-growing number of
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Why Can't I Open My Drawer? Mitigating Object-Driven Shortcuts in Zero-Shot Compositional Action Recognition
arXiv:2601.16211v2 Announce Type: replace-cross Abstract: Zero-Shot Compositional Action Recognition (ZS-CAR) requires recognizing novel verb-object combination
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities
arXiv:2602.00185v2 Announce Type: replace-cross Abstract: The integration of large language models (LLMs) into materials science offers a transformative opportu
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Do Schwartz Higher-Order Values Help Sentence-Level Human Value Detection? A Study of Hierarchical Gating and Calibration
arXiv:2602.00913v3 Announce Type: replace-cross Abstract: Human value detection from single sentences is a sparse, imbalanced multi-label task. We study whether
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs
arXiv:2602.12705v4 Announce Type: replace-cross Abstract: We present MedXIAOHE, a medical vision-language foundation model designed to advance general-purpose m
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Weight space Detection of Backdoors in LoRA Adapters
arXiv:2602.15195v3 Announce Type: replace-cross Abstract: LoRA adapters let users fine-tune large language models (LLMs) efficiently. However, LoRA adapters are
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Agora: Teaching the Skill of Consensus-Finding with AI Personas Grounded in Human Voice
arXiv:2603.07339v3 Announce Type: replace-cross Abstract: Deliberative democratic theory suggests that civic competence: the capacity to navigate disagreement,
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Not All Latent Spaces Are Flat: Hyperbolic Concept Control
arXiv:2603.14093v3 Announce Type: replace-cross Abstract: As modern text-to-image (T2I) models draw closer to synthesizing highly realistic content, the threat
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization
arXiv:2603.16105v2 Announce Type: replace-cross Abstract: Post-training model compression is essential for enhancing the portability of Large Language Models (L
ArXiv cs.AI 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 2w ago
Moral Mazes in the Era of LLMs
arXiv:2603.20231v2 Announce Type: replace-cross Abstract: Navigating complex social situations is an integral part of corporate life, ranging from giving critic
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
I Built a Personal Second Brain with Markdown Files and Claude Code — Here's How
The Inspiration I saw Andrej Karpathy's viral post about using LLMs to build personal knowledge bases — no vector database, no chunking pipeline. Just markdown
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
LLMKube Now Deploys Any Inference Engine, Not Just llama.cpp
LLMKube started as a Kubernetes operator for llama.cpp. You define a Model, define an InferenceService, and the controller handles GPU scheduling, health probes
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
I benchmarked GPT-4o, Claude 3.5, and Gemini 1.5 for security — the results
We all know LLMs can be tricked. Prompt injection, jailbreaks, PII leakage — these aren't theoretical anymore. They're happening in production. But here's the t
Dev.to AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
The AI landscape is experiencing unprecedented growth and transformation. This post delves into the key developments shaping the future of artificial intelligen
Hackernoon 🧠 Large Language Models ⚡ AI Lesson 2w ago
GLM-4.7-Flash-GGUF Brings Fast Local AI to Consumer Hardware
GLM-4.7-Flash-GGUF offers fast local text generation with multiple quantization options for PCs, edge devices, and small servers.
TechCrunch AI 🧠 Large Language Models ⚡ AI Lesson 2w ago
I can’t help rooting for tiny open source AI model maker Arcee
Arcee is a tiny 26-person U.S. startup that built a high-performing, massive, open source LLM. And it's gaining popularity with OpenClaw users.