Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

25,266

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,496 Reads 5,770

Showing 5,770 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Powering virtual education for the classroom

Khan Academy explores the potential for GPT-4 in a limited pilot program.

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

HNSW+PQ - Exploring ANN algorithms Part 2.1

Implementing HNSW + Product Quantization (PQ) vector compression in Weaviate.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Planning for AGI and beyond

Our mission is to ensure that artificial general intelligence—AI systems that are generally smarter than humans—benefits all of humanity.

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Combining LangChain and Weaviate

LangChain is one of the most exciting new tools in AI. It helps overcome many limitations of LLMs, such as hallucination and limited input lengths.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

How should AI systems behave, and who should decide?

We’re clarifying how ChatGPT’s behavior is shaped and our plans for improving that behavior, allowing more user customization, and getting more public input int

Introducing LoRA: A faster way to fine-tune Stable Diffusion

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Introducing LoRA: A faster way to fine-tune Stable Diffusion

It's like DreamBooth, but much faster. And you can run it in the cloud on Replicate.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Introducing ChatGPT Plus

We’re launching a pilot subscription plan for ChatGPT, a conversational AI that can chat with you, answer follow-up questions, and challenge incorrect assumptio

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

New AI classifier for indicating AI-written text

We’re launching a classifier trained to distinguish between AI-written and human-written text.

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

The Transformer Family Version 2.0

Many new Transformer architecture improvements have been proposed since my last post on “The Transformer Family” about three years ago. Here I did a

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

OpenAI and Microsoft extend partnership

We’re happy to announce that OpenAI and Microsoft are extending our partnership.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

3D Asset Generation: AI for Game Development #3

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Welcome PaddlePaddle to the Hugging Face Hub

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Forecasting potential misuses of language models for disinformation campaigns and how to reduce risk

OpenAI researchers collaborated with Georgetown University’s Center for Security and Emerging Technology and the Stanford Internet Observatory to investigate ho

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Large Transformer Model Inference Optimization

[Updated on 2023-01-24: add a small section on Distillation .] Large transformer models are mainstream nowadays, creating SoTA results for a variety of tasks. T

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Delivering nuanced insights from customer feedback

Using GPT-3 to deliver fast, nuanced insights from customer feedback.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Fine-tuning GPT-3 to scale video creation

Fine-tuning GPT-3 to power and scale done-for-you video creation.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Creating next-gen characters

Using GPT-3 to create the next generation of AI-powered characters.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

The power of continuous learning

Lilian Weng works on Applied AI Research at OpenAI.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

New and improved embedding model

We are excited to announce a new embedding model which is significantly more capable, cost effective, and simpler to use.

Microsoft AI Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

A conversation with Kevin Scott: What’s next in AI

The post A conversation with Kevin Scott: What’s next in AI appeared first on The AI Blog .

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

The Sphere Dataset in Weaviate

Learn how to import and query the Sphere dataset in Weaviate!

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Deep Learning with Proteins

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Using Stable Diffusion with Core ML on Apple Silicon

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Introducing ChatGPT

We’ve trained a model called ChatGPT which interacts in a conversational way. The dialogue format makes it possible for ChatGPT to answer followup questions, ad

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

An overview of inference solutions on Hugging Face

Train and deploy a DreamBooth model on Replicate

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Train and deploy a DreamBooth model on Replicate

With just a handful of images and a single API call, you can train a model, publish it to Replicate, and run predictions on it in the cloud.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Hugging Face Machine Learning Demos on arXiv

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Sentiment Analysis on Encrypted Data with Homomorphic Encryption

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

DALL·E API now available in public beta

Starting today, developers can begin building apps with the DALL·E API.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Accelerate your models with 🤗 Optimum Intel and OpenVINO

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Weaviate 1.16 release

Weaviate 1.16 introduces New Filter Operators, Distributed Backups, Centroid Module, Node Status API, Azure-based OIDC, and more. Lear all about it.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

🧨 Stable Diffusion in JAX / Flax !

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Optimization story: Bloom inference

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

DALL·E now available without waitlist

New users can start creating straight away. Lessons learned from deployment and improvements to our safety systems make wider availability possible.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

How 🤗 Accelerate runs very large models thanks to PyTorch

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Support for Hugging Face Inference API in Weaviate

Running ML Model Inference in production is hard. You can use Weaviate – a vector database – with Hugging Face Inference module to delegate the heavy lifting.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

SetFit: Efficient Few-Shot Learning Without Prompts

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Some Math behind Neural Tangent Kernel

Neural networks are well known to be over-parameterized and can often easily fit data with near-zero training loss with decent generalization performance on tes

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Train your first Decision Transformer

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

How to train a Language Model with Megatron-LM

Run Stable Diffusion on your M1 Mac’s GPU

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Run Stable Diffusion on your M1 Mac’s GPU

How to run Stable Diffusion locally so you can hack on it

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Research Insights – Learning to Retrieve Passages without Supervision

Self-Supervised Retrieval can surpass BM25 and Supervised techniques. This technique also pairs very well alongside BM25 in Hybrid Retrieval. Learn more about i

Run Stable Diffusion with an API

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Run Stable Diffusion with an API

How to use Replicate to integrate Stable Diffusion into hacks, apps, and projects

Build a robot artist for your Discord server with Stable Diffusion, Replicate, and Fly.io

Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Build a robot artist for your Discord server with Stable Diffusion, Replicate, and Fly.io

A tutorial for building a chat bot that replies to prompts with the output of a text-to-image model.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago

Our approach to alignment research

We are improving our AI systems’ ability to learn from human feedback and to assist humans at evaluating AI. Our goal is to build a sufficiently aligned AI syst

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Visualize proteins on Hugging Face Spaces

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago

Stable Diffusion with 🧨 Diffusers