Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

25,266
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,770 reads from curated sources

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Powering virtual education for the classroom
Khan Academy explores the potential for GPT-4 in a limited pilot program.
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
HNSW+PQ - Exploring ANN algorithms Part 2.1
Implementing HNSW + Product Quantization (PQ) vector compression in Weaviate.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Planning for AGI and beyond
Our mission is to ensure that artificial general intelligence—AI systems that are generally smarter than humans—benefits all of humanity.
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Combining LangChain and Weaviate
LangChain is one of the most exciting new tools in AI. It helps overcome many limitations of LLMs, such as hallucination and limited input lengths.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
How should AI systems behave, and who should decide?
We’re clarifying how ChatGPT’s behavior is shaped and our plans for improving that behavior, allowing more user customization, and getting more public input int
Introducing LoRA: A faster way to fine-tune Stable Diffusion
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Introducing LoRA: A faster way to fine-tune Stable Diffusion
It's like DreamBooth, but much faster. And you can run it in the cloud on Replicate.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Introducing ChatGPT Plus
We’re launching a pilot subscription plan for ChatGPT, a conversational AI that can chat with you, answer follow-up questions, and challenge incorrect assumptio
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
New AI classifier for indicating AI-written text
We’re launching a classifier trained to distinguish between AI-written and human-written text.
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
The Transformer Family Version 2.0
Many new Transformer architecture improvements have been proposed since my last post on “The Transformer Family” about three years ago. Here I did a
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
OpenAI and Microsoft extend partnership
We’re happy to announce that OpenAI and Microsoft are extending our partnership.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
3D Asset Generation: AI for Game Development #3
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Welcome PaddlePaddle to the Hugging Face Hub
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Forecasting potential misuses of language models for disinformation campaigns and how to reduce risk
OpenAI researchers collaborated with Georgetown University’s Center for Security and Emerging Technology and the Stanford Internet Observatory to investigate ho
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Large Transformer Model Inference Optimization
[Updated on 2023-01-24: add a small section on Distillation .] Large transformer models are mainstream nowadays, creating SoTA results for a variety of tasks. T
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Delivering nuanced insights from customer feedback
Using GPT-3 to deliver fast, nuanced insights from customer feedback.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Fine-tuning GPT-3 to scale video creation
Fine-tuning GPT-3 to power and scale done-for-you video creation.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Creating next-gen characters
Using GPT-3 to create the next generation of AI-powered characters.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
The power of continuous learning
Lilian Weng works on Applied AI Research at OpenAI.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
New and improved embedding model
We are excited to announce a new embedding model which is significantly more capable, cost effective, and simpler to use.
Microsoft AI Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
A conversation with Kevin Scott: What’s next in AI
The post A conversation with Kevin Scott: What’s next in AI appeared first on The AI Blog .
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
The Sphere Dataset in Weaviate
Learn how to import and query the Sphere dataset in Weaviate!
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Deep Learning with Proteins
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Using Stable Diffusion with Core ML on Apple Silicon
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Introducing ChatGPT
We’ve trained a model called ChatGPT which interacts in a conversational way. The dialogue format makes it possible for ChatGPT to answer followup questions, ad
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
An overview of inference solutions on Hugging Face
Train and deploy a DreamBooth model on Replicate
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Train and deploy a DreamBooth model on Replicate
With just a handful of images and a single API call, you can train a model, publish it to Replicate, and run predictions on it in the cloud.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Hugging Face Machine Learning Demos on arXiv
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Sentiment Analysis on Encrypted Data with Homomorphic Encryption
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
DALL·E API now available in public beta
Starting today, developers can begin building apps with the DALL·E API.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Accelerate your models with 🤗 Optimum Intel and OpenVINO
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Weaviate 1.16 release
Weaviate 1.16 introduces New Filter Operators, Distributed Backups, Centroid Module, Node Status API, Azure-based OIDC, and more. Lear all about it.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
🧨 Stable Diffusion in JAX / Flax !
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Optimization story: Bloom inference
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
DALL·E now available without waitlist
New users can start creating straight away. Lessons learned from deployment and improvements to our safety systems make wider availability possible.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
How 🤗 Accelerate runs very large models thanks to PyTorch
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Support for Hugging Face Inference API in Weaviate
Running ML Model Inference in production is hard. You can use Weaviate – a vector database – with Hugging Face Inference module to delegate the heavy lifting.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
SetFit: Efficient Few-Shot Learning Without Prompts
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Some Math behind Neural Tangent Kernel
Neural networks are well known to be over-parameterized and can often easily fit data with near-zero training loss with decent generalization performance on tes
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Train your first Decision Transformer
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
How to train a Language Model with Megatron-LM
Run Stable Diffusion on your M1 Mac’s GPU
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Run Stable Diffusion on your M1 Mac’s GPU
How to run Stable Diffusion locally so you can hack on it
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Research Insights – Learning to Retrieve Passages without Supervision
Self-Supervised Retrieval can surpass BM25 and Supervised techniques. This technique also pairs very well alongside BM25 in Hybrid Retrieval. Learn more about i
Run Stable Diffusion with an API
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Run Stable Diffusion with an API
How to use Replicate to integrate Stable Diffusion into hacks, apps, and projects
Build a robot artist for your Discord server with Stable Diffusion, Replicate, and Fly.io
Replicate Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Build a robot artist for your Discord server with Stable Diffusion, Replicate, and Fly.io
A tutorial for building a chat bot that replies to prompts with the output of a text-to-image model.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 3y ago
Our approach to alignment research
We are improving our AI systems’ ability to learn from human feedback and to assist humans at evaluating AI. Our goal is to build a sufficiently aligned AI syst
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Visualize proteins on Hugging Face Spaces
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 3y ago
Stable Diffusion with 🧨 Diffusers