Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

25,266
lessons
Skills in this topic
View full skill map →
LLM Foundations
beginner
Explain how transformers generate text
Prompt Craft
beginner
Write zero-shot and few-shot prompts
LLM Engineering
intermediate
Call LLM APIs with function/tool use
Fine-tuning LLMs
advanced
Prepare fine-tuning datasets
Multimodal LLMs
advanced
Use GPT-4V / Claude Vision for image understanding

Showing 5,770 reads from curated sources

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Supercharged Searching on the 🤗 Hub
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Gradio is joining Hugging Face!
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
WebGPT: Improving the factual accuracy of language models through web browsing
We’ve fine-tuned GPT-3 to more accurately answer open-ended questions using a text-based web browser.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Perceiver IO: a scalable, fully-attentional model that works on any modality
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Customizing GPT-3 for your application
Fine-tune with a single command.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
OpenAI Residency
As part of our effort to support and develop AI talent, we’re excited to announce the OpenAI Residency.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Accelerating PyTorch distributed fine-tuning with Intel technologies
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
OpenAI’s API now available with no waitlist
Wider availability made possible by safety progress.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with 🤗 Transformers
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Solving math word problems
We’ve trained a system that solves grade school math problems with nearly twice the accuracy of a fine-tuned GPT-3 model. It solves about 90% as many problems a
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
The Age of Machine Learning As Code Has Arrived
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Fine tuning CLIP with Remote Sensing (Satellite) images and captions
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Hosting your Models and Datasets on Hugging Face Spaces using Streamlit
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Showcase Your Projects in Spaces using Gradio
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
How to Train Really Large Models on Many GPUs?
[Updated on 2022-03-13: add expert choice routing .] [U
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Summarizing books with human feedback
Scaling human oversight of AI systems for tasks that are difficult to evaluate.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Introducing Optimum: The Optimization Toolkit for Transformers at Scale
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Helen Toner joins OpenAI’s board of directors
Today, we’re excited to announce the appointment of Helen Toner to our board of directors.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
OpenAI Codex
We’ve created an improved version of OpenAI Codex, our AI system that translates natural language to code, and we are releasing it through our API in private be
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Introducing Triton: Open-source GPU programming for neural networks
We’re releasing Triton 1.0, an open-source Python-like programming language which enables researchers with no CUDA experience to write highly efficient GPU code
Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4y ago
Distill Hiatus
After five years, Distill will be taking a break.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Improving language model behavior by training on a curated dataset
Our latest research finds we can improve language model behavior with respect to specific behavioral values by fine-tuning on a small, curated dataset.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Few-shot learning in practice: GPT-Neo and the 🤗 Accelerated Inference API
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago
Contrastive Representation Learning
The goal of contrastive representation learning is to learn such an embedding space in which similar sample pairs stay close to each other while dissimilar ones
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
OpenAI Scholars 2021: Final projects
We’re proud to announce that the 2021 class of OpenAI Scholars has completed our six-month mentorship program and have produced an open-source research project
Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4y ago
Adversarial Reprogramming of Neural Cellular Automata
Reprogramming Neural CA to exhibit novel behaviour, using adversarial attacks.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago
Will Hurd joins OpenAI’s board of directors
OpenAI is committed to developing general-purpose artificial intelligence that benefits all humanity, and we believe that achieving our goal requires expertise
Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5y ago
Branch Specialization
When a neural network layer is divided into multiple branches, neurons self-organize into coherent groupings.
Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Weaviate 1.2 release - transformer models
Weaviate v1.2 introduced support for transformers (DistilBERT, BERT, RoBERTa, Sentence-BERT, etc) to vectorize and semantically search through your data.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago
GPT-3 powers the next generation of apps
Over 300 applications are delivering GPT-3–powered search, conversation, text completion, and other advanced AI features through our API.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
The Partnership: Amazon SageMaker and Hugging Face
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Reducing Toxicity in Language Models
Large pretrained language models are trained over a sizable collection of online data. They unavoidably acquire certain toxic behavior
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago
Multimodal neurons in artificial neural networks
We’ve discovered neurons in CLIP that respond to the same concept whether presented literally, symbolically, or conceptually. This may explain CLIP’s accuracy i
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Simple considerations for simple people building fancy neural networks
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Retrieval Augmented Generation with Huggingface Transformers and Ray
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Hugging Face on PyTorch / XLA TPUs
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Faster TensorFlow models in Hugging Face Transformers
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Fit More and Train Faster With ZeRO via DeepSpeed and FairScale
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago
Organizational update from OpenAI
It’s been a year of dramatic change and growth at OpenAI.
Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5y ago
Understanding RL Vision
With diverse environments, we can analyze, diagnose and edit deep reinforcement learning models using attribution.
Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Hyperparameter Search with Transformers and Ray Tune
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago
OpenAI licenses GPT-3 technology to Microsoft
OpenAI has agreed to license GPT-3 to Microsoft for their own products and services.
Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5y ago
Thread: Differentiable Self-organizing Systems
A collection of articles and comments with the goal of understanding how to design robust and general purpose self-organizing systems.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago
OpenAI Scholars 2020: Final projects
Our third class of OpenAI Scholars presented their final projects at virtual Demo Day, showcasing their research results from over the past five months.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago
Procgen and MineRL Competitions
We’re excited to announce that OpenAI is co-organizing two NeurIPS 2020 competitions with AIcrowd, Carnegie Mellon University, and DeepMind, using Procgen Bench
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago
OpenAI API
We’re releasing an API for accessing new AI models developed by OpenAI.
Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago
Exploration Strategies in Deep Reinforcement Learning
[Updated on 2020-06-17: Add “exploration via disagreement” in the “Forward Dynamics” section . Exploitation versus ex
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago
AI and efficiency
We’re releasing an analysis showing that since 2012 the amount of compute needed to train a neural net to the same performance on ImageNet classification has be