Core AI

Large Language Models

Deep dives into GPT, Claude, Gemini, Llama and the transformers powering modern AI

25,266

lessons

Skills in this topic

5 skills — Sign in to track your progress

View full skill map →

LLM Foundations

Explain how transformers generate text

Write zero-shot and few-shot prompts

LLM Engineering

Call LLM APIs with function/tool use

Fine-tuning LLMs

Prepare fine-tuning datasets

Multimodal LLMs

Use GPT-4V / Claude Vision for image understanding

Videos 19,496 Reads 5,770

Showing 5,770 reads from curated sources

Level: All Beginner Intermediate Advanced

Newest Popular Oldest

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Supercharged Searching on the 🤗 Hub

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Gradio is joining Hugging Face!

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

WebGPT: Improving the factual accuracy of language models through web browsing

We’ve fine-tuned GPT-3 to more accurately answer open-ended questions using a text-based web browser.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Perceiver IO: a scalable, fully-attentional model that works on any modality

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Customizing GPT-3 for your application

Fine-tune with a single command.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

OpenAI Residency

As part of our effort to support and develop AI talent, we’re excited to announce the OpenAI Residency.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Accelerating PyTorch distributed fine-tuning with Intel technologies

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

OpenAI’s API now available with no waitlist

Wider availability made possible by safety progress.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with 🤗 Transformers

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Solving math word problems

We’ve trained a system that solves grade school math problems with nearly twice the accuracy of a fine-tuned GPT-3 model. It solves about 90% as many problems a

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

The Age of Machine Learning As Code Has Arrived

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Fine tuning CLIP with Remote Sensing (Satellite) images and captions

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Hosting your Models and Datasets on Hugging Face Spaces using Streamlit

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Showcase Your Projects in Spaces using Gradio

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

How to Train Really Large Models on Many GPUs?

[Updated on 2022-03-13: add expert choice routing .] [U

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Summarizing books with human feedback

Scaling human oversight of AI systems for tasks that are difficult to evaluate.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Introducing Optimum: The Optimization Toolkit for Transformers at Scale

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Helen Toner joins OpenAI’s board of directors

Today, we’re excited to announce the appointment of Helen Toner to our board of directors.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

We’ve created an improved version of OpenAI Codex, our AI system that translates natural language to code, and we are releasing it through our API in private be

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Introducing Triton: Open-source GPU programming for neural networks

We’re releasing Triton 1.0, an open-source Python-like programming language which enables researchers with no CUDA experience to write highly efficient GPU code

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4y ago

After five years, Distill will be taking a break.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Improving language model behavior by training on a curated dataset

Our latest research finds we can improve language model behavior with respect to specific behavioral values by fine-tuning on a small, curated dataset.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Few-shot learning in practice: GPT-Neo and the 🤗 Accelerated Inference API

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 4y ago

Contrastive Representation Learning

The goal of contrastive representation learning is to learn such an embedding space in which similar sample pairs stay close to each other while dissimilar ones

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

OpenAI Scholars 2021: Final projects

We’re proud to announce that the 2021 class of OpenAI Scholars has completed our six-month mentorship program and have produced an open-source research project

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 4y ago

Adversarial Reprogramming of Neural Cellular Automata

Reprogramming Neural CA to exhibit novel behaviour, using adversarial attacks.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 4y ago

Will Hurd joins OpenAI’s board of directors

OpenAI is committed to developing general-purpose artificial intelligence that benefits all humanity, and we believe that achieving our goal requires expertise

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5y ago

Branch Specialization

When a neural network layer is divided into multiple branches, neurons self-organize into coherent groupings.

Weaviate Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Weaviate 1.2 release - transformer models

Weaviate v1.2 introduced support for transformers (DistilBERT, BERT, RoBERTa, Sentence-BERT, etc) to vectorize and semantically search through your data.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

GPT-3 powers the next generation of apps

Over 300 applications are delivering GPT-3–powered search, conversation, text completion, and other advanced AI features through our API.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

The Partnership: Amazon SageMaker and Hugging Face

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Reducing Toxicity in Language Models

Large pretrained language models are trained over a sizable collection of online data. They unavoidably acquire certain toxic behavior

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

Multimodal neurons in artificial neural networks

We’ve discovered neurons in CLIP that respond to the same concept whether presented literally, symbolically, or conceptually. This may explain CLIP’s accuracy i

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Simple considerations for simple people building fancy neural networks

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Retrieval Augmented Generation with Huggingface Transformers and Ray

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Hugging Face on PyTorch / XLA TPUs

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Faster TensorFlow models in Hugging Face Transformers

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

Organizational update from OpenAI

It’s been a year of dramatic change and growth at OpenAI.

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5y ago

Understanding RL Vision

With diverse environments, we can analyze, diagnose and edit deep reinforcement learning models using attribution.

Hugging Face Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Hyperparameter Search with Transformers and Ray Tune

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

OpenAI licenses GPT-3 technology to Microsoft

OpenAI has agreed to license GPT-3 to Microsoft for their own products and services.

Distill.pub 🧠 Large Language Models 📄 Paper ⚡ AI Lesson 5y ago

Thread: Differentiable Self-organizing Systems

A collection of articles and comments with the goal of understanding how to design robust and general purpose self-organizing systems.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

OpenAI Scholars 2020: Final projects

Our third class of OpenAI Scholars presented their final projects at virtual Demo Day, showcasing their research results from over the past five months.

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

Procgen and MineRL Competitions

We’re excited to announce that OpenAI is co-organizing two NeurIPS 2020 competitions with AIcrowd, Carnegie Mellon University, and DeepMind, using Procgen Bench

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

We’re releasing an API for accessing new AI models developed by OpenAI.

Lilian Weng's Blog 🧠 Large Language Models ⚡ AI Lesson 5y ago

Exploration Strategies in Deep Reinforcement Learning

[Updated on 2020-06-17: Add “exploration via disagreement” in the “Forward Dynamics” section . Exploitation versus ex

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 5y ago

AI and efficiency

We’re releasing an analysis showing that since 2012 the amount of compute needed to train a neural net to the same performance on ImageNet classification has be