Bringing serverless GPU inference to Hugging Face users

📰 Hugging Face Blog

Hugging Face integrates serverless GPU inference with Cloudflare Workers AI for easy model deployment

intermediate Published 2 Apr 2024
Action Steps
  1. Explore the Hugging Face Hub for available models
  2. Deploy models using Cloudflare Workers AI for serverless GPU inference
  3. Monitor and optimize model performance using Cloudflare's edge data centers
Who Needs to Know This

AI engineers and data scientists can benefit from this integration to easily deploy models as serverless APIs, while product managers can leverage this to improve model scalability and performance

Key Insight

💡 Serverless GPU inference enables scalable and performant model deployment without managing infrastructure

Share This
🚀 Hugging Face + Cloudflare Workers AI: Easy serverless GPU inference for open models!

Key Takeaways

Hugging Face integrates serverless GPU inference with Cloudflare Workers AI for easy model deployment

Full Article

Published Time: 2024-04-02T00:00:00.362Z

# Bringing serverless GPU inference to Hugging Face users

[![Image 1: Hugging Face's logo](https://huggingface.co/front/assets/huggingface_logo-noborder.svg)Hugging Face](https://huggingface.co/)

* [Models](https://huggingface.co/models)
* [Datasets](https://huggingface.co/datasets)
* [Spaces](https://huggingface.co/spaces)
* [Buckets new](https://huggingface.co/storage)
* [Docs](https://huggingface.co/docs)
* [Enterprise](https://huggingface.co/enterprise)
* [Pricing](https://huggingface.co/pricing)
*
*
* * *

* [Log In](https://huggingface.co/login)
* [Sign Up](https://huggingface.co/join)

[Back to Articles](https://huggingface.co/blog)

# [](https://huggingface.co/blog/cloudflare-workers-ai#bringing-serverless-gpu-inference-to-hugging-face-users) Bringing serverless GPU inference to Hugging Face users

Published April 2, 2024

[Update on GitHub](https://github.com/huggingface/blog/blob/main/cloudflare-workers-ai.md)

[- [x] Upvote 11](https://huggingface.co/login?next=%2Fblog%2Fcloudflare-workers-ai)
* [![Image 2](https://cdn-avatars.huggingface.co/v1/production/uploads/5dd96eb166059660ed1ee413/NQtzmrDdbG0H8qkZvRyGk.jpeg)](https://huggingface.co/julien-c "julien-c")
* [![Image 3](https://cdn-avatars.huggingface.co/v1/production/uploads/621b497944b048c1df6526e6/2akhrJummj872nRUZEWXj.png)](https://huggingface.co/eloukas "eloukas")
* [![Image 4](https://cdn-avatars.huggingface.co/v1/production/uploads/6340651b388c3fa40f9a5bc0/vM3rB17pUNT11MUhYqfFY.png)](https://huggingface.co/adamm-hf "adamm-hf")
* [![Image 5](https://cdn-avatars.huggingface.co/v1/production/uploads/1675652727502-63765e6b2361581ceb232cc8.jpeg)](https://huggingface.co/chenglu "chenglu")
* [![Image 6](https://cdn-avatars.huggingface.co/v1/production/uploads/638eb5f949de7ae552dd6211/mJkQJGpn9tXV37N2VLFCh.jpeg)](https://huggingface.co/derek-thomas "derek-thomas")
* [![Image 7](https://huggingface.co/avatars/a98ca1de4f607cc3fe0441eeaa0bfe17.svg)](https://huggingface.co/chenxinfeng "chenxinfeng")
* +5

[![Image 8: Philipp Schmid's avatar](https://cdn-avatars.huggingface.co/v1/production/uploads/1624629516652-5ff5d596f244529b3ec0fb89.png)](https://huggingface.co/philschmid)

[Philipp Schmid philschmid Follow](https://huggingface.co/philschmid)

[![Image 9: Jeff Boudier's avatar](https://cdn-avatars.huggingface.co/v1/production/uploads/1605114051380-noauth.jpeg)](https://huggingface.co/jeffboudier)

[Jeff Boudier jeffboudier Follow](https://huggingface.co/jeffboudier)

[![Image 10: Rita Kozlov's avatar](https://huggingface.co/avatars/a022460e9db28ddf363e65ce3171453b.svg)](https://huggingface.co/rita3ko)

[Rita Kozlov rita3ko Follow](https://huggingface.co/rita3ko)

guest

[![Image 11: Nikhil Kothari's avatar](https://cdn-avatars.huggingface.co/v1/production/uploads/65416f1741676ceaa2c14c58/PJMka1bOP20Jk3ZnnG7us.jpeg)](https://huggingface.co/nkothariCF)

[Nikhil Kothari nkothariCF Follow](https://huggingface.co/nkothariCF)

guest

Update (November 2024): The integration is no longer available. Please switch to the Hugging Face Inference API, Inference Endpoints, or other deployment options for your AI model needs.

* [Generative AI for Developers](https://huggingface.co/blog/cloudflare-workers-ai#generative-ai-for-developers "Generative AI for Developers")

* [How it works](https://huggingface.co/blog/cloudflare-workers-ai#how-it-works "How it works")

* [We’re just getting started](https://huggingface.co/blog/cloudflare-workers-ai#were-just-getting-started "We’re just getting started")

Today, we are thrilled to announce the launch of **Deploy on Cloudflare Workers AI**, a new integration on the Hugging Face Hub. Deploy on Cloudflare Workers AI makes using open models as a serverless API easy, powered by state-of-the-art GPUs deployed in Cloudflare edge data centers. Starting today, we are integrating some of the most popular open models on Hugging Face into Cloudflare Workers
Read full article → ← Back to Reads

Related Videos

5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
How To Use Google Omni | Real AI Avatar Videos Kaise Banaye | Full Tutorial
How To Use Google Omni | Real AI Avatar Videos Kaise Banaye | Full Tutorial
Digital Marketing Guruji
What exactly is a diffusion language model?
What exactly is a diffusion language model?
Vizuara
AI Named the 2026 FIFA World Cup Winner (Shocking Prediction)
AI Named the 2026 FIFA World Cup Winner (Shocking Prediction)
AI Master
Our vibe coded projects that actually work | The Vergecast
Our vibe coded projects that actually work | The Vergecast
The Verge
5 Insane Claude Cowork Use Cases That Feel Illegal
5 Insane Claude Cowork Use Cases That Feel Illegal
Charlie Chang