Bringing serverless GPU inference to Hugging Face users
📰 Hugging Face Blog
Hugging Face integrates serverless GPU inference with Cloudflare Workers AI for easy model deployment
Action Steps
- Explore the Hugging Face Hub for available models
- Deploy models using Cloudflare Workers AI for serverless GPU inference
- Monitor and optimize model performance using Cloudflare's edge data centers
Who Needs to Know This
AI engineers and data scientists can benefit from this integration to easily deploy models as serverless APIs, while product managers can leverage this to improve model scalability and performance
Key Insight
💡 Serverless GPU inference enables scalable and performant model deployment without managing infrastructure
Share This
🚀 Hugging Face + Cloudflare Workers AI: Easy serverless GPU inference for open models!
Key Takeaways
Hugging Face integrates serverless GPU inference with Cloudflare Workers AI for easy model deployment
Full Article
Published Time: 2024-04-02T00:00:00.362Z
# Bringing serverless GPU inference to Hugging Face users
[Hugging Face](https://huggingface.co/)
* [Models](https://huggingface.co/models)
* [Datasets](https://huggingface.co/datasets)
* [Spaces](https://huggingface.co/spaces)
* [Buckets new](https://huggingface.co/storage)
* [Docs](https://huggingface.co/docs)
* [Enterprise](https://huggingface.co/enterprise)
* [Pricing](https://huggingface.co/pricing)
*
*
* * *
* [Log In](https://huggingface.co/login)
* [Sign Up](https://huggingface.co/join)
[Back to Articles](https://huggingface.co/blog)
# [](https://huggingface.co/blog/cloudflare-workers-ai#bringing-serverless-gpu-inference-to-hugging-face-users) Bringing serverless GPU inference to Hugging Face users
Published April 2, 2024
[Update on GitHub](https://github.com/huggingface/blog/blob/main/cloudflare-workers-ai.md)
[- [x] Upvote 11](https://huggingface.co/login?next=%2Fblog%2Fcloudflare-workers-ai)
* [](https://huggingface.co/julien-c "julien-c")
* [](https://huggingface.co/eloukas "eloukas")
* [](https://huggingface.co/adamm-hf "adamm-hf")
* [](https://huggingface.co/chenglu "chenglu")
* [](https://huggingface.co/derek-thomas "derek-thomas")
* [](https://huggingface.co/chenxinfeng "chenxinfeng")
* +5
[](https://huggingface.co/philschmid)
[Philipp Schmid philschmid Follow](https://huggingface.co/philschmid)
[](https://huggingface.co/jeffboudier)
[Jeff Boudier jeffboudier Follow](https://huggingface.co/jeffboudier)
[](https://huggingface.co/rita3ko)
[Rita Kozlov rita3ko Follow](https://huggingface.co/rita3ko)
guest
[](https://huggingface.co/nkothariCF)
[Nikhil Kothari nkothariCF Follow](https://huggingface.co/nkothariCF)
guest
Update (November 2024): The integration is no longer available. Please switch to the Hugging Face Inference API, Inference Endpoints, or other deployment options for your AI model needs.
* [Generative AI for Developers](https://huggingface.co/blog/cloudflare-workers-ai#generative-ai-for-developers "Generative AI for Developers")
* [How it works](https://huggingface.co/blog/cloudflare-workers-ai#how-it-works "How it works")
* [We’re just getting started](https://huggingface.co/blog/cloudflare-workers-ai#were-just-getting-started "We’re just getting started")
Today, we are thrilled to announce the launch of **Deploy on Cloudflare Workers AI**, a new integration on the Hugging Face Hub. Deploy on Cloudflare Workers AI makes using open models as a serverless API easy, powered by state-of-the-art GPUs deployed in Cloudflare edge data centers. Starting today, we are integrating some of the most popular open models on Hugging Face into Cloudflare Workers
# Bringing serverless GPU inference to Hugging Face users
[Hugging Face](https://huggingface.co/)
* [Models](https://huggingface.co/models)
* [Datasets](https://huggingface.co/datasets)
* [Spaces](https://huggingface.co/spaces)
* [Buckets new](https://huggingface.co/storage)
* [Docs](https://huggingface.co/docs)
* [Enterprise](https://huggingface.co/enterprise)
* [Pricing](https://huggingface.co/pricing)
*
*
* * *
* [Log In](https://huggingface.co/login)
* [Sign Up](https://huggingface.co/join)
[Back to Articles](https://huggingface.co/blog)
# [](https://huggingface.co/blog/cloudflare-workers-ai#bringing-serverless-gpu-inference-to-hugging-face-users) Bringing serverless GPU inference to Hugging Face users
Published April 2, 2024
[Update on GitHub](https://github.com/huggingface/blog/blob/main/cloudflare-workers-ai.md)
[- [x] Upvote 11](https://huggingface.co/login?next=%2Fblog%2Fcloudflare-workers-ai)
* [](https://huggingface.co/julien-c "julien-c")
* [](https://huggingface.co/eloukas "eloukas")
* [](https://huggingface.co/adamm-hf "adamm-hf")
* [](https://huggingface.co/chenglu "chenglu")
* [](https://huggingface.co/derek-thomas "derek-thomas")
* [](https://huggingface.co/chenxinfeng "chenxinfeng")
* +5
[](https://huggingface.co/philschmid)
[Philipp Schmid philschmid Follow](https://huggingface.co/philschmid)
[](https://huggingface.co/jeffboudier)
[Jeff Boudier jeffboudier Follow](https://huggingface.co/jeffboudier)
[](https://huggingface.co/rita3ko)
[Rita Kozlov rita3ko Follow](https://huggingface.co/rita3ko)
guest
[](https://huggingface.co/nkothariCF)
[Nikhil Kothari nkothariCF Follow](https://huggingface.co/nkothariCF)
guest
Update (November 2024): The integration is no longer available. Please switch to the Hugging Face Inference API, Inference Endpoints, or other deployment options for your AI model needs.
* [Generative AI for Developers](https://huggingface.co/blog/cloudflare-workers-ai#generative-ai-for-developers "Generative AI for Developers")
* [How it works](https://huggingface.co/blog/cloudflare-workers-ai#how-it-works "How it works")
* [We’re just getting started](https://huggingface.co/blog/cloudflare-workers-ai#were-just-getting-started "We’re just getting started")
Today, we are thrilled to announce the launch of **Deploy on Cloudflare Workers AI**, a new integration on the Hugging Face Hub. Deploy on Cloudflare Workers AI makes using open models as a serverless API easy, powered by state-of-the-art GPUs deployed in Cloudflare edge data centers. Starting today, we are integrating some of the most popular open models on Hugging Face into Cloudflare Workers
DeepCamp AI