Makes Workers AI and Hugging Face integration generally available; deploying serverless AI is now easier and more affordable than ever
Cloudflare, Inc. (NYSE: NET), the leading connectivity cloud company, today announced that developers can now deploy AI applications on Cloudflare’s global network in one simple click directly from Hugging Face, the leading open and collaborative platform for AI builders. With Workers AI now generally available, Cloudflare is the first serverless inference partner integrated on the Hugging Face Hub for deploying models, enabling developers to quickly, easily, and affordably deploy AI globally, without managing infrastructure or paying for unused compute capacity.
Despite significant strides in AI innovation, there is still a disconnect between its potential and the value it brings businesses. Organizations and their developers need to be able to experiment and iterate quickly and affordably, without having to set up, manage, or maintain GPUs or infrastructure. Businesses need a straightforward platform that unlocks speed, security, performance, observability, and compliance so they can bring innovative, production-ready applications to their customers faster.
“The recent generative AI boom has companies across industries investing massive amounts of time and money into AI. Some of it will work, but the real challenge of AI is that the demo is easy, but putting it into production is incredibly hard,” said Matthew Prince, CEO and co-founder, Cloudflare. “We can solve this by abstracting away the cost and complexity of building AI-powered apps. Workers AI is one of the most affordable and accessible solutions to run inference. And with Hugging Face and Cloudflare both deeply aligned in our efforts to democratize AI in a simple, affordable way, we’re giving developers the freedom and agility to choose a model and scale their AI apps from zero to global in an instant.”
Workers AI is generally available with GPUs now deployed in more than 150 cities globally
Today, Workers AI is generally available, providing the end-to-end infrastructure needed to scale and deploy AI models efficiently and affordably for the next era of AI applications. Cloudflare now has GPUs deployed across more than 150 cities globally, most recently launching in Cape Town, Durban, Johannesburg, and Lagos, the first such locations in Africa, as well as in Amman, Buenos Aires, Mexico City, Mumbai, New Delhi, and Seoul, to provide low-latency inference around the world. Workers AI is also expanding to support fine-tuned model weights, enabling organizations to build and deploy more specialized, domain-specific applications.
In addition to Workers AI, Cloudflare’s AI Gateway offers a control plane for AI applications, allowing developers to dynamically evaluate and route requests to different models and providers. Eventually, it is expected to enable developers to use data to create fine-tunes and run fine-tuning jobs directly on the Workers AI platform.
Cloudflare powers one-click deployment with Hugging Face
With Workers AI generally available, developers can now deploy AI models in one click directly from Hugging Face, the fastest way to access a variety of models and run inference requests on Cloudflare’s global network of GPUs. Developers can choose one of the popular open source models and then click “Deploy to Cloudflare Workers AI” to deploy it instantly. There are 14 curated Hugging Face models now optimized for Cloudflare’s global serverless inference platform, supporting three task categories: text generation, embeddings, and sentence similarity.
“We are excited to work with Cloudflare to make AI more accessible to developers,” said Julien Chaumond, co-founder and chief technology officer, Hugging Face. “Offering the most popular open models with a serverless API, powered by a global fleet of GPUs, is an amazing proposition for the Hugging Face community, and I can’t wait to see what they build with it.”
AI-first companies are building with Workers AI
Companies around the world trust Workers AI and Cloudflare’s global network to power their AI applications, including:
- “Talkmap helps customers uncover and surface real-time conversational intelligence and insights. With millions of customer conversations daily and the need for a fast turnaround for CX & EX outcomes, Cloudflare’s developer platform has helped us keep storage costs and latency low. We’ve selected Cloudflare to help us scale our generative AI service and simplify our runtime architecture so that we can stay focused on adding customer value for conversation insights in the contact center.” – Jonathan Eisenzopf, Founder and Chief Strategy & Research Officer, Talkmap
- “ChainFuse transforms unstructured data chaos into actionable insights, ensuring every piece of customer feedback, issue, and opportunity is heard and valued. Using products such as Workers AI, AI Gateway, and Vectorize, we have successfully analyzed and categorized over 50,000 unique conversations from places like Discord, Discourse, Twitter, G2, and more. Having access to 28 AI models for any task, and swapping them on the fly, allows us to be accurate and efficient at scale.” – George Portillo, co-founder, ChainFuse.com
- “Discourse.org is a modern, open-source discussion platform powering over 20,000 online communities from small hobby groups to forums for some of the largest companies worldwide. Discourse leverages Cloudflare’s Workers AI to run embedding models to power our popular ‘Related Topics’ feature. This produces relevant results within communities, giving community members new opportunities to find and engage with topics they are interested in. Workers AI is currently one of the affordable, open-source ways we can provide Related Topics using a high-performing embeddings model to give our customers an avenue to provide their community members with a new way to discover more relevant content and improve engagement.” – Saif Murtaza, AI Product Manager, Discourse.org
- “Simmer brings the swiping of dating apps to the recipe and cooking world, to bring couples together over a meal they both enjoy. Simmer has continually adopted Cloudflare products as the platform expands, and Workers AI was no exception; we use Workers AI embeddings and large language models, such as Mistral 7B, to help us create a personalized experience for users on the app, including curated recipes based on preferences. We go to Cloudflare first to explore if their products fit our use case since they’re so easy to work with. Using Cloudflare products also helps us save a lot on costs as we grow our startup.” – Ben Ankiel, CTO, Simmer
- “Audioflare uses AI to convert, examine, condense, and translate brief audio files into various languages. We heavily count on Workers AI for streamlining AI-related tasks including audio file processing, sentiment evaluation, language translation, and maintaining AI’s overall efficiency and dependability. We’re impressed with Cloudflare’s ability to simplify the backend operations of our app. We believe in Cloudflare’s consistent improvements and dedication, and feel confident about growing with their platform.” – Sean Oliver, creator of the open-source LLM repository, Audioflare
To learn more, please check out the resources below:
- Blog: Cloudflare’s Inference Platform is Generally Available
- Blog: Running fine-tuned models on Workers AI with LoRAs
- Learn more at ai.cloudflare.com and developers.cloudflare.com/ai
- Two million developers are now building on Cloudflare’s developer platform
- Cloudflare was named to Fast Company’s list of the World’s Most Innovative Companies of 2024 for its innovative approach to democratizing how developers build AI-powered applications
About Cloudflare
Cloudflare, Inc. (NYSE: NET) is the leading connectivity cloud company on a mission to help build a better Internet. It empowers organizations to make their employees, applications and networks faster and more secure everywhere, while reducing complexity and cost. Cloudflare’s connectivity cloud delivers the most full-featured, unified platform of cloud-native products and developer tools, so any organization can gain the control they need to work, develop, and accelerate their business.
Powered by one of the world’s largest and most interconnected networks, Cloudflare blocks billions of threats online for its customers every day. It is trusted by millions of organizations – from the largest brands to entrepreneurs and small businesses to nonprofits, humanitarian groups, and governments across the globe.
Learn more about Cloudflare’s connectivity cloud at cloudflare.com/connectivity-cloud. Learn more about the latest Internet trends and insights at https://radar.cloudflare.com.
Forward-Looking Statements
This press release contains forward-looking statements within the meaning of Section 27A of the Securities Act of 1933, as amended, and Section 21E of the Securities Exchange Act of 1934, as amended, which statements involve substantial risks and uncertainties. In some cases, you can identify forward-looking statements because they contain words such as “may,” “will,” “should,” “expect,” “explore,” “plan,” “anticipate,” “could,” “intend,” “target,” “project,” “contemplate,” “believe,” “estimate,” “predict,” “potential,” or “continue,” or the negative of these words, or other similar terms or expressions that concern Cloudflare’s expectations, strategy, plans, or intentions. However, not all forward-looking statements contain these identifying words. Forward-looking statements expressed or implied in this press release include, but are not limited to, statements regarding the capabilities and effectiveness of Workers AI and Cloudflare’s other products and technology, the benefits to Cloudflare’s customers from using Workers AI and Cloudflare’s other products and technology, Cloudflare’s partnership with Hugging Face and the potential resulting benefits to Cloudflare customers, the potential benefits to customers of integrating Cloudflare and Hugging Face products, the potential opportunity for Cloudflare to attract additional customers and to expand sales to existing customers through Cloudflare’s partnership and product integrations with Hugging Face, Cloudflare’s technological development, future operations, growth, initiatives, or strategies, and comments made by Cloudflare’s CEO and others. 
Actual results could differ materially from those stated or implied in forward-looking statements due to a number of factors, including but not limited to, risks detailed in Cloudflare’s filings with the Securities and Exchange Commission (SEC), including Cloudflare’s Annual Report on Form 10-K filed on February 21, 2024, as well as other filings that Cloudflare may make from time to time with the SEC.
The forward-looking statements made in this press release relate only to events as of the date on which the statements are made. Cloudflare undertakes no obligation to update any forward-looking statements made in this press release to reflect events or circumstances after the date of this press release or to reflect new information or the occurrence of unanticipated events, except as required by law. Cloudflare may not actually achieve the plans, intentions, or expectations disclosed in Cloudflare’s forward-looking statements, and you should not place undue reliance on Cloudflare’s forward-looking statements.
© 2024 Cloudflare, Inc. All rights reserved. Cloudflare, the Cloudflare logo, and other Cloudflare marks are trademarks and/or registered trademarks of Cloudflare, Inc. in the U.S. and other jurisdictions. All other marks and names referenced herein may be trademarks of their respective owners.
View source version on businesswire.com: https://www.businesswire.com/news/home/20240402560109/en/
Contacts
Cloudflare, Inc.
Daniella Vallurupalli
Vice President, Head of Global Communications
press@cloudflare.com

