Accelerating Document AI

📰 Hugging Face Blog

Unlock document knowledge with open-source models and custom solutions for Document AI

Intermediate | Published 21 Nov 2022

Action Steps

Identify the type of document and task at hand
Choose the best open-source model for that task (for example, image classification or document question answering)
Use the model to build a custom solution for free
Explore use cases and resources for Document AI

Who Needs to Know This

Data scientists and developers can use open-source models to build custom Document AI solutions that make knowledge locked in documents available to digital workflows.

Key Insight

💡 Open-source models can be used to build custom solutions for Document AI, unlocking inaccessible knowledge in documents

Full Article

# Accelerating Document AI


Published November 21, 2022


By [Rajiv Shah](https://huggingface.co/rajistics), [Niels Rogge](https://huggingface.co/nielsr), [Florent Gbelidji](https://huggingface.co/florentgbelidji), and [Nicholas Broad](https://huggingface.co/nbroad)

* [Use Cases](https://huggingface.co/blog/document-ai#use-cases)
* [Next Steps](https://huggingface.co/blog/document-ai#next-steps)
* [Resources](https://huggingface.co/blog/document-ai#resources)

Enterprises are full of documents containing knowledge that isn't accessible to digital workflows. These documents range from letters, invoices, forms, and reports to receipts. With the improvements in text, vision, and multimodal AI, it's now possible to unlock that information. This post shows how your teams can use open-source models to build custom solutions for free!

Document AI covers many data science tasks, including [image classification](https://huggingface.co/tasks/image-classification), [image to text](https://huggingface.co/tasks/image-to-text), [document question answering](https://huggingface.co/tasks/document-question-answering), [table question answering](https://huggingface.co/tasks/table-question-answering), and [visual question answering](https://huggingface.co/tasks/visual-question-answering). This post starts with a taxonomy of use cases within Document AI and the best open-source models for those use cases.
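To make the task list concrete, here is a minimal Python sketch that collects the five task slugs from the links above and shows one way a team might turn a slug into a model pipeline. The names `DOCUMENT_AI_TASKS`, `task_url`, and `load_pipeline` are hypothetical helpers, not part of any library, and the one-line task descriptions are informal glosses rather than text from the post. The sketch assumes the `transformers` library is installed and that these slugs are accepted as `pipeline` task names in your installed version, which is worth verifying before relying on it.

```python
from typing import Optional

# Base URL of the task pages linked in the post.
TASKS_BASE_URL = "https://huggingface.co/tasks"

# Task slugs taken from the links in the post. The descriptions are
# informal glosses added for illustration (an assumption, not source text).
DOCUMENT_AI_TASKS = {
    "image-classification": "assign a category to a document image",
    "image-to-text": "generate text from a document image",
    "document-question-answering": "answer a question about a document image",
    "table-question-answering": "answer a question about a table",
    "visual-question-answering": "answer a question about an image",
}


def task_url(task: str) -> str:
    """Return the task page URL for a known Document AI task slug."""
    if task not in DOCUMENT_AI_TASKS:
        raise ValueError(f"Unknown Document AI task: {task!r}")
    return f"{TASKS_BASE_URL}/{task}"


def load_pipeline(task: str, model: Optional[str] = None):
    """Build a transformers pipeline for the given task (hypothetical helper).

    The import is deferred so the rest of this module works without
    transformers installed. When `model` is None, the library picks its
    own default checkpoint for the task, which requires network access
    on first use.
    """
    task_url(task)  # reuse the validation; raises ValueError if unknown
    from transformers import pipeline

    return pipeline(task, model=model)


if __name__ == "__main__":
    # Print each task with its description and task page.
    for slug, description in DOCUMENT_AI_TASKS.items():
        print(f"{slug}: {description} ({task_url(slug)})")
```

Calling `load_pipeline("document-question-answering")` would download a default checkpoint chosen by the library; depending on the model, extra dependencies such as an OCR backend may also be needed. Passing an explicit `model` identifier makes the choice reproducible.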
