Visual Document Retrieval Goes Multilingual

📰 Hugging Face Blog

Hugging Face introduces multilingual visual document retrieval, enabling search across languages

advanced Published 10 Jan 2025
Action Steps
  1. Explore the Hugging Face blog post on multilingual visual document retrieval
  2. Review the usage and training dataset sections
  3. Investigate the evaluations and cross-lingual retrieval capabilities
Who Needs to Know This

Machine learning engineers and researchers can use this technology to improve document retrieval systems, while product managers can use it to enhance the user experience.

Key Insight

💡 Multilingual visual document retrieval enables search across languages, breaking language barriers

Full Article

Published Time: 2025-01-10T00:00:00.507Z

# Visual Document Retrieval Goes Multilingual

Published January 10, 2025

* [Marco Cimolai](https://huggingface.co/marco) ([llamaindex](https://huggingface.co/llamaindex))
* [Logan Markewich](https://huggingface.co/cheesyFishes) ([llamaindex](https://huggingface.co/llamaindex))

* [Usage](https://huggingface.co/blog/vdr-2b-multilingual#usage)
* [Training Dataset](https://huggingface.co/blog/vdr-2b-multilingual#training-dataset)
  * [Data Gathering](https://huggingface.co/blog/vdr-2b-multilingual#data-gathering)
  * [Synthetic Generation](https://huggingface.co/blog/vdr-2b-multilingual#synthetic-generation)
  * [Filtering and Hard-Negative Mining](https://huggingface.co/blog/vdr-2b-multilingual#filtering-and-hard-negative-mining)
  * [Download](https://huggingface.co/blog/vdr-2b-multilingual#download)
* [Evaluations](https://huggingface.co/blog/vdr-2b-multilingual#evaluations)
  * [Faster Inference](https://huggingface.co/blog/vdr-2b-multilingual#faster-inference)
  * [Cross-Lingual Retrieval](https://huggingface.co/blog/vdr-2b-multilingual#cross-lingual-retrieval)