How modern search engines work – Vector databases explained! | Weaviate open-source

AI Coffee Break with Letitia · Beginner ·🔍 RAG & Vector Search ·4y ago

Skills: RAG Basics90%Vector Stores80%RAG Evaluation70%Advanced RAG60%

Key Takeaways

The video explains how modern search engines work, focusing on vector databases and their ability to organize data based on context, with tools like Weaviate and Transformers, and concepts such as high-dimensional vector space and contextual relationships. It also covers the functionality of vector search engines and the role of Transformer models in vectorization.

Full Transcript

hi there i've been active in what you might call the field of artificial intelligence for a while now of course miss coffee bean knew everything about the technology behind ai-based search engines but i honestly didn't have any deeper thoughts about what it meant to actually implement a search engine ever but i was approached by semi technologies that made an open source vector search engine called we v8 a vector what yes exactly in this video we will explain the general mechanism of modern search engines it's especially important to know if i'm ever going to make my private custom search engine for my super secret miss coffee bean data starting from the data we will go through and explain the most important steps transforming the database into a vector database with a machine learning model like fast text or transformers then querying the database thanks to semi technologies for sponsoring this educational video miss coffee bean are you ready to do the explaining bits okay let's go search engines are the algorithms helping you search over large sets of documents or other kinds of data and do it fast these documents can be either from the internet or the documents a company or organization has on their private servers when search engines first appeared they were not the clever ai-powered beasts of today we'll get to those in a minute the first search engines were relatively simple implementations of full text based search this is very similar to how we look something up in the dictionary if you searched for computer the algorithm behind the search engine went and matched strings to find occurrences of the word computer on the internet or text database there were clever tricks for example to generalize from computers to computer word inflections got removed or simple word distances were employed that you also see in spell checkers like the lievenstein metric that measures the minimum number of single character edits needed to change one word into the other but you see there are many reasons to let these search engines evolve into something smarter what if with computer i do not search for a pc or mac or a compute cluster but what was originally meant by the word computer a human person usually a woman that used to do computations before digital computers were invented in this ambiguous case how should the search algorithm know which meaning of the word i intended taking the most frequent meaning would make the search engine insensitive to rarer use cases of words even more think of what happens if i search something that is not even retrievable by simple string matching like when i ask a question who is the one who has the most money how should the string matching algorithm know that most money means the richest or that the one refers to coffee beans or even worse what if we try to do an image search these two images might look the same but if we look closer bit by bit they are different because the same content is encoded differently so if we search for an image in a database or on the internet we might quickly fail if we just match strings we need a search engine that is more than just a dictionary lookup and is aware of content and context so let's bring in the heavy artillery of today's ai powered search engines well ai is a little of a buzzword here calling them for what they really are modern search engines are vector search engines over vector databases vector databases are a way of organizing the documents in the database in meaningful vectors to be able to search through them with queries that are vectorized beforehand too in contrast things in a dictionary are organized alphabetically which makes sense when searching by string matching but vector databases allow us to organize things even better based on the context in which information finds itself so we have a new system for organizing our database or the index internet based on the principle of context words that occur in the same context belong together we have here two examples of how words that are different live in the same context it becomes clear that the computer which i turned off is different from the computer as a job description that miss coffeebean's grandmother had of course one example is often not representative so we'd look at more and more examples like these and we can set up a graph where computer is close to pc and mac and the jobs are close to computer 2 but pc and the jobs should be farther away from each other if we take in all the other words used on all web pages or in our database we can grow a big graph a dictionary was in a sense one-dimensional as things follow one another only alphabetically the first character was our only feature but now we organize our database entries in this graph where things have more than one feature one feature might be describing how much of a job something is the other feature might describe how human something is from zero human to very human with each feature or dimension that we add the more characteristics we can assign for a data point therefore to what we have crammed here into a two-dimensional visualization we can actually give more space where all the features possibly hundreds each occupy a dimension in a vector space to really flesh out the complex contextual relationships hereby each word becomes a vector encoding the exact position of the word in this high dimensional space to explain this on even higher levels imagine you are going to the supermarket in the classical search engine case it's like the supermarket was a long row of products all organized after the first letter in the alphabet but the problem there is that if we go and find the apples next to the apples there are bongos and not other fruit like pears or strawberries modern vector databases are just giving things more space for modeling relationships better in each vector we can store the position of each data point in this vector space encoding complex relationships in between so after we encoded everything in a meaningful vector we now have a vector database and of course we do not have to do this encoding of words into vectors by hand and we also do not have to come up with features or vectors entries ourselves because there are already available machine learning models for this proprietary search engines of course do not let you choose the vectorizer but customizable search engines like vav88 do give you the choice you could take their custom one based on fast text called the contextionary or the transformers module or a resnet for images or you could add your own machine learning models to vectorize with but for this explanation we will resort to miss coffee beans favorite vectorization method transformers that are state-of-the-art at this they are reading in lots of documents like your whole database the whole wikipedia or the whole internet not kidding after they have read in all sentences in the input data they corrupt their input by deleting words and then they try to reconstruct these words the words are all represented by random vectors at first so initially the output looks random too but example after example these transformers start to learn something about language by deleting reconstructing words and updating their vectors to be better many many times until they start to encode patterns in context dependent vectors these patterns reflect which words occur together in these many many sentences and which context words have a kind of self-organization occurs where vectors in this vector space start to cluster together based on how their respective words behaved in the input data such a model has to be trained once and we can now feed any kinds of text into it without the need of retraining and can encode our whole database now with all the words indexed in our database in vector form we can take in our query pass it through the same model to produce us vectors for the query too and now we can compare the vectors if we search for female computer the word for computer in the query is also informed by its context female therefore we will retrieve the database entries that also figure the word computer in the same job context through the same mechanism we can answer questions too the cool thing is that what we have explained here with text is also possible with images videos speech or other kinds of data too especially because these vectorizing models the transformers can handle all these kinds of data to compute content and context aware vectors for more about this watch some of our videos on multimodality well this is what happens on a high level in ai-powered search engines yeah it's cool especially because it does not need human supervision that would be impossible at these large scales but admittedly it's less magical when one really knows what happens right we are not really telling you that the google or bing or any other search engines work exactly this way the algorithms behind them are proprietary and not disclosed but judging by their capabilities and especially their failure modes there is definitely a vector database and the vector search engine behind them the real lessons about these modern vector search engines we can know from open source projects vv8 is such an open source vector search engine you know open where one can inspect the code open the documentation this is important to have not only because you know what's going in there but also in the situation where you are a company or an institution with millions of documents and want to search through them for example what if you're a hospital with millions of clinical data samples or a factory with product data or a shop with lots of customer data post all your data on the internet to be searchable by google search of course that's a no-go it's estimated that more than 99 of data is not publicly available so if you possess a part of that data you would either have to implement a search engine yourself or take and contribute to the open source work that was already done having control about the search engine on your data is important also concerning the vectorizing model we talked about just before that takes in data and transforms it into vectors because that model is good with types of data it has already seen if it was trained on web pages it is good at judging internet forum text normal websites but what if you have technical engineering terms or medical data in your database in this case you need to take this model and find unit on that medical data meaning that you train it further by showing it medical text it takes it in and self trains by following the same procedure as before it deletes words and tries to reconstruct a text based on the corrupted version only this then lets it extract patterns on this new domain of data like medical texts or other domains we v8 even has a simpler solution where for every undefined word you can provide the definition of that word and use this as a context and you do not need to do any further training so to recapitulate what we need for a vector search engine first we need the data then we need the machine learning model like the transformer but doesn't need to which we use to vectorize the data so our database is now a vector database and we are ready to search through it all we need is a query that we should also vectorize with the same machine learning model because we now want to ask which vector in the database is closest to our query but miss coffee bean is not the best coder when it comes to handling tons of data without running out of memory and waiting for ages for the search to complete searching through a lot of data fast is not an easy thing admittedly so here is where she wants to rely on vv8 that has already figured it out how to scale the vector search for millions of vectors in milliseconds okay miss coffee bean you are allowed to check it out i mean why not it's open source after all so now miss coffee bean are you ready to find out the truth we need to ask where can i get the best coffee for my ai coffee break ah around the globe exactly that's the spirit we hope you liked our explanation of modern search engines in general if you're interested in a more technical presentation about vd8 in particular check out the video in the upper right corner also linked in the description below you

Original Description

Modern search engines explained. What you need to know about today’s vector search engines, explained on a high-level with visuals! 👉 Check out Weaviate: Our sponsor’s open-source customizable search engine and vector database. ► Get started with Weaviate: https://www.semi.technology/developers/weaviate/current/ ► Weaviate on GitHub: https://github.com/semi-technologies/weaviate ► 📺 Learn more about Weaviate: https://youtu.be/IExopg1r4fw ⏭️ Ms. Coffee Bean’s Multimodalities: https://youtube.com/playlist?list=PLpZBeKTZRGPNKxoNaeMD9GViU_aH_HJab Outline: 00:00 Intro to search engines and our sponsor 01:12 Naive search engines 03:35 AI-power: Vector search engines explained 06:05 Vectors and why we need them 06:49 How to vectorizer models work? 10:38 Adapting the ML-model to your use case 12:20 Summary & playing with Weaviate Thanks to our Patrons who support us in Tier 2, 3, 4: 🙏 donor, Dres. Trost GbR ➡️ AI Coffee Break Merch! 🛍️ https://aicoffeebreak.creator-spring.com/ ▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀ 🔥 Optionally, pay us a coffee to help with our Coffee Bean production! ☕ Patreon: https://www.patreon.com/AICoffeeBreak Ko-fi: https://ko-fi.com/aicoffeebreak ▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀ 🔗 Links: AICoffeeBreakQuiz: https://www.youtube.com/c/AICoffeeBreak/community Twitter: https://twitter.com/AICoffeeBreak Reddit: https://www.reddit.com/r/AICoffeeBreak/ YouTube: https://www.youtube.com/AICoffeeBreak #AICoffeeBreak #MsCoffeeBean #Weaviate #SeMI #MachineLearning #AI

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from AI Coffee Break with Letitia · AI Coffee Break with Letitia · 53 of 60

← Previous Next →

AI Coffee Break - Channel Trailer

AI Coffee Break - Channel Trailer

AI Coffee Break with Letitia

How to check if a neural network has learned a specific phenomenon?

How to check if a neural network has learned a specific phenomenon?

AI Coffee Break with Letitia

A brief history of the Transformer architecture in NLP

A brief history of the Transformer architecture in NLP

AI Coffee Break with Letitia

Our paper at CVPR 2020 - MUL Workshop and ACL 2020 - ALVR Workshop

Our paper at CVPR 2020 - MUL Workshop and ACL 2020 - ALVR Workshop

AI Coffee Break with Letitia

The Transformer neural network architecture EXPLAINED. “Attention is all you need”

The Transformer neural network architecture EXPLAINED. “Attention is all you need”

AI Coffee Break with Letitia

Transformer combining Vision and Language? ViLBERT - NLP meets Computer Vision

Transformer combining Vision and Language? ViLBERT - NLP meets Computer Vision

AI Coffee Break with Letitia

Pre-training of BERT-based Transformer architectures explained – language and vision!

Pre-training of BERT-based Transformer architectures explained – language and vision!

AI Coffee Break with Letitia

GPT-3 explained with examples. Possibilities, and implications.

GPT-3 explained with examples. Possibilities, and implications.

AI Coffee Break with Letitia

Adversarial Machine Learning explained! | With examples.

Adversarial Machine Learning explained! | With examples.

AI Coffee Break with Letitia

BERTology meets Biology | Solving biological problems with Transformers

BERTology meets Biology | Solving biological problems with Transformers

AI Coffee Break with Letitia

Can a neural network tell if an image is mirrored? – Visual Chirality

Can a neural network tell if an image is mirrored? – Visual Chirality

AI Coffee Break with Letitia

The ultimate intro to Graph Neural Networks. Maybe.

The ultimate intro to Graph Neural Networks. Maybe.

AI Coffee Break with Letitia

Can language models understand? Bender and Koller argument.

Can language models understand? Bender and Koller argument.

AI Coffee Break with Letitia

GANs explained | Generative Adversarial Networks video with showcase!

GANs explained | Generative Adversarial Networks video with showcase!

AI Coffee Break with Letitia

What nobody tells you about MULTIMODAL Machine Learning! 🙊 THE definition.

What nobody tells you about MULTIMODAL Machine Learning! 🙊 THE definition.

AI Coffee Break with Letitia

Multimodal Machine Learning models do not work. Here is why. Part 1/2 – The SYMPTOMS

Multimodal Machine Learning models do not work. Here is why. Part 1/2 – The SYMPTOMS

AI Coffee Break with Letitia

Why Multimodal Machine Learning models do not work. Part 2/2 – The CAUSES

Why Multimodal Machine Learning models do not work. Part 2/2 – The CAUSES

AI Coffee Break with Letitia

An image is worth 16x16 words: ViT | Vision Transformer explained

An image is worth 16x16 words: ViT | Vision Transformer explained

AI Coffee Break with Letitia

AI understanding language!? A roadmap to natural language understanding.

AI understanding language!? A roadmap to natural language understanding.

AI Coffee Break with Letitia

"What Can We Do to Improve Peer Review in NLP?" 👀

"What Can We Do to Improve Peer Review in NLP?" 👀

AI Coffee Break with Letitia

The curse of dimensionality. Or is it a blessing?

The curse of dimensionality. Or is it a blessing?

AI Coffee Break with Letitia

PCA explained with intuition, a little math and code

PCA explained with intuition, a little math and code

AI Coffee Break with Letitia

Data-efficient Image Transformers EXPLAINED! Facebook AI's DeiT paper

Data-efficient Image Transformers EXPLAINED! Facebook AI's DeiT paper

AI Coffee Break with Letitia

OpenAI's DALL-E explained. How GPT-3 creates images from descriptions.

OpenAI's DALL-E explained. How GPT-3 creates images from descriptions.

AI Coffee Break with Letitia

Leaking training data from GPT-2. How is this possible?

Leaking training data from GPT-2. How is this possible?

AI Coffee Break with Letitia

OpenAI’s CLIP explained! | Examples, links to code and pretrained model

OpenAI’s CLIP explained! | Examples, links to code and pretrained model

AI Coffee Break with Letitia

Transformers can do both images and text. Here is why.

Transformers can do both images and text. Here is why.

AI Coffee Break with Letitia

UMAP explained | The best dimensionality reduction?

UMAP explained | The best dimensionality reduction?

AI Coffee Break with Letitia

NVIDIA Jarvis (now NVIDIA Riva) meets Ms. Coffee Bean

NVIDIA Jarvis (now NVIDIA Riva) meets Ms. Coffee Bean

AI Coffee Break with Letitia

Transformer in Transformer: Paper explained and visualized | TNT

Transformer in Transformer: Paper explained and visualized | TNT

AI Coffee Break with Letitia

[RANT] Adversarial attack on OpenAI’s CLIP? Are we the fools or the foolers?

[RANT] Adversarial attack on OpenAI’s CLIP? Are we the fools or the foolers?

AI Coffee Break with Letitia

Pattern Exploiting Training explained! | PET, iPET, ADAPET

Pattern Exploiting Training explained! | PET, iPET, ADAPET

AI Coffee Break with Letitia

Deep Learning for Symbolic Mathematics!? | Paper EXPLAINED

Deep Learning for Symbolic Mathematics!? | Paper EXPLAINED

AI Coffee Break with Letitia

FNet: Mixing Tokens with Fourier Transforms – Paper Explained

FNet: Mixing Tokens with Fourier Transforms – Paper Explained

AI Coffee Break with Letitia

Are Pre-trained Convolutions Better than Pre-trained Transformers? – Paper Explained

Are Pre-trained Convolutions Better than Pre-trained Transformers? – Paper Explained

AI Coffee Break with Letitia

"Please Commit More Blatant Academic Fraud" – A fellow PhD student's response.

"Please Commit More Blatant Academic Fraud" – A fellow PhD student's response.

AI Coffee Break with Letitia

Scaling Vision Transformers? How much data can a transformer get? #Shorts

Scaling Vision Transformers? How much data can a transformer get? #Shorts

AI Coffee Break with Letitia

How cross-modal are vision and language models really? 👀 Seeing past words. [Own work]

How cross-modal are vision and language models really? 👀 Seeing past words. [Own work]

AI Coffee Break with Letitia

Charformer: Fast Character Transformers via Gradient-based Subword Tokenization +Tokenizer explained

Charformer: Fast Character Transformers via Gradient-based Subword Tokenization +Tokenizer explained

AI Coffee Break with Letitia

Positional embeddings in transformers EXPLAINED | Demystifying positional encodings.

Positional embeddings in transformers EXPLAINED | Demystifying positional encodings.

AI Coffee Break with Letitia

Adding vs. concatenating positional embeddings & Learned positional encodings

Adding vs. concatenating positional embeddings & Learned positional encodings

AI Coffee Break with Letitia

Self-Attention with Relative Position Representations – Paper explained

Self-Attention with Relative Position Representations – Paper explained

AI Coffee Break with Letitia

Saddle points vs. local minima in high dimensional spaces | ❓ #AICoffeeBreakQuiz #Shorts

Saddle points vs. local minima in high dimensional spaces | ❓ #AICoffeeBreakQuiz #Shorts

AI Coffee Break with Letitia

What is the model identifiability problem? | Explained in 60 seconds! | ❓ #AICoffeeBreakQuiz #Shorts

What is the model identifiability problem? | Explained in 60 seconds! | ❓ #AICoffeeBreakQuiz #Shorts

AI Coffee Break with Letitia

Data leakage during data preparation? | Using AntiPatterns to avoid MLOps Mistakes

Data leakage during data preparation? | Using AntiPatterns to avoid MLOps Mistakes

AI Coffee Break with Letitia

Is today's AI smarter than YOU? #Shorts

Is today's AI smarter than YOU? #Shorts

AI Coffee Break with Letitia

Convolution vs Cross-Correlation. How most CNNs do not compute convolutions. | ❓ #Shorts

Convolution vs Cross-Correlation. How most CNNs do not compute convolutions. | ❓ #Shorts

AI Coffee Break with Letitia

Why do we care about cross-correlations vs convolutions | ❓ #AICoffeeBreakQuiz #Shorts

Why do we care about cross-correlations vs convolutions | ❓ #AICoffeeBreakQuiz #Shorts

AI Coffee Break with Letitia

The convolution is not shift invariant. | Invariance vs Equivariance | ❓ #AICoffeeBreakQuiz #Shorts

The convolution is not shift invariant. | Invariance vs Equivariance | ❓ #AICoffeeBreakQuiz #Shorts

AI Coffee Break with Letitia

How to increase the receptive field in CNNs? | #AICoffeeBreakQuiz #Shorts

How to increase the receptive field in CNNs? | #AICoffeeBreakQuiz #Shorts

AI Coffee Break with Letitia

What is tokenization and how does it work? Tokenizers explained.

What is tokenization and how does it work? Tokenizers explained.

AI Coffee Break with Letitia

Foundation Models | On the opportunities and risks of calling pre-trained models “Foundation Models”

Foundation Models | On the opportunities and risks of calling pre-trained models “Foundation Models”

AI Coffee Break with Letitia

How modern search engines work – Vector databases explained! | Weaviate open-source

How modern search engines work – Vector databases explained! | Weaviate open-source

AI Coffee Break with Letitia

Eyes tell all: How to tell that an AI generated a face?

Eyes tell all: How to tell that an AI generated a face?

AI Coffee Break with Letitia

Swin Transformer paper animated and explained

Swin Transformer paper animated and explained

AI Coffee Break with Letitia

Data BAD | What Will it Take to Fix Benchmarking for NLU?

Data BAD | What Will it Take to Fix Benchmarking for NLU?

AI Coffee Break with Letitia

SimVLM explained | What the paper doesn’t tell you

SimVLM explained | What the paper doesn’t tell you

AI Coffee Break with Letitia

Generalization – Interpolation – Extrapolation in Machine Learning: Which is it now!?

Generalization – Interpolation – Extrapolation in Machine Learning: Which is it now!?

AI Coffee Break with Letitia

Do Transformers process sequences of FIXED or of VARIABLE length? | #AICoffeeBreakQuiz

Do Transformers process sequences of FIXED or of VARIABLE length? | #AICoffeeBreakQuiz

AI Coffee Break with Letitia

The efficiency misnomer | Size does not matter | What does the number of parameters mean in a model?

The efficiency misnomer | Size does not matter | What does the number of parameters mean in a model?

AI Coffee Break with Letitia

This video teaches viewers how modern search engines work, with a focus on vector databases and their ability to organize data based on context. It covers the basics of vector databases, Transformer models, and vector search engines, and provides a high-level overview of the technology. Viewers will learn how to use tools like Weaviate and Transformers to implement RAG search and vector databases.

Key Takeaways

Understand the basics of vector databases
Learn how to use Weaviate for vector storage
Implement RAG search using Transformers
Evaluate the performance of vector search engines
Fine-tune Transformer models for advanced RAG search

💡 Vector databases organize data based on context, not just alphabetical order, allowing for more accurate and efficient search results.

🔒 Pro feature: Ask AI to explain this lesson →

More on: RAG Basics

View skill →

High Performance (Realtime) RAG Chains: From Basic to Advanced

High Performance (Realtime) RAG Chains: From Basic to Advanced

Coding the Ultimate RAG Engine from Zero

Coding the Ultimate RAG Engine from Zero

Building Agentic RAG From Scratch in Pure Python

Building Agentic RAG From Scratch in Pure Python

Build an LLM and RAG-based Chat Application using AlloyDB and LangChain

I Built a RAG App to Decode Airline Bureaucracy (So You Don't Have To)

I Built a RAG App to Decode Airline Bureaucracy (So You Don't Have To)

Akamai Developers

RAG Demo for Beginners: Full Hands-On Tutorial in Tamil | Build Your Own RAG AI | Karthik's Show

RAG Demo for Beginners: Full Hands-On Tutorial in Tamil | Build Your Own RAG AI | Karthik's Show

Related Reads

RAG Is Not a Feature. It's a System, and These Are the Parts Nobody Demos.

Learn how RAG is a system, not a feature, and understand its key components beyond demos

Dev.to · Marketing wizr

What Is RAG? The AI Technology That Makes ChatGPT Smarter Without Retraining

Learn about RAG, the AI technology that enhances ChatGPT's capabilities without requiring retraining, and why it matters for advancing language models

Understanding the Limits of Linear RAG — and Why Agentic Workflows Are Catching On

Learn the limitations of linear RAG pipelines and how agentic workflows are becoming a popular alternative for more efficient and effective AI workflows

Understanding the Limits of Linear RAG — and Why Agentic Workflows Are Catching On

Learn why linear RAG pipelines have limitations and how Agentic workflows are becoming a preferred alternative in the industry

Medium · Machine Learning

Chapters (7)

Intro to search engines and our sponsor

1:12 Naive search engines

3:35 AI-power: Vector search engines explained

6:05 Vectors and why we need them

6:49 How to vectorizer models work?

10:38 Adapting the ML-model to your use case

12:20 Summary & playing with Weaviate

RRF vs DBSF with Qdrant: Hybrid Retrieval Fusion for RAG in Python

Professor Py: AI Engineering