Core AI

RAG & Vector Search

Retrieval-augmented generation, vector databases, embeddings and semantic search

4,147 lessons
Skills in this topic
RAG Basics
beginner
Chunk documents with LangChain or LlamaIndex
Vector Stores
intermediate
Set up Pinecone, Weaviate, or pgvector
RAG Evaluation
intermediate
Run RAGAS evaluation on a RAG pipeline
Advanced RAG
advanced
Build a hybrid BM25 + dense retrieval pipeline
All Reads (683) Articles (194) Blog Posts (418) Tutorials (69) Research Papers (2)
Why Your RAG System Needs a Graph Database (Not Just Vectors)
Dev.to · Nathaniel Hamlett 🔍 RAG & Vector Search 3mo ago
Vector search finds what's similar. Graph traversal finds what's connected. I built a system with both — 3M vectors and 252K graph nodes — and the graph catches
How I Built a Production RAG Pipeline with FastAPI, pgvector, and Cross-Encoder Reranking
Dev.to · Martin Palopoli 🔍 RAG & Vector Search 3mo ago
I built a production RAG engine that combines hybrid search (pgvector + BM25), reranking with...
RAG Security 101: Protecting Your Retrieval-Augmented Generation Pipeline
Dev.to · BotGuard 🔍 RAG & Vector Search 3mo ago
A single maliciously crafted input can bring down an entire Retrieval-Augmented Generation (RAG)...
Building Scalable RAG Systems with Hierarchical Clustering + Hierarchical RAG (and Why Cluster Summaries Matter)
Dev.to · Praveen Kumar 🔍 RAG & Vector Search 3mo ago
Retrieval-Augmented Generation (RAG) has become the backbone of many AI-powered applications such as...
Understanding Word2Vec – Part 6: Two Ways Word2Vec Learns Context
Dev.to · Rijul Rajesh 🔍 RAG & Vector Search 3mo ago
In the previous article, we saw the word embeddings concept, and how training causes similar words to...
Vector Database Integration Patterns: pgvector, Qdrant, and Weaviate Compared
Dev.to · myougaTheAxo 🔍 RAG & Vector Search 3mo ago
How to Choose a Vector Database: Vector DB selection affects the entire system...
Complete RAG Tutorial Python: Build Your First Agent
Dev.to · Iniyarajan 🔍 RAG & Vector Search 3mo ago
Complete RAG tutorial Python guide. Build production-ready Retrieval-Augmented Generation systems with LangChain, vector databases, and AI agents.
OpenAI Batch API: Adaptive Status Checks
Dev.to · Marin Muštra 🔍 RAG & Vector Search 3mo ago
OpenAI's Batch API lets you run large asynchronous workloads (evaluations, RAG, embeddings, and more)...
Detecting Embedding Drift: The Silent Killer of RAG Accuracy
Dev.to · Zaki Cole 🔍 RAG & Vector Search 3mo ago
Your RAG pipeline shipped fine. Then answers started slipping. The problem is upstream, not the LLM. Here's how embedding drift breaks retrieval and what to do
How We Built AI Search for WooCommerce Using RAG
Dev.to · Anton Pronin 🔍 RAG & Vector Search 3mo ago
TL;DR: Default WooCommerce search is a glorified blog search engine. We built a RAG-based AI search...
Multi‑Vector Embeddings in Production: Typed Vectors, Cache Keys, and a Generator That Refuses Poison Records
Dev.to · Daniel Romitelli 🔍 RAG & Vector Search 3mo ago
I built an embedding pipeline for our recruitment platform that represents each record as four typed vectors instead of one pooled blob: profile, e...
Understanding Word2Vec – Part 5: How Training Creates Word Embeddings
Dev.to · Rijul Rajesh 🔍 RAG & Vector Search 3mo ago
In the previous article, we visualized the vectors on a graph and saw how we can represent similarity...
Vector Databases Explained: A Builder's Guide
Dev.to · Jamie Thompson 🔍 RAG & Vector Search 3mo ago
A practical comparison of Pinecone, Qdrant, pgvector, and Weaviate from someone who has built production RAG systems. Tradeoffs, performance characteristics, an
What Is RAG? A Practitioner's Guide to Retrieval-Augmented Generation
Dev.to · Jamie Thompson 🔍 RAG & Vector Search 3mo ago
A practical guide to RAG from someone who has deployed it in production for government and enterprise clients. Covers architecture, vector databases, chunking s
How I Built an Anti-Hallucination Pipeline for Enterprise Legal Documents
Dev.to · Victor Okefie 🔍 RAG & Vector Search 3mo ago
The standard advice for building RAG pipelines is to improve your retrieval. Better embeddings....
I rewrote LangChain in 300 lines of Rust (and here's what I found)
Dev.to · Lakshmi Sravya Vedantham 🔍 RAG & Vector Search 3mo ago
LangChain has over 200,000 lines of code. I wanted to understand what RAG actually does — not what...
How Retrieval Augmented Generation Actually Prevents Ai Hallucinations
Dev.to · Delafosse Olivier 🔍 RAG & Vector Search 3mo ago
Originally published on CoreProse KB-incidents Introduction Retrieval Augmented...
LangChain vs LlamaIndex vs Haystack: What I Learned Building RAG in Production
Dev.to · Moon Robert 🔍 RAG & Vector Search 3mo ago
I spent the last two weeks migrating a RAG system between three frameworks, and it was not a decision...
LangChain vs LlamaIndex vs Haystack: What Two Weeks in Production Actually Taught Me
Dev.to · Moon Robert 🔍 RAG & Vector Search 3mo ago
My team got handed a RAG project earlier this year — 40,000 documents, mix of PDFs and Confluence...
Decoding Embedding Models: Why Your RAG Is Only as Good as Your Vectors 🚀
Dev.to · Ananya S 🔍 RAG & Vector Search 3mo ago
As an AI Engineer, the first major decision you make in a RAG (Retrieval-Augmented Generation)...
FastGPT vs Dify: The Chinese RAG Platform Battle You're Missing
Dev.to · Victorjia 🔍 RAG & Vector Search 3mo ago
There's a RAG platform with 27,000+ GitHub stars, 200,000+ users, and a thriving ecosystem of...
Building RAG with pgvector: Why I Stopped Paying for Pinecone
Dev.to · Moon Robert 🔍 RAG & Vector Search 3mo ago
In January I opened my Supabase and Pinecone invoices at the same time for the first time. Until that moment...
Building RAG with pgvector: Why I Stopped Paying for Pinecone
Dev.to · Moon Robert 🔍 RAG & Vector Search 3mo ago
The $340 invoice showed up on a Tuesday in late January. Not wild money in isolation, but this was...
Specialized chatbot using RAG
Dev.to · NEBULA DATA 🔍 RAG & Vector Search 3mo ago
Specialized Chatbot using RAG (Retrieval-Augmented Generation) — Part II In the previous...
Building a RAG System from Scratch: Turning Aviation Disruption Data into an AI-Powered Q&A App
Dev.to · parupati madhukar reddy 🔍 RAG & Vector Search 3mo ago
I recently built a Retrieval-Augmented Generation (RAG) system that lets you ask natural language...
How I Scoped a RAG Knowledge Base Per Tenant
Dev.to · Damla Hamurcu 🔍 RAG & Vector Search 3mo ago
I'm building a multi-tenant AI chatbot, the kind where businesses embed a chat widget on their site,...
Build a Production RAG Chatbot with Django + pgvector + OpenAI (Full Guide)
Dev.to · Kirill Strelnikov 🔍 RAG & Vector Search 3mo ago
I'm Kirill Strelnikov, a freelance AI/Django developer in Barcelona. I've built RAG chatbots that...
Let's build a flow to vectorize data
Dev.to · Allan Roberto 🔍 RAG & Vector Search 3mo ago
Indexing Knowledge Base Content with Spring Boot and pgvector ...
Turning PostgreSQL Into a Vector Database with Docker
Dev.to · Allan Roberto 🔍 RAG & Vector Search 3mo ago
To store and query embeddings, we need a database capable of handling vector similarity search. A...
From Image to Vector: Building Image Similarity Search with Python and MySQL
Dev.to · Sanjay Ghosh 🔍 RAG & Vector Search 3mo ago
Modern applications increasingly rely on vector embeddings to search and compare data such as text,...
Building Production RAG Pipelines on AWS with Bedrock and OpenSearch
Dev.to · Kehinde Ogunlowo 🔍 RAG & Vector Search 3mo ago
RAG (Retrieval-Augmented Generation) is how enterprises are deploying LLMs without fine-tuning. But...
Building a Semantic Search API with Spring Boot and pgvector - Part 1: Architecture
Dev.to · Ozioma Ochin 🔍 RAG & Vector Search 3mo ago
The problem with keyword search: keyword search breaks more often than most engineers...
Vectorless RAG: Understanding How to Do RAG Without a Vector Database
Dev.to · suissAI 🔍 RAG & Vector Search 3mo ago
The Retrieval-Augmented Generation (RAG) industry has spent the last two years orbiting the same...
Vectorizing Your Vitals: Converting 10GB of Apple Health Data into a Personal RAG Brain
Dev.to · Beck_Moulton 🔍 RAG & Vector Search 3mo ago
If you've ever tried to open your Apple Health export file, you know it's where dreams of "quantified...
Scaling pgvector: Memory, Quantization, and Index Build Strategies
Dev.to · Philip McClarence 🔍 RAG & Vector Search 3mo ago
pgvector handles...
I Deleted Pinecone, Redis, and 400 Lines of Python. My RAG Pipeline Still Works.
Dev.to · Ahmet Zeybek 🔍 RAG & Vector Search 3mo ago
I had 5 services running for a RAG pipeline. Turns out I only needed PostgreSQL. Here's the whole thing in SQL.
I got tired of writing 30 lines of LangChain boilerplate every time. So I published a fix.
Dev.to · Aman Pandey 🔍 RAG & Vector Search 3mo ago
Every time I started a new project that needed RAG, I wrote the same 30 lines. Load documents. Split...
Your RAG Pipeline is Leaking - 4 Data Leak Points Nobody Talks About
Dev.to · Rohan Sharma 🔍 RAG & Vector Search 3mo ago
Every enterprise running RAG today is doing what Samsung engineers did in 2023 — sending sensitive...
How Retrieval-Augmented Generation (RAG) Works on AWS
Dev.to · saif ur rahman 🔍 RAG & Vector Search 3mo ago
Generative AI models are powerful,...
Document RAG and GraphRAG APIs in NodeJS
Dev.to · Muhammad Arslan 🔍 RAG & Vector Search 3mo ago
Real-world document ingestion, semantic search, RAG Q&A, and knowledge-graph retrieval—all in one...
Zero-Knowledge AI Matching: Binarized Embeddings + Hamming Distance
Dev.to · Venkat 🔍 RAG & Vector Search 3mo ago
Part 3 of a series on building a privacy-first dating platform for HIV-positive communities. Building...
Why CRAG is the Evolutionary Leap RAG Has Been Waiting For
Dev.to · ruchika bhat 🔍 RAG & Vector Search 3mo ago
For all the justifiable hype surrounding Retrieval-Augmented Generation (RAG), a dirty secret lurks...
Your pgvector Queries Are Doing Sequential Scans — Here's Why
Dev.to · Philip McClarence 🔍 RAG & Vector Search 3mo ago
pgvector makes it easy to...
From Prototype to Production: Building a Reliable RAG API with FastAPI + ChromaDB
Dev.to · Sowndappan S 🔍 RAG & Vector Search 3mo ago
I recently upgraded my Retrieval-Augmented Generation (RAG) project from a simple demo into a...
Best Open-Source LLMs for RAG in 2026: 10 Models Ranked by Retrieval Accuracy
Dev.to · Jaipal Singh 🔍 RAG & Vector Search 3mo ago
The best LLM for RAG is two models working together. Your embedding model determines whether you retrieve the right chunks. Your generation model determines whe
SQLite as a Vector Database — Yes, Really
Dev.to · Zoricic 🔍 RAG & Vector Search 3mo ago
Do you really need a vector database? For local AI agents, SQLite handles embeddings just fine — and...
Beyond the API Wrapper: A Web Developer's Deep Dive into RAG (Retrieval-Augmented Generation)
Dev.to · Armand al-farizy 🔍 RAG & Vector Search 3mo ago
Take a look around the tech ecosystem today. Every week, hundreds of new "AI...