Core AI

RAG & Vector Search

Retrieval-augmented generation, vector databases, embeddings and semantic search

4,144
lessons
Skills in this topic
View full skill map →
RAG Basics
beginner
Chunk documents with LangChain or LlamaIndex
Vector Stores
intermediate
Set up Pinecone, Weaviate, or pgvector
RAG Evaluation
intermediate
Run RAGAS evaluation on a RAG pipeline
Advanced RAG
advanced
Build a hybrid BM25 + dense retrieval pipeline
All Reads (680) Articles (192)Blog Posts (418)Tutorials (68)Research Papers (2)
PostgreSQL as a Vector Database: When to Use pgvector vs Pinecone vs Weaviate
Dev.to · Polliog 🔍 RAG & Vector Search 3mo ago
PostgreSQL as a Vector Database: When to Use pgvector vs Pinecone vs Weaviate
"Should we use PostgreSQL as our vector database?" I've heard this question a lot in 2026. pgvector...
IVFFlat vs HNSW in pgvector: Which Index Should You Use?
Dev.to · Philip McClarence 🔍 RAG & Vector Search 3mo ago
IVFFlat vs HNSW in pgvector: Which Index Should You Use?
IVFFlat vs HNSW in pgvector: Which Index Should You Use? Every pgvector deployment...
## I Built an AI Tourism Assistant for Kenya Using RAG, pgvector, and Streamlit
Dev.to · John Wakaba 🔍 RAG & Vector Search 3mo ago
## I Built an AI Tourism Assistant for Kenya Using RAG, pgvector, and Streamlit
Imagine asking: "What's the best luxury safari in Maasai Mara?" and instantly getting...
Fine-tuning vs RAG: When to Use Each Approach for Production LLMs
Dev.to · Moon Robert 🔍 RAG & Vector Search 3mo ago
Fine-tuning vs RAG: When to Use Each Approach for Production LLMs
Fine-tuning vs RAG: When to Use Each Approach for Production LLMs You've shipped a...
Fine-tuning vs RAG: When to Use Each Approach for Production LLMs
Dev.to · Moon Robert 🔍 RAG & Vector Search 3mo ago
Fine-tuning vs RAG: When to Use Each Approach for Production LLMs
Fine-tuning vs RAG: When to Use Each Approach for Production LLMs You've shipped a...
Stanford Just Exposed the Fatal Flaw Killing Every RAG System at Scale
Dev.to · Aaryan Shukla 🔍 RAG & Vector Search 3mo ago
Stanford Just Exposed the Fatal Flaw Killing Every RAG System at Scale
RAG was supposed to fix hallucinations. Turns out it just hid them behind math. I've been deep in...
Building a Knowledge Base with RAG Using NestJS, LangChain and OpenAI
Dev.to · Daniel Malek 🔍 RAG & Vector Search 3mo ago
Building a Knowledge Base with RAG Using NestJS, LangChain and OpenAI
Source code: github.com/Dan1618/Articles-rag 1. What We're Building and...
Private RAG Deployment: Building Zero-Leakage Retrieval Pipelines for Enterprise
Dev.to · Jaipal Singh 🔍 RAG & Vector Search 3mo ago
Private RAG Deployment: Building Zero-Leakage Retrieval Pipelines for Enterprise
Private RAG deployment matters when you’re handling sensitive data. But here’s the problem: most implementations leak information at multiple points without tea
From Zero to RAG: Implementing Retrieval-Augmented Generation in a Laravel Application
Dev.to · Marcc Atayde 🔍 RAG & Vector Search 3mo ago
From Zero to RAG: Implementing Retrieval-Augmented Generation in a Laravel Application
Imagine you've deployed a sleek AI chatbot for a client — it answers questions confidently, users...
The Real Reason Your RAG Dies in Production — Your Vector DB Is Full of Garbage
Dev.to · dosanko_tousan 🔍 RAG & Vector Search 4mo ago
The Real Reason Your RAG Dies in Production — Your Vector DB Is Full of Garbage
§0 About the Person Writing This Non-engineer. 50 years old. Stay-at-home dad in Hokkaido,...
My RAG Pipeline Took an Hour. Here's How I Got It Down to 30 Seconds.
Dev.to · Prithvi Rajan 🔍 RAG & Vector Search 4mo ago
My RAG Pipeline Took an Hour. Here's How I Got It Down to 30 Seconds.
A content ingestion job used to take over an hour. Now it finishes in 30 seconds. No change in...
pgvector Distance Functions: Cosine vs L2 vs Inner Product
Dev.to · Philip McClarence 🔍 RAG & Vector Search 4mo ago
pgvector Distance Functions: Cosine vs L2 vs Inner Product
pgvector Distance Functions: Cosine vs L2 vs Inner Product If you're using pgvector for...
I Built a Baseline RAG System — Then Measured Where It Actually Breaks
Dev.to · Mukesh Z 🔍 RAG & Vector Search 4mo ago
I Built a Baseline RAG System — Then Measured Where It Actually Breaks
Most RAG demos stop at: “Look, it answers correctly.” I wanted to go further. Instead of...
I Built a Baseline RAG System — Then Measured Where It Actually Breaks
Dev.to · Mukesh Z 🔍 RAG & Vector Search 4mo ago
I Built a Baseline RAG System — Then Measured Where It Actually Breaks
Most RAG demos stop at: “Look, it answers correctly.” I wanted to go further. Instead of...
Production RAG with Semantic Kernel: Patterns, Chunking, and Retrieval Strategies
Dev.to · Brian Spann 🔍 RAG & Vector Search 4mo ago
Production RAG with Semantic Kernel: Patterns, Chunking, and Retrieval Strategies
Retrieval-Augmented Generation (RAG) is the pattern that makes LLMs genuinely useful for enterprise...
I Built My Own dev.to Feed Page Instead of Embedding a Widget
Dev.to · Kristieene Knowles 🔍 RAG & Vector Search 4mo ago
I Built My Own dev.to Feed Page Instead of Embedding a Widget
There’s an easy way to show your dev.to posts on your website. You paste in a widget. You let it...
How I Built a Smart Q&A Bot for a Local Music Venue in a Weekend — A Practical Introduction to RAG Systems
Dev.to · Clint Westbrook 🔍 RAG & Vector Search 4mo ago
How I Built a Smart Q&A Bot for a Local Music Venue in a Weekend — A Practical Introduction to RAG Systems
How I Built a Smart Q&A Bot for a Local Music Venue in a Weekend — A Practical...
Beyond Keywords: Building an AI Assistant for Aviation Maintenance using Elastic RAG
Dev.to · Arnab Sen 🔍 RAG & Vector Search 4mo ago
Beyond Keywords: Building an AI Assistant for Aviation Maintenance using Elastic RAG
> **Disclaimer**: This blog post was submitted to the Elastic Blogathon Contest and is eligible...
Semantic Kernel Memory: Vector Stores, Embeddings, and Semantic Search
Dev.to · Brian Spann 🔍 RAG & Vector Search 4mo ago
Semantic Kernel Memory: Vector Stores, Embeddings, and Semantic Search
LLMs have a fundamental limitation: they're stateless. Every request starts fresh with no memory of...
RAG vs GraphRAG: When Agents Hallucinate Answers
Dev.to · Elizabeth Fuentes L 🔍 RAG & Vector Search 4mo ago
RAG vs GraphRAG: When Agents Hallucinate Answers
Traditional RAG makes AI agents hallucinate statistics and aggregations. This demo builds a travel...
I Switched From Pure Vector Search to Hybrid Retrieval in My RAG System — Here's What Changed
Dev.to · vapmail16 🔍 RAG & Vector Search 4mo ago
I Switched From Pure Vector Search to Hybrid Retrieval in My RAG System — Here's What Changed
I've been building RAG (Retrieval-Augmented Generation) systems for a while now, and I recently made...
Bedrock, Agents, and RAG on AWS: the design that takes generative AI from prototype to production with confidence
Dev.to · Cláudio Menezes de Oliveira Santos 🔍 RAG & Vector Search 4mo ago
Bedrock, Agents, and RAG on AWS: the design that takes generative AI from prototype to production with confidence
What changes when the question stops being “does it work?” and becomes “can we trust it?” If you have...
Adaptive RAG Depth Control: Dynamically Optimizing Retrieval for Cost and Quality
Dev.to · Shreekansha 🔍 RAG & Vector Search 4mo ago
Adaptive RAG Depth Control: Dynamically Optimizing Retrieval for Cost and Quality
What RAG Depth Means Beyond Top-k In a naive RAG implementation, depth is defined as the fixed...
How we built a hybrid FTS5 + embedding search for code — and why you need both
Dev.to · Tim Uy 🔍 RAG & Vector Search 4mo ago
How we built a hybrid FTS5 + embedding search for code — and why you need both
How we built a hybrid FTS5 + embedding search for code — and why you need both srclight...
RAG on a Budget: How I Replaced a $360/Month OpenSearch Cluster with S3 and In-Memory Search
Dev.to · Stephanie 🔍 RAG & Vector Search 4mo ago
RAG on a Budget: How I Replaced a $360/Month OpenSearch Cluster with S3 and In-Memory Search
I shelved a project in March 2025 because it cost $360/month to run. I rebuilt it in January 2026...
Building Intelligent Chatbots with RAG and Vector Databases: A Practical Developer's Guide
Dev.to · Marcc Atayde 🔍 RAG & Vector Search 4mo ago
Building Intelligent Chatbots with RAG and Vector Databases: A Practical Developer's Guide
If you've ever watched a GPT-powered chatbot confidently hallucinate a fact that doesn't exist, you...
Beyond RAG: Building Self Healing Vector Indexes with Elasticsearch for Production Grade Agentic Systems
Dev.to · Mihir Phalke 🔍 RAG & Vector Search 4mo ago
Beyond RAG: Building Self Healing Vector Indexes with Elasticsearch for Production Grade Agentic Systems
TL;DR Production RAG systems face a silent killer: vector drift. Embeddings become stale,...
RAG From Scratch: Build a System That Answers Questions From Your Docs
Dev.to · vapmail16 🔍 RAG & Vector Search 4mo ago
RAG From Scratch: Build a System That Answers Questions From Your Docs
My first RAG system answered "I don't know" to questions that were clearly in the documents. The...
Graph RAG: Architecture and Implementation of Knowledge-Graph-Augmented Generation
Dev.to · Sandeep Pamarthi 🔍 RAG & Vector Search 4mo ago
Graph RAG: Architecture and Implementation of Knowledge-Graph-Augmented Generation
How replacing flat vector retrieval with structured graph traversal unlocks multi-hop reasoning in...
Laravel AI SDK Tutorial Part 2: Build a RAG-Powered Support Bot with Tools and Memory
Dev.to · Hafiz 🔍 RAG & Vector Search 4mo ago
Laravel AI SDK Tutorial Part 2: Build a RAG-Powered Support Bot with Tools and Memory
Laravel AI SDK tutorial: build a RAG support bot with custom tools, conversation memory, vector embeddings, similarity search, and full testing. Step-by-step.
Vectors, embeddings, and search: an intuition-first guide
Dev.to · Nicolas 🔍 RAG & Vector Search 4mo ago
Vectors, embeddings, and search: an intuition-first guide
PS: You can find this fully animated article with concrete examples on my blog :...
Building a RAG pipeline with Kreuzberg and LangChain
Dev.to · TI 🔍 RAG & Vector Search 4mo ago
Building a RAG pipeline with Kreuzberg and LangChain
Most discussions about retrieval-augmented generation (RAG) focus on choosing the right model, tuning...
# Introducing chunklet-py 2.2.0+:
Dev.to · Speedyk-005 🔍 RAG & Vector Search 4mo ago
# Introducing chunklet-py 2.2.0+:
The Smart Text Chunking Library You Didn't Know You Needed Ever tried splitting text for your RAG...
Building a Global AI Design Studio: RAG, Vector Search, and the Physics of Localization
Dev.to · Thirupathi Pyati 🔍 RAG & Vector Search 4mo ago
Building a Global AI Design Studio: RAG, Vector Search, and the Physics of Localization
The Hidden Cost of Localization If you’ve ever built a global product, you know the...
How OpenClaw Orchestrates Long-Term Memory
Dev.to · Chen-Hung Wu 🔍 RAG & Vector Search 4mo ago
How OpenClaw Orchestrates Long-Term Memory
Files Are the Source of Truth Forget embeddings stored in some opaque vector database...
I Built a RAG Bot to Fix Flaky Cypress Tests
Dev.to · Miguel Porras 🔍 RAG & Vector Search 4mo ago
I Built a RAG Bot to Fix Flaky Cypress Tests
I was spending hours every week manually fixing the same flaky Cypress tests. The failures had...
Zero-Downtime Embedding Migration: Switching from text-embedding-004 to text-embedding-3-large in Production
Dev.to · Humza Tareen 🔍 RAG & Vector Search 4mo ago
Zero-Downtime Embedding Migration: Switching from text-embedding-004 to text-embedding-3-large in Production
Our embedding model got deprecated overnight. Every RAG query started returning 404s. Here's the...
Building an AI DJ: What I Got Wrong About Music Embeddings 🎶🕺
Dev.to · Brandon Lozano 🔍 RAG & Vector Search 4mo ago
Building an AI DJ: What I Got Wrong About Music Embeddings 🎶🕺
It All Begins With a Vibe Gripe We have all been there, at weddings, maybe even our own....
Choosing the Right Vector Embedding Model and Dimension: A School Analogy That Makes Everything Clear
Dev.to · Seenivasa Ramadurai 🔍 RAG & Vector Search 4mo ago
Choosing the Right Vector Embedding Model and Dimension: A School Analogy That Makes Everything Clear
A practical guide for AI engineers, RAG architects, and anyone building systems that need to...
When Similarity Search Breaks: Why RAG Fails on Numerical Queries
Dev.to · Akshay Rajinikanth 🔍 RAG & Vector Search 4mo ago
When Similarity Search Breaks: Why RAG Fails on Numerical Queries
I was building a chatbot using Retrieval-Augmented Generation (RAG) over a semi-structured...
RAG Recall vs Precision: A Practical Diagnostic Guide for Reliable Retrieval
Dev.to · Daniel R. Foster 🔍 RAG & Vector Search 4mo ago
RAG Recall vs Precision: A Practical Diagnostic Guide for Reliable Retrieval
Understand the real difference between recall and precision in RAG systems and learn a practical diagnostic framework to fix retrieval failures effectively.
Advanced RAG: Parsing Complex Medical PDFs with LayoutLMv3 and LlamaIndex
Dev.to · Beck_Moulton 🔍 RAG & Vector Search 4mo ago
Advanced RAG: Parsing Complex Medical PDFs with LayoutLMv3 and LlamaIndex
Let’s be honest: PDFs are where data goes to die. Especially medical check-up reports. They are a...
What we learned from 100+ production RAG deployments (free 118-page handbook)
Dev.to · Trilok Kanwar 🔍 RAG & Vector Search 4mo ago
What we learned from 100+ production RAG deployments (free 118-page handbook)
We’ve been building RAG systems for a while and wanted to share a resource we just published. It’s a...
RAG Anti-Patterns in Production: What Breaks and Why
Dev.to · Shreekansha 🔍 RAG & Vector Search 4mo ago
RAG Anti-Patterns in Production: What Breaks and Why
Retrieval-Augmented Generation (RAG) is often presented as a simple three-step process: embed,...
From Chatbot to Medical AI: How I Used RAG, FAISS & Mistral to Ground AI in Reality
Dev.to · Prateek Mangalgi 🔍 RAG & Vector Search 4mo ago
From Chatbot to Medical AI: How I Used RAG, FAISS & Mistral to Ground AI in Reality
Most AI demos look impressive. They answer anything. They speak confidently. They sound...
Embedding Static Files in Go for Production-Grade Deployments
Dev.to · Anand B 🔍 RAG & Vector Search 4mo ago
Embedding Static Files in Go for Production-Grade Deployments
One common source of production issues in Go services is filesystem dependency. Applications often...
Beyond Customer Support: Building Production-Grade Financial RAG Systems
Dev.to · ruchika bhat 🔍 RAG & Vector Search 4mo ago
Beyond Customer Support: Building Production-Grade Financial RAG Systems
The Day Our Financial Chatbot Almost Cost a Client 1,00,000 Six months into production,...
I built a memory system that outperforms standard RAG on temporal queries -- try the live playground
Dev.to · Martin Minchev 🔍 RAG & Vector Search 4mo ago
I built a memory system that outperforms standard RAG on temporal queries -- try the live playground
Standard RAG stores text as vectors and returns the closest match by similarity. It works, but it has...