Core AI

RAG & Vector Search

Retrieval-augmented generation, vector databases, embeddings and semantic search

4,143 lessons
Skills in this topic
RAG Basics (beginner): Chunk documents with LangChain or LlamaIndex
Vector Stores (intermediate): Set up Pinecone, Weaviate, or pgvector
RAG Evaluation (intermediate): Run RAGAS evaluation on a RAG pipeline
Advanced RAG (advanced): Build a hybrid BM25 + dense retrieval pipeline
All Reads (679) · Articles (192) · Blog Posts (418) · Tutorials (67) · Research Papers (2)
Document-to-Markdown for RAG: Preparing Documents for Your AI Knowledge Base
Dev.to · Iteration Layer 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
Your RAG Pipeline Is Only as Good as Its Ingestion Every team building retrieval-augmented...
RAG from Public Documentation Websites: Robots.txt, Terms, Retention, and Attribution
Dev.to · Iteration Layer 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
Public Docs Are the Easiest RAG Source to Get Wrong Every AI support project eventually...
I built a GraphRAG demo with FalkorDB’s new SDK, then benchmarked it against Neo4j
Dev.to · Dan Shalev 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
FalkorDB shipped graphrag-sdk v1.0.0rc1 and I wanted to see how it feels on real content, not a toy...
How I used Gemini CLI to orchestrate a complex RAG migration
Dev.to · Remigiusz Samborski 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
Building a complex, multi-phase cloud project like a RAG migration is as much about orchestration as...
RAG in Practice — Part 8: RAG in Production — What Breaks After Launch
Dev.to · Gursharan Singh 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
Why production RAG drifts, degrades, and quietly fails — and the patterns and discipline that prevent it.
The 10-Layer Security System Your RAG Pipeline Is Missing
Dev.to · klement Gunndu 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
Your RAG pipeline has a front door and a back door. Both are wide open. The front door lets users...
RAG in Practice — Part 7: Your RAG System Is Wrong. Here's How to Find Out Why.
Dev.to · Gursharan Singh 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
Evaluation, metrics, LLM-as-a-judge, and the diagnostic spine. The single most important debugging habit in RAG.
Speaking the Corpus's Language: How Multilingual RAG Stays Coherent Across Turns
Dev.to · HarinezumIgel 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
This article assumes you already run a multi-turn RAG pipeline and have query rewriting...
10 Chunking Strategies That Make or Break Your RAG Pipeline
Dev.to · klement Gunndu 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
A 2025 peer-reviewed study (Vectara, NAACL 2025) found something most RAG teams get...
Optimizing Web Scraping Data to Reduce RAG Token Costs
Dev.to · AlterLab 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
Feeding raw HTML into a Retrieval-Augmented Generation (RAG) pipeline is a fast way to burn through...
RAG Is Failing in Production — Here’s Why (and What I’m Testing Instead)
Dev.to · Eduardo Borges 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
RAG (Retrieval-Augmented Generation) looks great in demos. But in real-world systems, it often fails...
FLAMEHAVEN FileSearch: Why This RAG Engine Feels Different from the Usual Stack
Dev.to · Kwansub Yun 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
RAG is...
Why Running RAG Pipelines on Serverless Functions Was Harder Than I Expected
Dev.to · pythonassignmenthelp.com 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
You know that feeling when you think you’ve nailed a scalable architecture—only to watch your...
RAG vs GraphRAG: When to Use What (From a Builder’s Perspective)
Dev.to · System Rationale 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
I wasted time overengineering a GraphRAG system… when a simple RAG pipeline would’ve done the job...
I built a production RAG pipeline. Here's what most tutorials skip.
Dev.to · Anurag Srivastava 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
I wanted a RAG system that was fast to run and fast to set up for clients. Upload a PDF, ask...
Building RAG Pipelines That Actually Work: Lessons from Microsoft Copilot
Dev.to · Nrk Raju Guthikonda 🔍 RAG & Vector Search ⚡ AI Lesson 2mo ago
Most RAG tutorials show you the happy path. You chunk a handful of PDFs, toss them into a vector...
I Built an AI Chatbot That Knows Everything About Me
Dev.to · Akromdev 🔍 RAG & Vector Search 3mo ago
How I built a RAG chatbot for my portfolio site using local embeddings, a JSON file, and zero monthly cost.
Retrieval Finds Candidates. Reranking Finds the Right One.
Dev.to · Seenivasa Ramadurai 🔍 RAG & Vector Search 3mo ago
A hiring analogy that finally makes RAG Reranking click First, What Is RAG? Before we get...
Build Your Own AI Medical Brain: Transforming PDF Health Reports into a Graph-RAG Powerhouse with Neo4j and LangChain
Dev.to · Beck_Moulton 🔍 RAG & Vector Search 3mo ago
We’ve all been there: you get your annual health checkup results as a messy, 20-page PDF filled with...
Building Production RAG Systems with PostgreSQL: Complete Implementation Guide
Dev.to · Pablo Ifrán 🔍 RAG & Vector Search 3mo ago
Step-by-step guide to building RAG systems that actually work in production, with working code and performance optimization
Building a RAG System in Laravel from Scratch
Dev.to · PHP CMS Framework 🔍 RAG & Vector Search 3mo ago
Most RAG tutorials start with "first, sign up for Pinecone." I'm going to skip that entirely. For the...
The Search Router That Saved Me From One Index to Rule Them All: Azure AI Search for CRM, pgvector for Transcripts, the CRM a...
Dev.to · Daniel Romitelli 🔍 RAG & Vector Search 3mo ago
I hit the moment every enterprise AI system eventually hits: one query needed faceting and synonym maps, another needed raw vector similarity over ...
Simple and cheap RAG - genai-toolbox and pgvector
Dev.to · Marcin Niemira 🔍 RAG & Vector Search 3mo ago
I recently hit a common architectural fork in the road while building my ADK (Agent Development Kit)...
Build a Local RAG Pipeline With Ollama + pgvector — No API Keys, No Cloud
Dev.to · SIGNAL 🔍 RAG & Vector Search 3mo ago
Retrieval-Augmented Generation is one of those ideas that sounds complex until you actually build it....
Building a Local RAG System with Citations (No Hallucinations)
Dev.to · Subrata Kumar Das 🔍 RAG & Vector Search 3mo ago
Part of my 24-week journey: Mapping the Blueprint for the 2030 AI Stack By Subrata Kumar Das ...
Stop Hallucinations in AI Agents: 4 Essential Techniques
Dev.to · Elizabeth Fuentes L 🔍 RAG & Vector Search 3mo ago
4 techniques to stop hallucinations in AI agents: Graph-RAG for precise retrieval of...
Setting Up CocoIndex with Docker and pgvector - A Practical Guide
Dev.to · Steven Leggett 🔍 RAG & Vector Search 3mo ago
A step-by-step guide to setting up CocoIndex with Docker, pgvector, and semantic search - covering all the gotchas the docs don't mention.
Building a Medical History Chatbot - FHIR, Vector Search and RAG for beginners
Dev.to · InterSystems Developer 🔍 RAG & Vector Search 3mo ago
Introduction Earlier this year, I set about creating kit to introduce young techy folk at...
SurfaceDocs + LlamaIndex: From RAG Pipeline to Shareable Report
Dev.to · SurfaceDocs 🔍 RAG & Vector Search 3mo ago
Your RAG pipeline answers questions beautifully — then the output prints to stdout and dies. Here's how to give it a permanent, shareable home with SurfaceDocs.
The Embedding Space: Where Words Become Geometry
Dev.to · Neural Download 🔍 RAG & Vector Search 3mo ago
https://www.youtube.com/watch?v=1sQffYYe6Y8 How Computers Understand Meaning: The Math...
Stop Calling FAISS a Database: The VectorStore vs. VectorDB Showdown🧠⚡
Dev.to · Ananya S 🔍 RAG & Vector Search 3mo ago
If you’ve been building with LangChain, you’ve probably used Chroma or FAISS and called them...
How I Built a Production RAG Pipeline with FastAPI, pgvector and Cross-Encoder Reranking
Dev.to · Martin Palopoli 🔍 RAG & Vector Search 3mo ago
I built a production RAG engine that combines hybrid search (pgvector + BM25), cross-encoder...
Building a Vector Database from Scratch
Dev.to · Kurotsuba 🔍 RAG & Vector Search 3mo ago
I have built a vector database with Rust from...
Stop Overpaying for VectorDBs: Architecting Serverless RAG on AWS
Dev.to · Dhananjay Lakkawar 🔍 RAG & Vector Search 3mo ago
Building a Retrieval-Augmented Generation (RAG) prototype takes a weekend. Taking that prototype to...
I built ragway — a Python RAG library controlled by a single YAML file
Dev.to · Swapanth Vakapalli 🔍 RAG & Vector Search 3mo ago
I spent the last few months building ragway, a Python RAG library where the entire pipeline is...
RAG Is a Data Problem Before It’s a Prompt Problem
Dev.to · Lukas 🔍 RAG & Vector Search 3mo ago
I made this mistake myself while debugging a RAG pipeline. If your RAG feature keeps returning...
Build an MCP Server That Finds Your RAG Chatbot's Blind Spots
Dev.to · Damla Hamurcu 🔍 RAG & Vector Search 3mo ago
RAG chatbots fail silently. Imagine you've hired a chatbot to handle FAQs on your behalf, so you can...
Building Production-Ready AI Document Processing Pipelines with RAG
Dev.to · Abhishek Nair 🔍 RAG & Vector Search 3mo ago
A battle-tested guide to architecting, implementing, and scaling document intelligence systems that...
GPU-Bridge + LlamaIndex: Embeddings and Reranking in One Line
Dev.to · GPU-Bridge 🔍 RAG & Vector Search 3mo ago
Drop-in LlamaIndex integration for GPU-Bridge. High-throughput embeddings at $0.00002/call and semantic reranking for RAG pipelines.
Building Semantic Search with Spring Boot, PostgreSQL, and pgvector (RAG Retrieval)
Dev.to · Allan Roberto 🔍 RAG & Vector Search 3mo ago
In the previous article, we built the indexing pipeline for our knowledge base: documents are...
Moving Beyond Static RAG: Building a Live Financial Quant MCP Server for Real-Time Market Analysis
Dev.to · Praneet Gogoi 🔍 RAG & Vector Search 3mo ago
Most developers today associate Retrieval-Augmented Generation (RAG) with one thing: Embeddings +...
Beyond SEO: Generative Engine Optimization (GEO). How to Implement `llms.txt` and RAG-Friendly Markup
Dev.to · ensei mania 🔍 RAG & Vector Search 3mo ago
Hi DEV community! I'm Yusuke Sato, CEO of LIFRELL. I travel to tech conferences across the US,...
PostgreSQL: First Approach to Vector Databases with pgvector and Python
Dev.to · Mario García 🔍 RAG & Vector Search 3mo ago
If you're already familiar with relational databases like PostgreSQL, you're one step closer to start...
Build a GraphRAG Knowledge Base in NodeJS
Dev.to · Muhammad Arslan 🔍 RAG & Vector Search 3mo ago
We've released hazeljs-rag-graph-starter — a production-ready example that indexes the entire HazelJS...
Fine-Tuned Small Models Beat RAG: The 2026 Economics
Dev.to · Dr Hernani Costa 🔍 RAG & Vector Search 3mo ago
When your support team processes 10,000 tickets monthly, the difference between a $0.001 and $0.1...
The 4-slot hook pipeline: how every CRUD operation feeds four systems at once
Dev.to · Victor García 🔍 RAG & Vector Search 3mo ago
A simple post-CRUD pipeline that feeds embeddings, heat tracking, entity extraction, and the changelog — without any of them blocking each other or the user.
Building WhisperGrid: The Future of Multimodal Semantic Search with Gemini Embedding 2
Dev.to · Harish Kotra (he/him) 🔍 RAG & Vector Search 3mo ago
In the world of search, we've long been confined to keywords. Even with the advent of image search,...
RAG vs Agentic RAG: When to Use in NodeJS
Dev.to · Muhammad Arslan 🔍 RAG & Vector Search 3mo ago
Both RAG and Agentic RAG in HazelJS use the same foundation — vector stores, embeddings, and document...