Database Sharding Explained | Range vs Hash vs Directory Sharding

BazAI · Intermediate ·🏗️ Systems Design & Architecture ·5mo ago

Skills: Systems Design Basics90%Distributed Systems80%

Key Takeaways

The video explains database sharding, its importance in large-scale system design, and discusses range-based, hash-based, and directory-based sharding strategies. It highlights the trade-offs of each approach and the importance of choosing the right shard key.

Full Transcript

Welcome to design. Today we're breaking [music] down one of the most important concepts in large scale system design, database chararda. At the simplest level, chararda is about survival. When a single monolithic database can no longer handle growing data size, read traffic, or write throughput, vertical scaling stops working. You can't keep adding CPU or memory forever. Sharding solves this by splitting data horizontally across multiple independent databases called shards. Each shard holds only a subset of the total data, but together they represent the complete system. Once data is distributed, reads and writes happen in parallel. Throughput scales almost linearly and failures are isolated instead of taking down the entire system. There are multiple ways to shard data and each comes with trade-offs. Rangebased sharding splits data by value ranges such as price, user ID ranges, or timestamps. This is simple and efficient for range queries, but has a major weakness. If traffic concentrates on a specific range, you end up with hot shards, uneven load, and degraded performance. Keybase sharding, also known as hashbased sharding, uses a hash function on the shard key to determine where data lives. This gives excellent load distribution and avoids hot spots. However, it breaks natural ordering, which makes range query slower and more complex. Directory-based sharding uses a lookup service that maps attributes like region or tenant to specific shards. This approach is highly flexible and allows dynamic rebalancing, but introduces an additional dependency. If the directory becomes slow or unavailable, the entire system is affected. In practice, large-scale systems often combine multiple sharding strategies to balance performance, flexibility, and operational complexity. The most critical decision in any shard system is choosing the shard key. A good shard key must have high cardality so data spreads evenly. It should have uniform access frequency to avoid hotspots and it should not be monotonically increasing. Keys like time stamps or auto increment IDs may seem convenient but they silently create unbalanced shards and bottlenecks over time. Once data is shard aid every request must be routed correctly. This can be done in three ways. Shardaware clients route requests directly but are tightly coupled to shard topology. A dedicated routing tier centralizes shard logic and offers clean abstraction at scale. Shardaware nodes sit in between providing a balance of flexibility and control. Most modern architectures prefer a routing tier for long-term evolvability. Finally, sharding is almost always combined with replication. Each shard has a leader that handles rights and multiple followers that serve reads. Leaders are distributed across nodes, so no single machine becomes a bottleneck. This design delivers high availability, fall tolerance, and horizontal scalability, but it also introduces coordination and consistency challenges. Sharding is not just a database optimization. It's a foundational architectural decision that impacts APIs, query design, consistency models, and operational complexity. Master it, and you're no longer just building applications. You're designing distributed systems.

Original Description

Database sharding is one of the most critical concepts in large-scale system design. In this video, we break down database sharding in just 3 minutes—covering what sharding is, why monolithic databases fail at scale, and how modern distributed systems shard data efficiently. You’ll learn: What database sharding really means Range-based, hash-based, and directory-based sharding How to choose the right shard key Request routing strategies in sharded systems How replication works alongside sharding Real-world trade-offs used by large-scale platforms This walkthrough is perfect for: Software engineers Backend & platform engineers System design interview prep Architects building high-scale data systems If you want more deep, practical system design content, subscribe to Bazai.

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Playlist UUOthur5d9OxdqEh08Swtirw · BazAI · 34 of 49

← Previous Next →

How LLM Agents Actually Do Deep Research (Planning, Tools & Citations Explained

How LLM Agents Actually Do Deep Research (Planning, Tools & Citations Explained

Kafka vs RabbitMQ Explained: Which One Should You Use?

Kafka vs RabbitMQ Explained: Which One Should You Use?

#NOVER Explained: How AI Learns to Judge Its Own Reasoning (No Reward Model Needed)

#NOVER Explained: How AI Learns to Judge Its Own Reasoning (No Reward Model Needed)

The State of Enterprise AI 2025: How Workers Save 60 Minutes Daily & Adoption Explodes 9X

The State of Enterprise AI 2025: How Workers Save 60 Minutes Daily & Adoption Explodes 9X

NVIDIA Nemotron 3: 1M Context, Hybrid MoE Architecture, and Open Source AI Agents

NVIDIA Nemotron 3: 1M Context, Hybrid MoE Architecture, and Open Source AI Agents

How Service Mesh Works: Data Plane, Control Plane & Observability

How Service Mesh Works: Data Plane, Control Plane & Observability

How to Design Safe Retries in Microservices (No Duplicates, No Overload)

How to Design Safe Retries in Microservices (No Duplicates, No Overload)

Step-GUI: The Self-Evolving AI Agent for Android & PC (SOTA Performance!)

Step-GUI: The Self-Evolving AI Agent for Android & PC (SOTA Performance!)

NVIDIA's NitroGen: The First Generalist AI Trained to Play 1,000+ Games by Watching

NVIDIA's NitroGen: The First Generalist AI Trained to Play 1,000+ Games by Watching

How AI Agents Remember: The Evolution of Agentic Memory (2025 Guide)

How AI Agents Remember: The Evolution of Agentic Memory (2025 Guide)

Automate Your AI Data Pipelines: Introducing DataFlow & DataFlow-Agent

Automate Your AI Data Pipelines: Introducing DataFlow & DataFlow-Agent

Nemotron 3 Explained: Hybrid Mamba + MoE for 1M Token Agents

Nemotron 3 Explained: Hybrid Mamba + MoE for 1M Token Agents

Build Your Own AI Voice Agent (LangChain + OpenAI + AssemblyAI + Cartesia)

Build Your Own AI Voice Agent (LangChain + OpenAI + AssemblyAI + Cartesia)

Langflow 1.7 Explained: CUGA, ALTK, MCP & the Death of Prompt Engineering

Langflow 1.7 Explained: CUGA, ALTK, MCP & the Death of Prompt Engineering

HuatuoGPT-o1: The First Medical AI That "Thinks" Before It Answers

HuatuoGPT-o1: The First Medical AI That "Thinks" Before It Answers

Molmo2: Open-Source Vision-Language Models with State-of-the-Art Video Grounding

Molmo2: Open-Source Vision-Language Models with State-of-the-Art Video Grounding

MAI-UI: Alibaba’s New Foundation GUI Agents Outperforming Gemini & GPT-4o

MAI-UI: Alibaba’s New Foundation GUI Agents Outperforming Gemini & GPT-4o

Seamless AI Object Insertion: Bridging 4D Geometry and Diffusion Models

Seamless AI Object Insertion: Bridging 4D Geometry and Diffusion Models

5 AI Agentic Workflow Patterns-Reflection, Tools, ReAct, Planning, Multi‑Agent

5 AI Agentic Workflow Patterns-Reflection, Tools, ReAct, Planning, Multi‑Agent

#NVIDIA's New #SurgWorld: How AI is Learning Autonomous Surgery

#NVIDIA's New #SurgWorld: How AI is Learning Autonomous Surgery

CQRS Explained in 3 Minutes: How Modern Systems Scale Reads vs Writes

CQRS Explained in 3 Minutes: How Modern Systems Scale Reads vs Writes

Docker Explained in 3 Minutes: How Containers Actually Work

Docker Explained in 3 Minutes: How Containers Actually Work

6 Practical AWS Lambda Patterns in 3 Minutes (Real‑World Serverless Guide)

6 Practical AWS Lambda Patterns in 3 Minutes (Real‑World Serverless Guide)

Containerization Explained in 3 Minutes: From Dockerfile to Running Containers

Containerization Explained in 3 Minutes: From Dockerfile to Running Containers

Science Context Protocol (SCP)- Global Web of Autonomous Scientific Agents

Science Context Protocol (SCP)- Global Web of Autonomous Scientific Agents

Youtu-Agent: Scaling LLM Agent Productivity via Automated Generation and Hybrid RL

Youtu-Agent: Scaling LLM Agent Productivity via Automated Generation and Hybrid RL

#DeepSeek’s #mHC Breakthrough: Stabilizing Hyper-Connections for Large-Scale LLM Training

#DeepSeek’s #mHC Breakthrough: Stabilizing Hyper-Connections for Large-Scale LLM Training

Message Brokers 101 in 3 Minutes: Queues, Pub‑Sub & Competing Consumers Explained

Message Brokers 101 in 3 Minutes: Queues, Pub‑Sub & Competing Consumers Explained

Must‑Know Message Broker Patterns: Outbox, CQRS, Saga & More

Must‑Know Message Broker Patterns: Outbox, CQRS, Saga & More

Confucius Code Agent-Scalable Scaffolding for Large-Scale Repositories

Confucius Code Agent-Scalable Scaffolding for Large-Scale Repositories

#nvidia Just Fixed #GRPO! Meet #GDPO: The New Standard for Multi-Reward RL

#nvidia Just Fixed #GRPO! Meet #GDPO: The New Standard for Multi-Reward RL

NVIDIA Alpamayo-R1: Real-Time Reasoning for Level 4 Autonomy

NVIDIA Alpamayo-R1: Real-Time Reasoning for Level 4 Autonomy

The Future of AI Memory: Meet #AtomMem’s Learnable CRUD System

The Future of AI Memory: Meet #AtomMem’s Learnable CRUD System

Database Sharding Explained | Range vs Hash vs Directory Sharding

Database Sharding Explained | Range vs Hash vs Directory Sharding

12 Architecture Concepts Every Developer Must Know | System Design Explained

12 Architecture Concepts Every Developer Must Know | System Design Explained

5 Rate Limiting Strategies Explained | Protect Your System at Scale

5 Rate Limiting Strategies Explained | Protect Your System at Scale

How Live Streaming Works | System Design Explained

How Live Streaming Works | System Design Explained

5 Leader Election Algorithms Explained | Distributed Systems & Databases

5 Leader Election Algorithms Explained | Distributed Systems & Databases

6 Prompting Techniques to Get Better Results from ChatGPT

6 Prompting Techniques to Get Better Results from ChatGPT

Complete Guide to Storage Systems: RAM, SSD, SAN, Cloud & Databases

Complete Guide to Storage Systems: RAM, SSD, SAN, Cloud & Databases

Top 4 Authentication Mechanisms Explained | SSH, OAuth, SSL & Passwords

Top 4 Authentication Mechanisms Explained | SSH, OAuth, SSL & Passwords

Common Network Protocols Explained | TCP, UDP, HTTP, DNS & More

Common Network Protocols Explained | TCP, UDP, HTTP, DNS & More

Microservices Best Practices | 9 Rules Every Architect Must Know

Microservices Best Practices | 9 Rules Every Architect Must Know

8 Network Protocols Every Engineer Must Know | HTTP, TCP, UDP & More

8 Network Protocols Every Engineer Must Know | HTTP, TCP, UDP & More

Distributed Systems in 3 Minutes: CDNs, APIs, TCP & Idempotency Explained

Distributed Systems in 3 Minutes: CDNs, APIs, TCP & Idempotency Explained

Must‑Know Message Broker Patterns in 3 Minutes (Outbox, CQRS, Saga & More)

Must‑Know Message Broker Patterns in 3 Minutes (Outbox, CQRS, Saga & More)

Is OpenClaw Safe? The "Security Nightmare" Behind the Viral AI Agent

Is OpenClaw Safe? The "Security Nightmare" Behind the Viral AI Agent

JWT vs Sessions vs PASETO — Which Authentication Should You Use?

JWT vs Sessions vs PASETO — Which Authentication Should You Use?

Recursive LLMs vs Big Context Windows: Why RLM Wins

Recursive LLMs vs Big Context Windows: Why RLM Wins

This video teaches the fundamentals of database sharding, its importance in large-scale system design, and how to choose the right sharding strategy. It covers the trade-offs of range-based, hash-based, and directory-based sharding and the importance of selecting a good shard key.

Key Takeaways

Identify the need for database sharding
Choose a sharding strategy (range-based, hash-based, or directory-based)
Select a suitable shard key
Implement sharding
Configure replication
Route requests correctly

💡 Choosing the right shard key is crucial for efficient data distribution and avoiding hotspots

🔒 Pro feature: Ask AI to explain this lesson →

More on: Systems Design Basics

View skill →

Complete Application Deployment using Kubernetes Components | Kubernetes Tutorial 20

Complete Application Deployment using Kubernetes Components | Kubernetes Tutorial 20

TechWorld with Nana

How to write a Windows emulator for Linux from scratch

How to write a Windows emulator for Linux from scratch

Google for Developers

Deploying an ecommerce web app to GKE

Deploying an ecommerce web app to GKE

Getting started with Caddy the HTTPS Web Server from scratch

Getting started with Caddy the HTTPS Web Server from scratch

Build & Optimize React Native Product Listing Apps

Build & Optimize React Native Product Listing Apps

Serverless Functions with Zero Cold Starts: WebAssembly + Spin

Serverless Functions with Zero Cold Starts: WebAssembly + Spin

Akamai Developers

Related Reads

Your event store is already your audit log

Learn how to repurpose your event store as an audit log, reducing development overhead and improving data consistency

Distributed Transactions in System Design: Why Data Consistency Becomes Hard Once Your Application…

Learn how distributed transactions impact data consistency in system design and why it's crucial for scalable applications

Medium · Programming

Monolith vs Microservices: A Real-World Architectural Autopsy

Learn to decide between monolith and microservices architectures for your project and why it matters for scalability and maintainability

Dev.to · Erwin Wilson Ceniza2

FOV in FPS Games: The Math Behind Field of View Settings

Learn the math behind Field of View settings in FPS games and how to optimize your gameplay experience

Dev.to · Alex Carter

Retracing It All With My Son