Database Sharding and Partitioning

What:

Horizontal partitioning (sharding) splits single tables horizontally into separate database servers; vertical partitioning splits table columns into separate tables.

Primary purpose:

Scaling write throughput and storage capacity beyond the physical hardware limits of a single master server.

Usually used for:

High-volume transactional databases, heavy social timeline layers, and multi-tenant SaaS backends.

How should I think about this inside system architectures?

🔑 The Sharding Key Rule

Select a key (e.g. `user_id` or `tenant_id`) that routes over 90% of your common queries to a single shard, entirely avoiding cross-shard lookups.

⚖️ Even Distribution

Choose keys with high cardinality to distribute storage and write QPS uniformly across your database node pool.

🧬 Application Routing

Move routing logic either into an application middleware library (e.g., Vitess) or deploy a dedicated proxy tier (e.g., Citus).

Sharding Strategies Matrix: Categorizing partitioning models:

Strategy	Routing Model	Primary Benefit	Severe Cost
Range-Based	Map key intervals (e.g. A-E, F-L) to specific shard nodes.	Trivial to implement; ideal for clustered range queries.	Creates severe hotspots (e.g. all updates on active suffix keys).
Hash-Based	Apply hash function to partition key modulo node count: `hash(key) % N`.	Ensures uniform data distribution across nodes.	Adding/removing nodes remaps almost 100% of keys (requires Consistent Hashing).
Geo-Located	Route user tables to nodes inside their geographic region.	Sub-millisecond latency; solves local compliance laws.	Complex cross-region backups and failovers.

Benefit	Cost
Horizontal Scalability (bypasses the physical scale limit of a single server, scaling database QPS linearly)	No Distributed Joins (joining tables across distinct shards is extremely slow; forces application-layer joints)
Small Blast Radius (if shard node A crashes, only 10% of users are degraded, keeping the remaining pool alive)	Loss of ACID Atomicity (transactions span multiple shards, requiring complex 2-Phase Commit protocols)

Production System	Sharding Key Selection	Architectural Rationale
Twitter Timeline Shards	user_id (Hash-sharded)	Queries pull timeline segments per user; sharding by `user_id` keeps reads bound to a single shard.
E-Commerce Ledger	tenant_id (Lookup-sharded)	Multi-tenant B2B platforms isolate customers completely, routing all tenant requests to dedicated databases.

Vertical Partitioning vs Horizontal Sharding

1. Horizontal Sharding

2. Vertical Partitioning

When Sharding Is Premature

Cross-Shard Operations