Horizontal vs Vertical Scaling: When to Grow Out, When to Grow Up

Every team debates this eventually. Here's the math, the trade-offs, and why 'just scale out' is often the wrong first answer.

April 09, 20257 min readScalingArchitecture

Two ways to handle more load: make one machine bigger (vertical) or add more machines (horizontal). Most teams assume horizontal is the "real" answer. Most of the time that assumption is wrong — at least to start.

Vertical scaling

Bigger CPU, more RAM, faster disks. The app stays simple: one process, one database, one cache, one copy of state.

Pros:

No distributed-system problems. No eventual consistency, no split-brain, no consensus.
Code ports 1:1 from laptop to prod.
Modern single machines are huge: a $2k/month cloud VM goes to 96 vCPU / 384 GiB RAM. That handles an enormous amount of traffic.

Cons:

A single point of failure.
Hard ceiling: the biggest box ever.
Upgrades require downtime or a careful failover.

Horizontal scaling

More replicas behind a load balancer; state in shared systems (Postgres, Redis, S3). Autoscale based on CPU or request rate.

Pros:

Linear-ish scaling of stateless workloads (APIs, renderers).
Natural high availability — a dead node loses 1/N of capacity, not 100%.
Rolling deploys with zero downtime.

Cons:

State becomes someone else's problem (the DB, the cache) — and that "someone else" is now the bottleneck.
Session stickiness, cache coherence, idempotency, distributed tracing — all of which cost engineer-time.

The honest order of operations

Profile and optimise the single-node path first. A 3× speedup in code is a 3× capacity increase for free.
Scale vertical until it becomes awkward. Cheaper and simpler than making the app cluster-safe.
Scale the stateless tier horizontally. Put the API behind a load balancer; multiple pods, one DB.
Scale the stateful tier last. Read replicas → sharding → ultimately multi-region. Each step is a project.

The database is always the bottleneck

When people say "we're scaling horizontally," they usually mean the app tier. The database still stands alone and still eventually gets too big. That's when you pay the complexity tax: read replicas, sharding, or a move to a distributed database (Spanner, CockroachDB, Yugabyte). Plan for it, but don't volunteer early.

Horizontal vs Vertical Scaling: When to Grow Out, When to Grow Up

Vertical scaling

Horizontal scaling

The honest order of operations

The database is always the bottleneck

6 replies// weighed in

More from this topic

CAP Theorem: What It Actually Says (and What It Doesn't)

Idempotency in APIs: The Cheap Fix for Half Your Retry Bugs

How I Stopped Worrying and Learned to Love Cell-Based Architecture