Definition: Scalability strategy that increases resources (CPU, RAM, disk) of an existing machine to handle greater load.
— Source: NERVICO, Product Development Consultancy
What is Vertical Scaling
Vertical scaling, also known as scale-up, is a scalability strategy that adds more power to an existing machine: more CPU, more RAM, more storage, or greater network bandwidth. Instead of distributing the load across multiple servers, a single server is made powerful enough to handle all demand.
How it works
The typical process involves changing the instance type or hardware configuration of an existing server. In cloud environments, this means selecting a higher-capacity instance: for example, migrating from an instance with 2 vCPUs and 8 GB of RAM to one with 16 vCPUs and 64 GB of RAM. In on-premise infrastructure, it means adding RAM modules, faster disks, or additional processors. The application requires no code changes, as it simply has more resources to run the same logic. In most cases, vertical scaling requires a brief downtime period while the new configuration is provisioned.
Why it matters
Vertical scaling is the fastest and simplest way to respond to an immediate performance issue. It requires no changes to the application architecture, introduces no distributed systems complexity, and works with any type of software, including legacy applications not designed to run across multiple instances. However, it has a physical limit: even the largest instances from cloud providers have a resource ceiling, and costs grow non-linearly as you scale up.
Practical example
A PostgreSQL database begins experiencing slow query times as data volume grows. The team migrates the RDS instance from a db.r6g.large type (2 vCPUs, 16 GB RAM) to a db.r6g.4xlarge (16 vCPUs, 128 GB RAM). Query times improve immediately without changing a single line of code. The monthly cost increases from $200 to $1,600, but the solution is implemented in under an hour. Meanwhile, the team plans a horizontal scaling strategy with read replicas for a long-term solution.
Related terms
- Horizontal Scaling - Alternative strategy that distributes load across multiple machines
- Scalability - General concept encompassing both vertical and horizontal scaling
Last updated: February 2026