Autoscaling monitors your workloads and automatically adjusts capacity to maintain steady, predictable performance while optimizing for cost. In Kubernetes, automatic scaling applies at two levels: the number of pods running a workload, and the compute capacity of the cluster that hosts them.
In this chapter, we'll explore the mechanisms available for scaling at both levels.
Cluster Autoscaler (CA)
Installation
Scale with CA
Cluster Over-Provisioning
How it works
Setting up Over-Provisioning
Scaling further
Karpenter
Install Karpenter
Set up the Node Pool
Automatic Node Provisioning
Disruption (Consolidation)
Further learning