Kubernetes Pod Scaling: Horizontal Pod Autoscaler
Scale deployments automatically based on CPU utilization - and see why resource requests are a prerequisite for HPA to work.
Explore tutorials, challenges, courses, and more published by this author.
Scale deployments automatically based on CPU utilization - and see why resource requests are a prerequisite for HPA to work.
Start with nodeSelector for simple node targeting, then upgrade to node affinity for expressive matching that nodeSelector cannot handle.
Use LimitRange to set per-pod resource defaults and ResourceQuota to cap total namespace consumption.
See how Kubernetes determines BestEffort, Burstable, and Guaranteed classes from resource configuration - and which pods get evicted first under pressure.
See how CPU and memory requests influence where pods land - and what happens when no node can fit them.
Use PriorityClass to guarantee critical pods get scheduled even when the cluster is under resource pressure.
Nine hands-on challenges split into two sections. The resource section covers how requests, QoS classes, priority, and namespace quotas shape scheduling decisions. The placement section covers how affinity, topology spread, and dedicated nodes control where pods land. HPA closes the path by showing how requests drive autoscaling.