Right-Sizing EKS in Practice
EKS clusters rarely start inefficient. They drift there. What begins as a reasonable number of small nodes slowly becomes a fixed cost floor. HPAs are set optimistically. Storage is over-provisioned “just in case.” Instance types are chosen for safety, not fit. Recently, I revisited an EKS environment that had evolved into: 14 × t4g.medium nodes Predominantly gp2 volumes HPA minimum replicas set to 3 or 4 across multiple services Nothing was on fire. Services were stable. But the cost profile didn’t match the actual workload behaviour. ...