Disabled VPA (keeping data collection mode)
For dev environments, we switched node types to t3, reducing their cost by about 50%*
Configured node consolidation policies in Karpenter to remove underutilized instances
Set up spot instances for GitHub Actions runners
We conducted an infrastructure audit and implemented a series of optimizations:
Reconfigured the Descheduler, raising the allowable node utilization to 70%
Configured requests and limits for all services
Set up HPA (Horizontal Pod Autoscaler) for services with variable load
Set the minimum number of replicas in production services to 3