For multi-zone clusters, a failure to create a new node because one zone has no capacity should be retried in another zone
I just ended up in a situation where my cluster was trying to autoscale but couldn't create a new node in zones/asia-east1-c because that zone didn't have resources available. This caused autoscaling to just give up.
However, my cluster spans asia-east1-b and asia-east1-c, and by manually creating a new GCE machine in c I was able to make it work.
It'd be good if this was automatic.
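To illustrate the behaviour I have in mind, here is a minimal sketch of a zone fallback loop. Everything in it is hypothetical: `ZoneCapacityError`, `create_node_with_fallback`, and `fake_create_node` are made-up names for illustration, not GKE or GCE APIs, and the real autoscaler may need to handle this quite differently.

```python
from typing import Callable, List, Sequence


class ZoneCapacityError(Exception):
    """Hypothetical error meaning a zone has no resources for a new node."""


def create_node_with_fallback(
    zones: Sequence[str], create_node: Callable[[str], str]
) -> str:
    """Try each zone in order and return the first node that gets created."""
    exhausted: List[str] = []
    for zone in zones:
        try:
            return create_node(zone)
        except ZoneCapacityError:
            # No capacity here: remember the zone and move on to the next one
            # instead of giving up on the scale-up entirely.
            exhausted.append(zone)
    raise ZoneCapacityError("no capacity in any zone: " + ", ".join(exhausted))


def fake_create_node(zone: str) -> str:
    """Stand-in for the real node creation call (assumption, for the demo only)."""
    if zone == "asia-east1-c":
        raise ZoneCapacityError(zone)
    return "node-in-" + zone


if __name__ == "__main__":
    print(create_node_with_fallback(["asia-east1-c", "asia-east1-b"], fake_create_node))
```

The point is only that a capacity failure in one zone is treated as a reason to try the cluster's other zones, and the scale-up fails only once every zone has been tried.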
OOPS, sorry, just realized you’re suggesting this for GKE. Not sure if Regional Managed Instance Groups (RMIG) are supported there yet, so we’ll leave this as an open suggestion for now.
Hi, please check out the relatively new Regional Managed Instance Groups, which should do exactly what you want.