Session

Autoscaling services on all dimensions

Why doing toil, if the machine can do it for you? This talk covers all of the multitude of autoscaling mechanisms applicable to service meshes made by containers managed by systems like Borg, Kubernetes, Swarm or DC/OS. From vertical, horizontal, auto turnup, load shifting, etc.

When deploying containerised stateless services on a clusters managed by Kubernetes, for example, the most efficient way to run them is with the minimal number of replicas possible to cover the load, maximising the utilisation of resources. How to calculate the number of replicas to maintain a reliable service can be tricky: Pod restarts, traffic imbalances, load shifts, etc.

Further, vertically scaling services is a multi dimension problem and services based on virtual machines like the JVM present specific challenges for autoscaling.

Configuring the autoscaler for the right utilisation levels, using the right metrics and the right decaying factors is key for successfully scaling services.

Ramón Medrano Llamas

Senior Staff Site Reliability Engineer at Google

Zürich, Switzerland

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top