October 1, 2019

196 words 1 min read

Dynamic Pod Resource Boundary Adjustment in Web Scale Clusters

Dynamic Pod Resource Boundary Adjustment in Web Scale Clusters

Have you ever confused about how to set perfect resource limit for Pod? How do you balance resource efficiency with application's SLO? In this talk, we will share practices and lessons learned from a …

Talk Title Dynamic Pod Resource Boundary Adjustment in Web Scale Clusters
Speakers Xiaoyu Zhang (Senior Engineer, Alibaba), Cheng Wang (Software engineer, Alibaba)
Conference KubeCon + CloudNativeCon
Conf Tag
Location Shanghai, China
Date Jun 23-26, 2019
URL Talk Page
Slides Talk Slides
Video

Have you ever confused about how to set perfect resource limit for Pod? How do you balance resource efficiency with application’s SLO? In this talk, we will share practices and lessons learned from adjusting Pod resource limits dynamically for Web-scale clusters at Alibaba Group by co-locating Pods with different QoS classes on the same node and adjusting Pod resource limits dynamically (especially during resource contention). After applying this effort in production clusters, we were able to improve the cluster resource usage by 14%~30%, tail latency (95 percentile) by 76%~87%, and TPS (transactions per second) by 107%~163%, respectively. The audiences would benefit from experience of improving the resource utilization and application performance for their own clusters, with Kubernetes native approaches.

comments powered by Disqus