Dynamic Pod Resource Boundary Adjustment in Web Scale Clusters
Have you ever confused about how to set perfect resource limit for Pod? How do you balance resource efficiency with application's SLO? In this talk, we will share practices and lessons learned from a …
Talk Title | Dynamic Pod Resource Boundary Adjustment in Web Scale Clusters |
Speakers | Xiaoyu Zhang (Senior Engineer, Alibaba), Cheng Wang (Software engineer, Alibaba) |
Conference | KubeCon + CloudNativeCon |
Conf Tag | |
Location | Shanghai, China |
Date | Jun 23-26, 2019 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Have you ever confused about how to set perfect resource limit for Pod? How do you balance resource efficiency with application’s SLO? In this talk, we will share practices and lessons learned from adjusting Pod resource limits dynamically for Web-scale clusters at Alibaba Group by co-locating Pods with different QoS classes on the same node and adjusting Pod resource limits dynamically (especially during resource contention). After applying this effort in production clusters, we were able to improve the cluster resource usage by 14%~30%, tail latency (95 percentile) by 76%~87%, and TPS (transactions per second) by 107%~163%, respectively. The audiences would benefit from experience of improving the resource utilization and application performance for their own clusters, with Kubernetes native approaches.