October 6, 2019

223 words 2 mins read

Understanding Scalability and Performance in the Kubernetes Master

Understanding Scalability and Performance in the Kubernetes Master

Currently, the scale limit of Kubernetes is 5k nodes, so if you want to use it to manage a web-scale cluster like 10k nodes, you probably can't make it. Have you wondered what is the performance bott …

Talk Title Understanding Scalability and Performance in the Kubernetes Master
Speakers Fansong Zeng (Staff Engineer, Alibaba), Xingyu Chen (software engineer, Alibaba)
Conference KubeCon + CloudNativeCon
Conf Tag
Location Shanghai, China
Date Jun 23-26, 2019
URL Talk Page
Slides Talk Slides
Video

Currently, the scale limit of Kubernetes is 5k nodes, so if you want to use it to manage a web-scale cluster like 10k nodes, you probably can’t make it. Have you wondered what is the performance bottleneck for Kubernetes to manage more than 5k nodes? When you want to expand its scalability to a new level, who’s to “blame” first? Etcd, apiserver, or scheduler? Understanding these questions is the key to operate a large-size kubernetes cluster. In Alibaba, we encountered many issues like pod creation gets extremely slower as the cluster grows to larger and larger. In this talk, we would like to share how we did various benchmark tests and profiling. And how we did tweaks/tunings on the master and achieved more than 100x performance improvement in the master. Currently, operating a 10K-node kubernetes cluster is just as smooth as a 2k-node one.

comments powered by Disqus