Living with the Pathology of the Cloud: How AWS Runs Lots of Clusters
Disk speed screeches to a crawl, packets get dropped, connections time out: welcome to the cloud! Most of the time the cloud "just works", but when it doesnt, how does Kubernetes and etcd handle fail …
Talk Title | Living with the Pathology of the Cloud: How AWS Runs Lots of Clusters |
Speakers | Micah Hausler (Sr System Development Engineer, Amazon Web Services) |
Conference | KubeCon + CloudNativeCon North America |
Conf Tag | |
Location | San Diego, CA, USA |
Date | Nov 15-21, 2019 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Disk speed screeches to a crawl, packets get dropped, connections time out: welcome to the cloud! Most of the time the cloud “just works”, but when it doesn’t, how does Kubernetes and etcd handle failure? In this talk Micah will discuss considerations for building and configuring cloud native systems for failure including how Amazon EKS’s architecture and design accounts for outages and dependency failures. Micah will also cover and lessons learned from managing lots and lots of Kubernetes and etcd for customers around the world.