What Happens When Something Goes Wrong? On Kubernetes Reliability [I]
Talk Title | What Happens When Something Goes Wrong? On Kubernetes Reliability [I] |
Speakers | Marek Grabowski (Site Reliability Engineer, Google), Tina Zhang (Site Reliability Engineer, Google) |
Conference | KubeCon + CloudNativeCon North America |
Conf Tag | |
Location | Austin, TX, United States |
Date | Dec 4-8, 2017 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
One of the best features of Kubernetes is that it can automatically recover from various failures and keep your application working despite unfavorable circumstances. There are moments when this works like magic and operators won’t even notice that anything went wrong. Sadly, automation sometimes fails. In this talk we describe the policies and mechanisms built into the system to keep user applications, and the cluster as a whole, running. We’ll cover both what happens automatically and what users need to configure.