October 10, 2019

199 words 1 min read

How Should You Effectively Use etcd Metrics

How Should You Effectively Use etcd Metrics

All production systems need monitoring, to detect problems in advance and troubleshoot with the right information. etcd is no exception. How to effectively use ~100 etcd metrics and how to interpret t …

Talk Title How Should You Effectively Use etcd Metrics
Speakers Wenjia Zhang (Software Engineer, Google), Jingyi Hu (Software Engineer, Google)
Conference KubeCon + CloudNativeCon
Conf Tag
Location Shanghai, China
Date Jun 23-26, 2019
URL Talk Page
Slides Talk Slides
Video

All production systems need monitoring, to detect problems in advance and troubleshoot with the right information. etcd is no exception. How to effectively use ~100 etcd metrics and how to interpret the values under different usages? First of all, one must monitor if a leader exists, otherwise the system becomes unavailable. Furthermore, frequent leadership changes can impair the performance of consensus systems. Therefore, leader related metrics are critical. Some other etcd metrics also need special attention. Disk I/O and networking I/O related metrics hint physical constraints. Latency and throughput metrics are meaningful only when cross referencing with hardware configurations. We will walk you through etcd benchmarking tool, explain the important etcd metrics, and eventually help you understand how to apply etcd metrics with some case studies.

comments powered by Disqus