December 10, 2019


Collecting Operational Metrics for a Cluster with 5,000 Namespaces

Talk Title: Collecting Operational Metrics for a Cluster with 5,000 Namespaces
Speakers: Rob Szumski (Principal Product Manager for OpenShift, Red Hat), Chance Zibolski (Software Engineer, Red Hat)
Conference: KubeCon + CloudNativeCon North America
Conf Tag:
Location: Seattle, WA, USA
Date: Dec 9-14, 2018
URL: Talk Page
Slides: Talk Slides
Video:

Kubernetes is a popular way to provide a multi-tenant, shared infrastructure layer for many engineering teams within an organization. This is great for the teams, as they have a stable, scalable cluster to build upon. However, the Ops team needs insight into resource usage, availability, and more, especially once an app is running in multiple regions or clouds. This talk will introduce the open source Operator Metering project, which collects metrics from Prometheus about Pods and Namespaces and combines them with cloud resource usage and other metrics to compute cost per Pod and other reports at a periodic interval. This capability unlocks billing for internal business units, capacity planning, and operational knowledge of Operators running within a cluster. At the end, we’ll show the project running against a real cluster with over 5,000 namespaces, demonstrating the scale and insight the reports can provide to an Ops team.
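
To make the idea of a per-namespace cost report concrete, here is a minimal sketch in Python. It is not how Operator Metering is implemented; it only illustrates one plausible approach, assuming cost is split in proportion to CPU core-hours. The function name `cost_per_namespace`, the sample data, and the flat `total_cost` figure are all hypothetical.

```python
from collections import defaultdict


def cost_per_namespace(pod_cpu_core_hours, total_cost):
    """Split total_cost across namespaces in proportion to CPU core-hours.

    pod_cpu_core_hours maps (namespace, pod) -> core-hours used in the
    reporting period. This proportional rule is an assumption made for
    illustration, not the project's actual reporting logic.
    """
    usage = defaultdict(float)
    for (namespace, _pod), core_hours in pod_cpu_core_hours.items():
        usage[namespace] += core_hours

    total = sum(usage.values())
    if total == 0:
        # No recorded usage: nothing to attribute.
        return {ns: 0.0 for ns in usage}
    return {ns: total_cost * used / total for ns, used in usage.items()}


# Hypothetical usage samples, as might be aggregated from Prometheus.
samples = {
    ("team-a", "web-1"): 6.0,
    ("team-a", "web-2"): 2.0,
    ("team-b", "batch-1"): 8.0,
}
report = cost_per_namespace(samples, total_cost=100.0)
print(report)
```

In practice the usage figures would come from Prometheus queries and the cost figure from a cloud billing export; both are replaced by hard-coded values here to keep the sketch self-contained.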
