Collecting Operational Metrics for a Cluster with 5,000 Namespaces
Kubernetes is popular to provide a multi-tenant, shared infrastructure layer for many eng teams within an org. This is great for the teams, as they have a stable, scalable cluster to build upon. Howev …
Talk Title | Collecting Operational Metrics for a Cluster with 5,000 Namespaces |
Speakers | Rob Szumski (Principal Product Manager for OpenShift, Red Hat), Chance Zibolski (Software Engineer, Red Hat) |
Conference | KubeCon + CloudNativeCon North America |
Conf Tag | |
Location | Seattle, WA, USA |
Date | Dec 9-14, 2018 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Kubernetes is popular to provide a multi-tenant, shared infrastructure layer for many eng teams within an org. This is great for the teams, as they have a stable, scalable cluster to build upon. However, the Ops team needs insight into resource usage, availability & more, especially once an app is running in multiple regions/clouds. This talk will introduce the open source Operator Metering project, which collects metrics from Prometheus about Pods/Namespaces & combines it against cloud resource usage and other metrics to compute cost per Pod and other reports on a periodic interval. This capability unlocks billing for internal biz units, capacity planning, and operational knowledge of Operators running within a cluster. At the end, we’ll show the project running against a real cluster with over 5,000 namespaces, proving the scale and insight the reports can provide to an Ops team.