Volcano: Running AI/DL workload on Kubernetes
| Talk Title | Volcano: Running AI/DL workload on Kubernetes |
| --- | --- |
| Speakers | Da Ma (Software Architect, Huawei) |
| Conference | KubeCon + CloudNativeCon |
| Conf Tag | |
| Location | Shanghai, China |
| Date | Jun 23-26, 2019 |
| URL | Talk Page |
| Slides | Talk Slides |
| Video | |
Kubernetes started as a general-purpose orchestration framework with a focus on serving jobs. But as it gains popularity, users want to run AI/DL workloads on Kubernetes, such as TensorFlow and PyTorch. Running these workloads on Kubernetes requires several advanced capabilities, e.g. fair-share scheduling, queues, job management (suspend/resume), and data management. This talk will demonstrate how to use Volcano to bring these "batch" capabilities to Kubernetes.
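As a rough illustration of the "batch" capabilities the talk covers, the sketch below shows what a Volcano Job manifest could look like: the job is handed to the Volcano scheduler, placed in a queue for fair sharing, and gang-scheduled via minAvailable so that parameter-server and worker pods start together. The job name, queue, image tag, and replica counts are illustrative assumptions, not taken from the talk.

```yaml
# Illustrative Volcano Job (batch.volcano.sh/v1alpha1); names and images are examples only.
apiVersion: batch.volcano.sh/v1alpha1
kind: Job
metadata:
  name: tf-dist-example        # hypothetical job name
spec:
  schedulerName: volcano       # hand the job to the Volcano scheduler
  queue: default               # queue used for fair sharing across jobs/teams
  minAvailable: 3              # gang scheduling: all 3 pods must be schedulable before any start
  policies:
    - event: PodEvicted
      action: RestartJob       # lifecycle policy applied to the whole job
  tasks:
    - name: ps
      replicas: 1
      template:
        spec:
          restartPolicy: OnFailure
          containers:
            - name: tensorflow
              image: tensorflow/tensorflow:1.14.0   # illustrative image tag
              command: ["python", "/app/train.py", "--role=ps"]
    - name: worker
      replicas: 2
      template:
        spec:
          restartPolicy: OnFailure
          containers:
            - name: tensorflow
              image: tensorflow/tensorflow:1.14.0
              command: ["python", "/app/train.py", "--role=worker"]
```

Such a manifest would be created with `kubectl apply -f job.yaml`; queue management and suspend/resume of jobs are the kind of operations the talk demonstrates on top of this Job abstraction.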