RDMA Enabled Kubernetes for High Performance Computing
Adoption of Kubernetes in scientific workloads has been hampered by limited support for high speed interconnects used in HPC clusters. Fortunately, we can now solve this problem by enabling RDMA in Ku …
Talk Title | RDMA Enabled Kubernetes for High Performance Computing |
Speakers | Feng Pan (Software Engineering Manager, Red Hat), Jacob Anders (HPC Technical Lead, CSIRO) |
Conference | KubeCon + CloudNativeCon North America |
Conf Tag | |
Location | San Diego, CA, USA |
Date | Nov 15-21, 2019 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Adoption of Kubernetes in scientific workloads has been hampered by limited support for high speed interconnects used in HPC clusters. Fortunately, we can now solve this problem by enabling RDMA in Kubernetes.In this session, we will describe the HPC use case and requirements from an end user’s perspective. We will discuss how RDMA, a highly efficient network transport protocol, can be used to address this challenge. We will then provide an overview of a community driven RDMA implementation for Kubernetes using CNI plugins and SR-IOV.Finally, we will demonstrate real-world applications running in RDMA-enabled Kubernetes environment and provide a performance comparison between standard and RDMA-enabled networking.You will leave this session understanding the state of the art for HPC networking on Kubernetes.