Towards Hyperscale High Performance Computing with RDMA
Talk Title | Towards Hyperscale High Performance Computing with RDMA |
Speakers | Omar Cardona, Microsoft |
Conference | NANOG76 |
Conf Tag | |
Location | Washington DC |
Date | Jun 10 2019 - Jun 12 2019 |
URL | Talk Page |
Slides | Talk Slides |
Video | Talk Video |
RDMA is increasingly deployed in datacenter environments, and its deployment introduces an additional level of fabric complexity in management and diagnosis. This talk will introduce operators to RDMA concepts from a fabric management point of view, including the caveats, quirks, and misconceptions that have skewed deployment decisions. Multiple variants of RDMA are in play (InfiniBand, iWARP, RoCE), each with different qualities and properties, from workload-level capabilities to end-to-end fabric resiliency, so a thorough understanding of each is needed for successful deployment.