Advanced Model Inferencing Leveraging KNative, Istio and Kubeflow Serving


Talk Title	Advanced Model Inferencing Leveraging KNative, Istio and Kubeflow Serving
Speakers	Animesh Singh (STSM and Program Director, IBM), Clive Cox (CTO, Seldon)
Conference	KubeCon + CloudNativeCon North America
Conf Tag
Location	San Diego, CA, USA
Date	Nov 15-21, 2019
URL	Talk Page
Slides	Talk Slides
Video

Model Inferencing use cases are becoming a requirement for models moving into the next phase of production deployments. More and more users are now encountering use cases around canary deployments, scale-to-zero or serverless characteristics. And then there are also advanced use cases coming around model explainability, including A/B tests, ensemble models, multi-armed bandits, etc.In this talk, the speakers are going to detail how to handle these use cases using Kubeflow Serving and the native Kubernetes stack which is Istio and Knative. Knative and Istio help with autoscaling, scale-to-zero, canary deployments to be implemented, and scenarios where traffic is optimized to the best performing models. This can be combined with KNative eventing, Istio observability stack, KFServing Transformer to handle pre/post-processing and payload logging which consequentially can enable drift and outlier detection to be deployed. We will demonstrate where currently KFServing is, and where it’s heading towards.

Advanced Model Inferencing Leveraging KNative, Istio and Kubeflow Serving

Ready to Serve! Speeding-Up Startup Time of Istio-Powered Workloads

Serverless Platform for Large Scale Mini-Apps: From Knative to Production

KEDA: Event Driven and Serverless Containers in Kubernetes

Emitting, Consuming, and Presenting: The Event Lifecycle

Ingress V2 and Multicluster Services

FaaS is Not Only the Serverless: Stream Processing with Serverless