December 17, 2019


Introducing KFServing: Serverless Model Serving on Kubernetes


Talk Title	Introducing KFServing: Serverless Model Serving on Kubernetes
Speakers	Dan Sun (Senior Software Engineer, Bloomberg), Ellis Bigelow (Software Engineer, Google)
Conference	KubeCon + CloudNativeCon North America
Location	San Diego, CA, USA
Date	Nov 15-21, 2019
URL	Talk Page
Slides	Talk Slides

Production-grade serving of ML models is a challenging task for data scientists. In this talk, we’ll discuss how KFServing powers real-world examples of production inference at Bloomberg, where it supports the business domains of NLP, computer vision, and time-series analysis. KFServing (https://github.com/kubeflow/kfserving) provides a Kubernetes CRD for serving ML models on arbitrary frameworks. It aims to solve 80% of model-serving use cases by providing performant, high-abstraction interfaces for common ML frameworks. It offers a consistent, richly featured abstraction that supports bleeding-edge serving features such as CPU/GPU autoscaling, scaling to and from zero, and canary rollouts. KFServing’s charter includes a rich roadmap toward a complete story for mission-critical ML, including inference graphs, model explainability, outlier detection, and payload logging.

roadmap framework serverless github ml use case computer vision kubernetes nlp


Rook: Cloud-Native Storage Orchestration (Introduction and Deep Dive)

November 23, 2019

Rook is an open source cloud-native storage orchestrator for Kubernetes, providing the platform, framework, and support for a diverse set of storage solutions to natively integrate with cloud-native e …

The future of cloud-native programming (sponsored by IBM)

December 15, 2019

Today, we are witnessing a great proliferation of cloud-native paradigms such as 12-factor apps, microservices, and serverless. Tamar Eilam discusses an emerging unified cloud platform (based on open source projects such as Kubernetes and Istio) and explains why the new frontier is its evolution to unify multiple programming paradigms for greater simplification with power of expression.

Advanced Model Inferencing Leveraging KNative, Istio and Kubeflow Serving

December 10, 2019

Model Inferencing use cases are becoming a requirement for models moving into the next phase of production deployments. More and more users are now encountering use cases around canary deployments, sc …

Day 2 Operations with Windows Containers

December 10, 2019

The chairs for SIG-Windows will provide an update on the efforts to bring Windows to Kubernetes. This session will concentrate on presenting new features and capabilities as well as focus on day 2 ope …

KubeEdge Deep Dive

December 10, 2019

KubeEdge is an open source project extending native containerized application orchestration and device management from the central cloud to the edge. It is built upon Kubernetes and provides core infrastru …

Binary Authorization in Kubernetes

December 9, 2019

Kritis is an open-source solution for securing your software supply chain for Kubernetes applications. Kritis enforces deploy-time security policies that ensure only trusted container images are depl …