January 24, 2020

The OS for AI: How serverless computing enables the next gen of machine learning

ML has been advancing rapidly, but only a few contributors focus on the infrastructure and scaling challenges that come with it. Jonathan Peck explores why ML is a natural fit for serverless computing, a general architecture for scalable ML, and common issues when implementing on-demand scaling over GPU clusters, providing general solutions and a vision for the future of cloud-based ML.


Talk Title	The OS for AI: How serverless computing enables the next gen of machine learning
Speakers	Jonathan Peck (GitHub)
Conference	O’Reilly Open Source Software Conference
Conf Tag	Fueling innovative software
Location	Portland, Oregon
Date	July 15-18, 2019

Machine learning has been advancing rapidly, but only a few contributors are focusing on the infrastructure and scaling challenges that come with it. When you have thousands of model versions, each written in any mix of languages and frameworks (Python, R, Java, and Ruby; PyTorch, scikit-learn, Caffe, and TensorFlow; etc.), it's difficult to know how to deploy them efficiently as elastic, scalable, secure APIs with 10 ms of latency and GPU access. Algorithmia has seen many of the challenges in this area. Jonathan Peck explores how the company built, deployed, and scaled thousands of algorithms and machine learning models using every kind of framework. You'll gain insight into the problems you're likely to face and how to approach solving them. Jonathan examines the need for, and implementations of, a complete operating system for AI: a common interface that lets different algorithms be used and combined, and a general architecture for serverless machine learning that is discoverable, versioned, scalable, and sharable.
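
As a rough illustration of the "common interface" idea, the sketch below shows a toy in-memory catalog in which every algorithm, whatever framework sits behind it, is exposed as a single-argument callable under a name and version, so callers can discover and chain them. The `Registry` class and its `register`, `catalog`, `call`, and `pipe` methods are hypothetical names invented for this example; they are not Algorithmia's API, and a real serverless platform would route each call to an isolated, autoscaled container rather than to a local function.

```python
from typing import Any, Callable, Dict, List, Tuple

# Assumption: every algorithm takes one payload and returns one result.
Algorithm = Callable[[Any], Any]
Key = Tuple[str, str]  # (name, version)


class Registry:
    """Toy in-memory catalog of versioned algorithms (illustrative only)."""

    def __init__(self) -> None:
        self._algos: Dict[Key, Algorithm] = {}

    def register(self, name: str, version: str, fn: Algorithm) -> None:
        # Versioned: the same name can coexist under several versions.
        self._algos[(name, version)] = fn

    def catalog(self) -> List[Key]:
        # Discoverable: callers can list what is available.
        return sorted(self._algos)

    def call(self, name: str, version: str, payload: Any) -> Any:
        # Common interface: one entry point regardless of implementation.
        return self._algos[(name, version)](payload)

    def pipe(self, steps: List[Key], payload: Any) -> Any:
        # Combinable: feed each step's output into the next step.
        for name, version in steps:
            payload = self.call(name, version, payload)
        return payload


registry = Registry()
registry.register("tokenize", "1.0.0", str.split)
registry.register("count", "1.0.0", len)

steps = [("tokenize", "1.0.0"), ("count", "1.0.0")]
print(registry.pipe(steps, "serverless ml is elastic"))  # prints 4
```

Pinning each step to an explicit version keeps a pipeline reproducible even when newer versions of an algorithm are registered later.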

api pytorch framework serverless gpu algorithm tensorflow infrastructure next gen machine learning python scalable

Deep learning for recommender systems

January 12, 2020

The success of deep learning has reached the realm of structured data in the past few years, where neural networks have been shown to improve the effectiveness and predictability of recommendation engines. Oliver Gindele offers a brief overview of such deep recommender systems and explains how they can be implemented in TensorFlow.

Unleashing Apache Kafka and TensorFlow in hybrid architectures

January 4, 2020

How do you leverage the flexibility and extreme scale of the public cloud and the Apache Kafka ecosystem to build scalable, mission-critical machine learning infrastructures that span multiple public clouds or bridge your on-premises data center to the cloud? Join Kai Wähner to learn how to use technologies such as TensorFlow with Kafka's open source ecosystem for machine learning infrastructures.

Unlocking your serverless functions with OpenFaaS for AI chatbot projects

January 23, 2020

Sergio Mendez examines critical challenges in implementing AI chatbots and explains how Movistar designed an open source serverless architecture using OpenFaaS on top of Kubernetes, along with complementary technologies such as NoSQL databases and message brokers, to deploy Telegram AI chatbots. Sergio then compares these technologies to the "vendor lock-in" services offered by major cloud providers.

Deep learning with TensorFlow and Spark using GPUs and Docker containers

January 12, 2020

Organizations need to keep ahead of their competition by using the latest AI, ML, and DL technologies such as Spark, TensorFlow, and H2O. The challenge is how to deploy these tools and keep them running consistently while maximizing the use of scarce hardware resources such as GPUs. Thomas Phelan discusses the effective deployment of such applications in a container environment.

Deploying deep learning models on GPU-enabled Kubernetes clusters

January 1, 2020

Interested in deep learning models and how to deploy them on Kubernetes at production scale? Not sure if you need to use GPUs or CPUs? Mathew Salvaris and Fidan Boylu Uz help you out by providing a step-by-step guide to creating a pretrained deep learning model, packaging it in a Docker container, and deploying it as a web service on a Kubernetes cluster.

Analytics Zoo: Distributed TensorFlow in production on Apache Spark

December 27, 2019

Yuhao Yang and Jennie Wang demonstrate how to run distributed TensorFlow on Apache Spark with the open source software package Analytics Zoo. Compared to other solutions, Analytics Zoo is built for production environments and encourages more industry users to run deep learning applications within the big data ecosystem.