Integrating deep learning libraries with Apache Spark
Joseph Bradley and Xiangrui Meng share best practices for integrating popular deep learning libraries with Apache Spark, covering cluster setup, data ingest, and monitoring of long-running jobs. Joseph and Xiangrui then demonstrate these techniques using Google's TensorFlow library.
| Talk Title | Integrating deep learning libraries with Apache Spark |
|------------|--------------------------------------------------------|
| Speakers | Joseph Bradley (Databricks), Xiangrui Meng (Databricks) |
| Conference | O’Reilly Artificial Intelligence Conference |
| Conf Tag | Put AI to Work |
| Location | New York, New York |
| Date | June 27-29, 2017 |
| URL | Talk Page |
| Slides | Talk Slides |
| Video | |
The combination of deep learning with Apache Spark has the potential to make a huge impact. Joseph Bradley and Xiangrui Meng share best practices for integrating popular deep learning libraries with Apache Spark. Rather than comparing deep learning systems or specific optimizations, Joseph and Xiangrui focus on issues that are common to many deep learning frameworks when running on a Spark cluster: optimizing cluster setup (clusters can be configured to avoid task conflicts on GPUs and to allow the use of multiple GPUs per worker), configuring data ingest (pipelines set up for efficient data ingest improve job throughput), and monitoring long-running jobs (interactive monitoring aids both configuration work and checks on the stability of deep learning jobs). Joseph and Xiangrui then demonstrate these techniques using Google's popular TensorFlow library.
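The talk itself is code-light, but the cluster-setup advice translates into a short PySpark sketch. The snippet below illustrates the GPU-pinning idea under stated assumptions: `spark.task.cpus` is set equal to the executor's cores so concurrent tasks cannot contend for a GPU, and each task sets `CUDA_VISIBLE_DEVICES` before TensorFlow initializes CUDA. The two-GPUs-per-worker figure, the partition-id assignment scheme, and the `score_partition` helper are illustrative, not from the talk.

```python
import os

from pyspark import TaskContext
from pyspark.sql import SparkSession

# Assumption: 4 cores per executor. Setting spark.task.cpus to the same
# value allows only one task per executor at a time, so concurrent Spark
# tasks cannot contend for the same GPU (a common 2017-era workaround).
spark = (
    SparkSession.builder
    .appName("tf-on-spark-sketch")
    .config("spark.executor.cores", "4")
    .config("spark.task.cpus", "4")
    .getOrCreate()
)

GPUS_PER_WORKER = 2  # illustrative assumption, not from the talk


def score_partition(rows):
    """Run TensorFlow inference over one partition, pinned to one GPU."""
    # Pin the task to a device *before* TensorFlow initializes CUDA;
    # partition id modulo GPU count spreads tasks across a worker's GPUs.
    gpu_id = TaskContext.get().partitionId() % GPUS_PER_WORKER
    os.environ["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    import tensorflow as tf  # deferred import so the pin takes effect

    # Ingest-side point from the talk: feed the model in large batches
    # rather than row by row to keep the GPU busy.
    batch = list(rows)
    # ... build or load a graph here and run inference on `batch` ...
    return iter(batch)  # placeholder: replace with model outputs


rdd = spark.sparkContext.parallelize(range(1000), numSlices=8)
results = rdd.mapPartitions(score_partition).collect()
```

Batching each partition before handing it to the model, as in `score_partition`, is the ingest-side counterpart of the cluster-setup tuning: it trades a little driver-side simplicity for much higher GPU utilization.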