January 4, 2020

295 words 2 mins read

Using the MapD kernel for the Jupyter Notebook

Using the MapD kernel for the Jupyter Notebook

MapD Core is an open source analytical SQL engine that has been designed from the ground up to harness the parallelism inherent in GPUs. This enables queries on billions of rows of data in milliseconds. Randy Zwitch offers an overview of the MapD kernel extension for the Jupyter Notebook and explains how to use it in a typical machine learning workflow.

Talk Title Using the MapD kernel for the Jupyter Notebook
Speakers Randy Zwitch (MapD)
Conference JupyterCon in New York 2018
Conf Tag The Official Jupyter Conference
Location New York, New York
Date August 22-24, 2018
URL Talk Page
Slides Talk Slides
Video

MapD Core is an open source analytical SQL engine that has been designed from the ground up to harness the parallelism inherent in GPUs. This enables queries on billions of rows of data in milliseconds. MapD Core also supports the GPU DataFrame (GDF) from GoAi (based on Apache Arrow) and is designed for passing data between processes while keeping it all in GPU memory. In order to provide data scientists with a seamless experience, MapD created a Jupyter Notebook kernel extension that can be installed from a MapD-managed Conda channel. Randy Zwitch offers an overview of the MapD kernel extension for the Jupyter Notebook and explains how to use it in a typical machine learning workflow. You’ll learn how to deploy a Jupyter notebook with the MapD kernel extension, see how the Jupyter Notebook MapD kernel connects to a MapD server backend, and discover how its magic function (%%sql) executes commands on the MapD Core SQL engine. These SQL queries return their results into the GPU memory data frame using the PyGDF library. The GPU resident DataFrame is then accessed by the machine learning modeling framework to test, train, and make predictions.

comments powered by Disqus