Using the MapD kernel for the Jupyter Notebook
MapD Core is an open source analytical SQL engine that has been designed from the ground up to harness the parallelism inherent in GPUs. This enables queries on billions of rows of data in milliseconds. Randy Zwitch offers an overview of the MapD kernel extension for the Jupyter Notebook and explains how to use it in a typical machine learning workflow.
Talk Title | Using the MapD kernel for the Jupyter Notebook |
Speakers | Randy Zwitch (MapD) |
Conference | JupyterCon in New York 2018 |
Conf Tag | The Official Jupyter Conference |
Location | New York, New York |
Date | August 22-24, 2018 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
MapD Core is an open source analytical SQL engine that has been designed from the ground up to harness the parallelism inherent in GPUs. This enables queries on billions of rows of data in milliseconds. MapD Core also supports the GPU DataFrame (GDF) from GoAi (based on Apache Arrow) and is designed for passing data between processes while keeping it all in GPU memory. In order to provide data scientists with a seamless experience, MapD created a Jupyter Notebook kernel extension that can be installed from a MapD-managed Conda channel. Randy Zwitch offers an overview of the MapD kernel extension for the Jupyter Notebook and explains how to use it in a typical machine learning workflow. You’ll learn how to deploy a Jupyter notebook with the MapD kernel extension, see how the Jupyter Notebook MapD kernel connects to a MapD server backend, and discover how its magic function (%%sql) executes commands on the MapD Core SQL engine. These SQL queries return their results into the GPU memory data frame using the PyGDF library. The GPU resident DataFrame is then accessed by the machine learning modeling framework to test, train, and make predictions.