Developer on the rise: Blurring the line between developer and data scientist with PixieDust
Ready to dip your toe into data science? Va Barbosa explains why you should start with notebooks and PixieDust, a new open source library that helps data scientists and developers working in the Jupyter Notebook and Apache Spark be more efficient.
Talk Title | Developer on the rise: Blurring the line between developer and data scientist with PixieDust |
Speakers | va barbosa (IBM) |
Conference | O’Reilly Open Source Convention |
Conf Tag | Making Open Work |
Location | Austin, Texas |
Date | May 8-11, 2017 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Ready to dip your toe into data science? Va Barbosa explains why you should start with notebooks and PixieDust, a new open source library that helps data scientists and developers working in the Jupyter Notebook and Apache Spark be more efficient. PixieDust speeds data manipulation and display with features like auto-visualization of Spark DataFrames, real-time Spark job progress monitoring directly from the notebook, seamless integration to cloud services, and automated local install of Python and Scala kernels running with Spark. And if you prefer working with a Scala notebook, no problem. PixieDust can also run on a Scala Kernel. Imagine being able to visualize your favorite Python chart engines from a Scala notebook. Join Va to learn how to use PixieDust in your own projects to visualize and explore data effortlessly with no coding. Va also shares a demo combining Twitter, Watson Tone Analyzer, Spark Streaming, and some fun real-time visualizations—all running within a notebook. This session is sponsored by IBM.