October 27, 2019

226 words 2 mins read

Building a scalable data science platform with R

Building a scalable data science platform with R

Hadoop is famously scalable, as is cloud computing. R, the thriving and extensible open source data science software. . .not so much. Mario Inchiosa and Roni Burd outline how to seamlessly combine Hadoop, cloud computing, and R to create a scalable data science platform that lets you explore, transform, model, and score data at any scale from the comfort of your favorite R environment.

Talk Title Building a scalable data science platform with R
Speakers Mario Inchiosa (Microsoft), Roni Burd (Microsoft)
Conference Strata + Hadoop World
Conf Tag Big Data Expo
Location San Jose, California
Date March 29-31, 2016
URL Talk Page
Slides Talk Slides
Video

Hadoop is famously scalable, as is cloud computing. R, the thriving and extensible open source data science software. . .not so much. But what if we seamlessly combined Hadoop, cloud computing, and R to create a scalable data science platform? Imagine exploring, transforming, modeling, and scoring data at any scale from the comfort of your favorite R environment. Now, imagine calling a simple R function to operationalize your predictive model as a scalable, cloud-based web services API. Mario Inchiosa and Roni Burd demonstrate how to use the magic of the cloud to run your R code, thousands of open source R extension packages, and distributed implementations of the most popular machine-learning algorithms at scale. This session is sponsored by Microsoft.

comments powered by Disqus