December 1, 2019

236 words 2 mins read

Presto: Distributed SQL done faster

Presto: Distributed SQL done faster

Wojciech Biela and ukasz Osipiuk offer an introduction to Presto, an open source distributed analytical SQL engine that enables users to run interactive queries over their datasets stored in various data sources, and explore its applications in various big data problems.

Talk Title Presto: Distributed SQL done faster
Speakers Wojciech Biela (Starburst), Łukasz Osipiuk (Teradata)
Conference Strata Data Conference
Conf Tag Making Data Work
Location London, United Kingdom
Date May 23-25, 2017
URL Talk Page
Slides Talk Slides
Video

Interactive analysis of data stored in HDFS and other data sources has been gaining traction, and the field has been rapidly growing in the past few years. Wojciech Biela and Łukasz Osipiuk offer an introduction to Presto, an open source distributed analytical SQL engine that enables users to run interactive queries over their datasets stored in various data sources, including HDFS (Hive/Hadoop), Amazon S3, and various SQL and NoSQL data stores. Presto is developed under the Apache 2.0 license. It was started at Facebook as an initiative to enable interactive querying across a variety of data stores. The project has a large and growing community of users that include Airbnb, LinkedIn, Netflix, Twitter, and Uber. Wojciech and Łukasz explore Presto’s design fundamentals and core capabilities and cover recent functional additions to Presto as well as current and future development themes. Along the way, they also describe the major Presto installations (Facebook, Netflix, Uber) and their usage scenarios.

comments powered by Disqus