Presto: Distributed SQL done faster
Wojciech Biela and ukasz Osipiuk offer an introduction to Presto, an open source distributed analytical SQL engine that enables users to run interactive queries over their datasets stored in various data sources, and explore its applications in various big data problems.
Talk Title | Presto: Distributed SQL done faster |
Speakers | Wojciech Biela (Starburst), Łukasz Osipiuk (Teradata) |
Conference | Strata Data Conference |
Conf Tag | Making Data Work |
Location | London, United Kingdom |
Date | May 23-25, 2017 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Interactive analysis of data stored in HDFS and other data sources has been gaining traction, and the field has been rapidly growing in the past few years. Wojciech Biela and Łukasz Osipiuk offer an introduction to Presto, an open source distributed analytical SQL engine that enables users to run interactive queries over their datasets stored in various data sources, including HDFS (Hive/Hadoop), Amazon S3, and various SQL and NoSQL data stores. Presto is developed under the Apache 2.0 license. It was started at Facebook as an initiative to enable interactive querying across a variety of data stores. The project has a large and growing community of users that include Airbnb, LinkedIn, Netflix, Twitter, and Uber. Wojciech and Łukasz explore Presto’s design fundamentals and core capabilities and cover recent functional additions to Presto as well as current and future development themes. Along the way, they also describe the major Presto installations (Facebook, Netflix, Uber) and their usage scenarios.