December 26, 2019

Cost-effective Presto on AWS with Spot nodes

Did you know you can run Presto on AWS at a tenth of the cost by using Spot nodes, with just a few architectural enhancements to Presto? Shubham Tagra explores the gaps in Presto architecture, explains how to use Spot nodes, covers enhancements, and showcases the improvements in terms of reliability and TCO achieved through them.

Talk Title: Cost-effective Presto on AWS with Spot nodes
Speakers: Shubham Tagra (Qubole)
Conference: Strata Data Conference
Conf Tag: Big Data Expo
Location: San Francisco, California
Date: March 26-28, 2019

Presto is gaining popularity for interactive and reporting use cases owing to its performance and ease of use. Presto is performant because it executes pipelined in-memory query plans, but its default architecture isn’t amenable to handling Spot node loss. Even so, many users run Spot nodes for the cost benefits and tolerate the resulting query failures as a necessary evil. Enhancements such as Spot termination notification listeners and a query retry system improve the user experience, and they also let users who cannot tolerate failures use Spot nodes and enjoy the cost benefits. Shubham Tagra explores the gaps in Presto architecture, explains how to use Spot nodes, covers these enhancements, and showcases the improvements in terms of reliability and TCO achieved through them.
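
To make the two enhancements named above more concrete, here is a minimal Python sketch of the general idea. It is not the implementation described in the talk, and it is not part of Presto. The names parse_spot_notice, NodeLostError, and run_with_retry are hypothetical and exist only for illustration. The one real detail is the EC2 instance metadata path spot/instance-action, which returns HTTP 404 until an interruption is scheduled and then returns a small JSON document with an action and a time. Fetching that URL is left out so the sketch runs anywhere.

```python
import json
from datetime import datetime, timezone

# Real EC2 instance metadata path for Spot interruption notices.
# Fetching it is omitted here; on instances that enforce IMDSv2 the request
# also needs a session token header.
SPOT_ACTION_URL = "http://169.254.169.254/latest/meta-data/spot/instance-action"


def parse_spot_notice(status_code, body):
    """Return the scheduled interruption time (UTC), or None if none is pending.

    Hypothetical helper: takes the HTTP status and body that a poll of
    SPOT_ACTION_URL would produce.
    """
    if status_code != 200:
        # 404 means no interruption is currently scheduled for this node.
        return None
    notice = json.loads(body)
    if notice.get("action") not in ("terminate", "stop", "hibernate"):
        return None
    when = datetime.strptime(notice["time"], "%Y-%m-%dT%H:%M:%SZ")
    return when.replace(tzinfo=timezone.utc)


class NodeLostError(Exception):
    """Hypothetical error raised when a query fails because a worker vanished."""


def run_with_retry(run_query, max_attempts=3):
    """Re-run a query when it fails due to node loss (assumed retry policy).

    run_query is any callable that takes the attempt number and either
    returns a result or raises NodeLostError.
    """
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            return run_query(attempt)
        except NodeLostError as err:
            # Only node-loss failures are retried; other errors propagate.
            last_error = err
    raise last_error
```

In a setup like this, a listener on each worker would poll the metadata path every few seconds and, once parse_spot_notice returns a time, stop scheduling new work on that node. The retry wrapper covers queries that still fail because the node disappeared before they finished.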
