Cost-effective Presto on AWS with Spot nodes
Did you know you can run Presto in AWS at a tenth of the cost with AWS Spot nodes, with just a few architectural enhancements to Presto. Shubham Tagra explores the gaps in Presto architecture, explains how to use Spot nodes, covers enhancements, and showcases the improvements in terms of reliability and TCO achieved through them.
Talk Title | Cost-effective Presto on AWS with Spot nodes |
Speakers | Shubham Tagra (Qubole) |
Conference | Strata Data Conference |
Conf Tag | Big Data Expo |
Location | San Francisco, California |
Date | March 26-28, 2019 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Presto is gaining popularity for interactive and reporting use cases owing to its performance and ease of use. Presto is performant because it executes pipelined in-memory query plans, but its default architecture isn’t amenable to handling Spot node loss. However, many users still use Spot nodes for their cost benefits and tolerate query failures as a necessary evil. With Spot termination notification listeners and its query retry system, Presto improves the user experience. The changes also allow users who cannot tolerate failures to use Spot nodes and enjoy their cost benefits. Shubham Tagra explores the gaps in Presto architecture, explains how to use Spot nodes, covers enhancements like Spot termination notification listeners and the query retry system, and showcases the improvements in terms of reliability and TCO achieved through them.