December 19, 2019

211 words 1 min read

When SQL users run wild: Resource management features and techniques to tame Apache Impala

When SQL users run wild: Resource management features and techniques to tame Apache Impala

As the popularity and utilization of Apache Impala deployments increases, clusters often become victims of their own success when demand for resources exceeds the supply. Tim Armstrong dives into the latest resource management features in Impala to maintain high cluster availability and optimal performance and provides examples of how to configure them in your Impala deployment.

Talk Title When SQL users run wild: Resource management features and techniques to tame Apache Impala
Speakers Tim Armstrong (Cloudera)
Conference Strata Data Conference
Conf Tag Big Data Expo
Location San Francisco, California
Date March 26-28, 2019
URL Talk Page
Slides Talk Slides
Video

Apache Impala has offered fast SQL analytics over big data since its initial beta release in 2012. As the popularity and utilization of Impala deployments increases, clusters often become victims of their own success when demand for resources exceeds the supply. Tim Armstrong dives into the latest resource management features in Impala to maintain high cluster availability and optimal performance and provides examples of how to configure them in your Impala deployment. Tim also discusses ongoing work on Impala’s admission control to make workload management simpler, more flexible, and automatic, including how the setup of Impala admission control was streamlined and efforts to make out-of-memory errors a thing of the past.

comments powered by Disqus