When SQL users run wild: Resource management features and techniques to tame Apache Impala
As the popularity and utilization of Apache Impala deployments increases, clusters often become victims of their own success when demand for resources exceeds the supply. Tim Armstrong dives into the latest resource management features in Impala to maintain high cluster availability and optimal performance and provides examples of how to configure them in your Impala deployment.
Talk Title | When SQL users run wild: Resource management features and techniques to tame Apache Impala |
Speakers | Tim Armstrong (Cloudera) |
Conference | Strata Data Conference |
Conf Tag | Big Data Expo |
Location | San Francisco, California |
Date | March 26-28, 2019 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Apache Impala has offered fast SQL analytics over big data since its initial beta release in 2012. As the popularity and utilization of Impala deployments increases, clusters often become victims of their own success when demand for resources exceeds the supply. Tim Armstrong dives into the latest resource management features in Impala to maintain high cluster availability and optimal performance and provides examples of how to configure them in your Impala deployment. Tim also discusses ongoing work on Impala’s admission control to make workload management simpler, more flexible, and automatic, including how the setup of Impala admission control was streamlined and efforts to make out-of-memory errors a thing of the past.