Lyft's analytics pipeline: From Redshift to Apache Hive and Presto
Lyfts business has grown over 100x in the past four years. Shenghu Yang explains how Lyfts data pipeline has evolved over the years to serve its ever-growing analytics use cases, migrating from the world's largest AWS Redshift clusters to Apache Hive and Presto for solving scalability and concurrency hard limits.
Talk Title | Lyft's analytics pipeline: From Redshift to Apache Hive and Presto |
Speakers | Shenghu Yang (Lyft) |
Conference | Strata Data Conference |
Conf Tag | Big Data Expo |
Location | San Jose, California |
Date | March 6-8, 2018 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Lyft’s business has grown over 100x in the past four years. Shenghu Yang explains how Lyft’s data pipeline has evolved over the years to serve its ever-growing analytics use cases, migrating from the world’s largest AWS Redshift clusters to Apache Hive and Presto for solving scalability and concurrency hard limits. Topics include: