November 23, 2019

142 words 1 min read

Lyft's analytics pipeline: From Redshift to Apache Hive and Presto

Lyft's analytics pipeline: From Redshift to Apache Hive and Presto

Lyfts business has grown over 100x in the past four years. Shenghu Yang explains how Lyfts data pipeline has evolved over the years to serve its ever-growing analytics use cases, migrating from the world's largest AWS Redshift clusters to Apache Hive and Presto for solving scalability and concurrency hard limits.

Talk Title Lyft's analytics pipeline: From Redshift to Apache Hive and Presto
Speakers Shenghu Yang (Lyft)
Conference Strata Data Conference
Conf Tag Big Data Expo
Location San Jose, California
Date March 6-8, 2018
URL Talk Page
Slides Talk Slides
Video

Lyft’s business has grown over 100x in the past four years. Shenghu Yang explains how Lyft’s data pipeline has evolved over the years to serve its ever-growing analytics use cases, migrating from the world’s largest AWS Redshift clusters to Apache Hive and Presto for solving scalability and concurrency hard limits. Topics include:

comments powered by Disqus