Zipline: Airbnb's data management platform for machine learning
Zipline is Airbnbs soon to be open-sourced data management platform specifically designed for ML use cases. It has taken the task of feature generation from months to days and offers features to support end-to-end data management for machine learning. Varant Zanoyan covers Zipline's architecture and dives into how it solves ML-specific problems.
Talk Title | Zipline: Airbnb's data management platform for machine learning |
Speakers | Varant Zanoyan (Airbnb) |
Conference | Strata Data Conference |
Conf Tag | Make Data Work |
Location | New York, New York |
Date | September 11-13, 2018 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
Zipline is Airbnb’s data management platform specifically designed for ML use cases. Previously, ML practitioners at Airbnb spent roughly 60% of their time collecting and writing transformations for machine learning tasks. Zipline reduces this task from months to about a day. It allows users to define features in a easy-to-use configuration language, then provides access to the following features: Varant Zanoyan covers Zipline’s architecture and dives into how it solves ML-specific problems. Despite being widespread, there are no open source solutions to these kinds of problems. As a result, Airbnb intends to open-source Zipline in the near future.