January 16, 2020

Zipline: Airbnb's data management platform for machine learning

Zipline is Airbnb's soon-to-be open-sourced data management platform designed specifically for ML use cases. It has cut the time needed for feature generation from months to days and offers features that support end-to-end data management for machine learning. Varant Zanoyan covers Zipline's architecture and dives into how it solves ML-specific problems.

Talk Title Zipline: Airbnb's data management platform for machine learning
Speakers Varant Zanoyan (Airbnb)
Conference Strata Data Conference
Conf Tag Make Data Work
Location New York, New York
Date September 11-13, 2018

Zipline is Airbnb’s data management platform specifically designed for ML use cases. Previously, ML practitioners at Airbnb spent roughly 60% of their time collecting data and writing transformations for machine learning tasks. Zipline reduces the time this work takes from months to about a day. It allows users to define features in an easy-to-use configuration language, then provides features that support end-to-end data management for machine learning. Varant Zanoyan covers Zipline’s architecture and dives into how it solves ML-specific problems. Although these problems are widespread, there are no open source solutions to them. As a result, Airbnb intends to open-source Zipline in the near future.