Amazon for information: Building a modern data catalog
A data catalog provides context to help data analysts, data scientists, and other data consumers (including those with little technical background) find a relevant dataset, determine if it can be trusted, understand what it means, and utilize it to make better products and better decisions. Aaron Kalb explores how enterprises build interfaces that make sourcing data as easy as shopping on Amazon.
Talk Title | Amazon for information: Building a modern data catalog |
Speakers | Aaron Kalb (Alation) |
Conference | Strata + Hadoop World |
Conf Tag | Big Data Expo |
Location | San Jose, California |
Date | March 29-31, 2016 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
A data catalog provides context to help data analysts, data scientists, and other data consumers (including those with little technical background) find a relevant dataset, determine if it can be trusted, understand what it means, and utilize it to make better products and better decisions. Aaron Kalb explores how enterprises build interfaces that make sourcing data as easy as shopping on Amazon. Aaron gives an overview of data catalogs and explains how they relate to concepts like data dictionaries or data inventories. He also covers some of the fastest and most effective ways to build a data catalog, discussing the roles crowds, experts, and machines play. Topics include: