October 19, 2019

219 words 2 mins read

Protecting the Data Lake

Protecting the Data Lake

In todays world, data is at the core of every business decision made. As data grows, companies have started implementing their own Data Lakes to store and run analytics on the data. Ceph is widely us …

Talk Title Protecting the Data Lake
Speakers Ash Narkar (Senior Software Engineer, Styra Inc)
Conference KubeCon + CloudNativeCon Europe
Conf Tag
Location Barcelona, Spain
Date May 19-23, 2019
URL Talk Page
Slides Talk Slides
Video

In today’s world, data is at the core of every business decision made. As data grows, companies have started implementing their own Data Lakes to store and run analytics on the data. Ceph is widely used to implement a Data Lake. Securing the data is a priority for every organization and is influenced by the technologies they use, legal regulations, internal conventions, and so on. Enforcing policies to protect the data is difficult because it often affects the entire stack, requires state from multiple locations, and must evolve over time as business needs change. In this talk, we will see how the Open Policy Agent (OPA) can be integrated with Ceph to guard access to sensitive data while satisfying strict latency and availability requirements. In our demo we will deploy Ceph in Kubernetes using Rook and show how to enforce custom policies over the Ceph Storage Cluster.

comments powered by Disqus