Why data scientists should love Linux containers

Containers are a hot technology for application developers, but they also provide key benefits for data scientists. William Benton details the advantages of containers for data scientists and AI developers, focusing on high-level tools that will enable you to become more productive and collaborate more effectively.


Talk Title	Why data scientists should love Linux containers
Speakers	William Benton (Red Hat)
Conference	Strata Data Conference
Conf Tag	Make Data Work
Location	New York, New York
Date	September 11-13, 2018
URL	Talk Page
Slides	Talk Slides
Video

Linux containers make it easy for teams to deploy, manage, and scale distributed applications and for operators to exploit compute capacity in the cloud. Although it might not be obvious, a great foundation for production applications can also support the exploratory work of data scientists and machine learning engineers. William Benton details the advantages of containers for data scientists and AI developers, focusing on high-level tools that will enable you to become more productive and collaborate more effectively. To provide context, William briefly explains what containers are and why developers love them. He then covers several key benefits of containers for data scientists, focusing on repeatability, collaboration, scalability, and compliance. You’ll learn how containers fulfill the promise of reproducible research, ease moving techniques from prototype to production, enable painless publishing and collaboration workflows, and empower you to safely develop techniques against sensitive data in a production environment from the comfort of your laptop. There are myriad tutorial resources explaining how to build and run container images, but these largely assume an audience whose primary responsibilities include packaging, releasing, and managing applications. William focuses on why data scientists should care about containers and the high-level tools built on top of containers that will enhance their daily work. Data scientists will leave with a better understanding of the advantages of containers and concrete suggestions for how to use higher-level tools to make their work more productive. Application and AI developers will learn about the commonalities between engineering workflows and data science workflows and leave with a better understanding of how containers can support their data scientist colleagues and enable cross-functional collaboration.

Why data scientists should love Linux containers

Tutorial: Kubeflow End-to-End: GitHub Issue Summarization

Executive Briefings: Killer robots and how not to do data science

Accelerate application delivery with a cloud-native mindset (sponsored by IBM)

Beyond hype: AI in the real world

Container4NFV: How Refactoring VM-based VNF to Container-Based Microservice VNF

Customized ML for the enterprise (sponsored by Google Cloud)