Open Source Tools for ML Experiments Management
The rise of new AI and ML requires new workflows and new tools: data versioning, ML pipeline versioning, experiments metrics visualization and others that have not been formalized and even named yet.T …
Talk Title | Open Source Tools for ML Experiments Management |
Speakers | Ruslan Kuprieiev (Software Engineer, Iterative AI), Dmitry Petrov (Co-Founder & CEO, DVC) |
Conference | Open Source Summit + ELC North America |
Conf Tag | |
Location | San Diego, CA, USA |
Date | Aug 19-23, 2019 |
URL | Talk Page |
Slides | Talk Slides |
Video | |
The rise of new AI and ML requires new workflows and new tools: data versioning, ML pipeline versioning, experiments metrics visualization and others that have not been formalized and even named yet.The traditional software engineering toolset does not fully cover ML team’s needs. We will discuss the current practices of organizing ML workflow using traditional open-source tools like Git and Git-LFS as well as their limitations. Thereby motivation for developing new ML specific experiments and data management systems will be explained.ML workflow differs from software engineering. Experimentation, trials-and-errors nature of ML projects and the need in more granular and efficient data artifacts management requires new sets of development tools. We will show ideas behind open source tool DVC or http://dvc.org which focuses on working with ML experiments, managing large datasets, and ML model.