December 8, 2019

369 words 2 mins read

Improving DevOps and QA efficiency using machine learning and NLP methods

Improving DevOps and QA efficiency using machine learning and NLP methods

DevOps and QA engineers spend a significant amount of time investigating reoccurring issues. These issues are often represented by large configuration and log files, so the process of investigating whether two issues are duplicates can be a very tedious task. Ran Taig and Omer Sagi outline a solution that leverages NLP and machine learning algorithms to automatically identify duplicate issues.

Talk Title Improving DevOps and QA efficiency using machine learning and NLP methods
Speakers Ran Taig (Dell), Omer Sagi (Dell)
Conference Strata Data Conference
Conf Tag Making Data Work
Location London, United Kingdom
Date May 22-24, 2018
URL Talk Page
Slides Talk Slides
Video

DevOps and QA issues usually consist of large files with numerous log events and configuration data. Resolving such issues is sometimes like finding a needle in a haystack. It requires the execution of highly technical procedures by experienced DevOps and QA engineers. When the issue is a duplication of an existing issue (for example, an additional failure of a certain test), this tedious investigation process may be avoided. However, identifying such duplications is a complicated task that depends on the investigator’s familiarity with past issues, knowledge sharing within the team, and thorough investigation of candidate issues. Software tracking systems (like JIRA or Bugzilla) typically enable textual querying for locating items of interest (log content, system documentations, configuration properties, labels, etc.). These search tools assume that data is of high quality and that user descriptions are semantically accurate. In reality, these conditions are not met and the investigation becomes a frustrating and time consuming task. Ran Taig and Omer Sagi outline a solution that leverages NLP and machine learning algorithms to automatically identify duplicate issues. The solution creates a “fingerprint” vector representation for each issue and stores each ‘fingerprint’ in a designated knowledge base. When a new issue arrives, its configuration and log files are processed through a pipeline that converts them into a new fingerprint. Recommendations for potential duplicates can be drawn using the designated knowledge base and a machine learning algorithm that aims to find similar fingerprints. Ran and Omer describe the successful implementation of this solution in Dell’s production systems, which has led to a significant reduction in the resolution time for issues.

comments powered by Disqus