January 8, 2020

322 words 2 mins read

Mutant tests too: The SQL

Mutant tests too: The SQL

Elliot West and Jay Green share approaches for applying software engineering best practices to SQL-based data applications to improve maintainability and data quality. Using open source tools, Elliot and Jay show how to build effective test suites for Apache Hive code bases and offer an overview of Mutant Swarm, a tool to identify weaknesses in tests and to measure SQL code coverage.

Talk Title Mutant tests too: The SQL
Speakers Elliot West (Hotels.com), Jaydene Green (Hotels.com)
Conference Strata Data Conference
Conf Tag Making Data Work
Location London, United Kingdom
Date April 30-May 2, 2019
URL Talk Page
Slides Talk Slides
Video

The big data platforms of many organizations are underpinned by a technology that will soon celebrate its 45th birthday: SQL. This industry stalwart is applied in a multitude of critical points in business data flows; the results that these processes generate may significantly influence business and financial decision making. However, the SQL ecosystem has been overlooked and ignored by more recent innovations in the field of software engineering best practices, such as fine-grained automated testing and code quality metrics. This exposes organizations to poor application maintainability, high bug rates, and ultimately corporate risk. Elliot West and Jay Green share approaches for applying software engineering best practices to SQL-based data applications to improve maintainability and data quality. They first define the relevance of such approaches and demonstrate how automated testing can be applied to Hive SQL using HiveRunner, a JUnit based testing framework. They then consider how best to structure Hive queries to yield meaningful test scenarios that are maintainable and performant. Elliot and Jay conclude by demonstrating how test coverage reports can highlight areas of risk in SQL code bases and weaknesses in the testing process and offer an overview of Mutant Swarm, an open source mutation testing tool for SQL languages developed by Hotels.com that can deliver insights similar to those produced by Java-focused tools such as Jacoco and PIT.

comments powered by Disqus