November 18, 2019

274 words 2 mins read

Working with the data of sports

Working with the data of sports

Sports analytics today is more than a matter of analyzing box scores and play-by-play statistics. Faced with detailed on-field or on-court data from every game, sports teams face challenges in data management, data engineering, and analytics. Thomas Miller details the challenges faced by a Major League Baseball team as it sought competitive advantage through data science and deep learning.

Talk Title Working with the data of sports
Speakers Thomas Miller (Northwestern University)
Conference Strata Data Conference
Conf Tag Big Data Expo
Location San Jose, California
Date March 6-8, 2018
URL Talk Page
Slides Talk Slides
Video

There is a rich history of baseball fans and sports analysts using play-by-play information to compute traditional performance measures, such as batting average, on-base percentage, and slugging percentage for batters and earned-run average for pitchers. And there is no shortage of data: baseball records show team matchups, batter-pitcher matchups, outs and runners-on-base situations, and player on-field positions, and event codes represent the outcome of each play, along with runs scored. However, sports analytics today is more than a matter of analyzing box scores and play-by-play statistics. Faced with detailed on-field or on-court data from every game, sports teams face challenges in data management, data engineering, and analytics. Thomas Miller details the challenges faced by a Major League Baseball team as it sought competitive advantage through data science and deep learning. Thomas demonstrates how neural network models (methods from deep learning and natural language processing) can generate vector representations of teams and players, providing more complete measures of on-field performance. These vector representations can then be used to evaluate teams and players, predict runs scored, and guide in-game strategy.

comments powered by Disqus