-
A Statistical Model of Serve Return Impact Patterns in Professional Tennis
Authors:
Stephanie A. Kovalchik,
Jim Albert
Abstract:
The spread in the use of tracking systems in sport has made fine-grained spatiotemporal analysis a primary focus of an emerging sports analytics industry. Recently publicized tracking data for men's professional tennis allows for the first detailed spatial analysis of return impact. Mixture models are an appealing model-based framework for spatial analysis in sport, where latent variable discovery is often of primary interest. Although finite mixture models have the advantages of interpretability and scalability, most implementations assume standard parametric distributions for outcomes conditioned on latent variables. In this paper, we present a more flexible alternative that allows the latent conditional distribution to be a mixed member of finite Gaussian mixtures. Our model was motivated by our efforts to describe common styles of return impact location among professional tennis players, which is why we name the approach a 'latent style allocation' model. In a fully Bayesian implementation, we apply the model to 142,803 return points played by 141 top players at Association of Tennis Professionals events between 2018 and 2020 and show that the latent style allocation improves predictive performance over a finite Gaussian mixture model and identifies six unique impact styles on the first and second serve return.
Submitted 1 February, 2022;
originally announced February 2022.
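Illustration (not from the paper): the abstract above uses a finite Gaussian mixture model as its baseline. A minimal sketch of how such a baseline assigns a single observation to latent components is given here, assuming a one-dimensional impact depth and two made-up styles. The names WEIGHTS, MEANS, SDS and all numeric values are hypothetical, and this is not the latent style allocation model itself.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a univariate normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def responsibilities(x, weights, means, sds):
    """Posterior probability of each mixture component given observation x."""
    joint = [w * normal_pdf(x, m, s) for w, m, s in zip(weights, means, sds)]
    total = sum(joint)
    return [j / total for j in joint]


# Hypothetical two-style mixture over impact depth (metres behind the baseline).
WEIGHTS = [0.6, 0.4]   # assumed mixing proportions
MEANS = [1.0, 4.0]     # assumed mean depth for each style
SDS = [0.8, 1.2]       # assumed standard deviation for each style

# Component probabilities for a return struck 1.5 m behind the baseline.
probs = responsibilities(1.5, WEIGHTS, MEANS, SDS)
```

With these assumed parameters, a return struck close to the baseline is attributed mostly to the first component and a deep return mostly to the second.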
-
A Markov process approach to untangling intention versus execution in tennis
Authors:
Timothy C. Y. Chan,
Douglas S. Fearing,
Craig Fernandes,
Stephanie Kovalchik
Abstract:
Value functions are used in sports applications to determine the optimal action players should employ. However, most literature implicitly assumes that the player can perform the prescribed action with known and fixed probability of success. The effect of varying this probability or, equivalently, of "execution error" in implementing an action (e.g., hitting a tennis ball to a specific location on the court) on the design of optimal strategies has received limited attention. In this paper, we develop a novel modeling framework based on Markov reward processes and Markov decision processes to investigate how execution error impacts a player's value function and strategy in tennis. We power our models with hundreds of millions of simulated tennis shots with 3D ball and 2D player tracking data. We find that optimal shot selection strategies in tennis become more conservative as execution error grows, and that having perfect execution with the empirical shot selection strategy is roughly equivalent to choosing one or two optimal shots with average execution error. We find that execution error on backhand shots is more costly than on forehand shots, and that optimal shot selection on a serve return is more valuable than on any other shot, over all values of execution error.
Submitted 4 October, 2021;
originally announced October 2021.
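Illustration (not from the paper): the finding above, that optimal shot selection becomes more conservative as execution error grows, can be reproduced qualitatively in a one-shot toy problem. The sketch assumes the landing position is the aim point plus Gaussian error, that a ball landing out is worth 0, and that a ball landing in wins the point with a fixed probability. The two targets, their margins and their win probabilities are hypothetical, and the single decision stands in for what the paper models as a full Markov decision process.

```python
import math


def prob_in(margin, sigma):
    """P(ball lands in) when aiming `margin` metres inside the line
    with normally distributed execution error of standard deviation sigma."""
    return 0.5 * (1.0 + math.erf(margin / (sigma * math.sqrt(2.0))))


# Hypothetical targets: (margin inside the line in metres, P(win point | ball in)).
TARGETS = {
    "aggressive": (0.3, 0.70),
    "conservative": (1.5, 0.50),
}


def expected_value(target, sigma):
    """Expected point-win probability of aiming at `target`; an out ball scores 0."""
    margin, win_if_in = TARGETS[target]
    return prob_in(margin, sigma) * win_if_in


def best_target(sigma):
    """Target with the highest expected value at execution error sigma."""
    return max(TARGETS, key=lambda t: expected_value(t, sigma))


low_error_choice = best_target(0.2)    # small execution error
high_error_choice = best_target(1.0)   # large execution error
```

Under these assumed numbers the aggressive target is preferred when sigma is 0.2 m, and the conservative target is preferred when sigma is 1.0 m, because the chance of missing long or wide outweighs the higher payoff near the line.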
-
Space-Time VON CRAMM: Evaluating Decision-Making in Tennis with Variational generatiON of Complete Resolution Arcs via Mixture Modeling
Authors:
Stephanie Kovalchik,
Martin Ingram,
Kokum Weeratunga,
Cagatay Goncu
Abstract:
Sports tracking data are the high-resolution spatiotemporal observations of a competitive event. The growing collection of these data in professional sport allows us to address a fundamental problem of modern sport: how to attribute value to individual actions? Taking advantage of the smoothness of ball and player movement in tennis, we present a functional data framework for estimating expected shot value (ESV) in continuous time. Our approach is a three-step recipe: 1) a generative model for a full-resolution functional representation of ball and player trajectories using an infinite Bayesian Gaussian mixture model (GMM), 2) conditioning of the GMM on observed positional data, and 3) the prediction of shot outcomes given the functional encoding of a shot event. From the ESV we derive three metrics of central interest: value added with shot taking (VAST), Shot IQ, and value added with court coverage (VACC), which respectively attribute value to shot execution, shot selection and movement around the court. We rate player performance at the 2019 US Open on these advanced metrics and show how each adds a novel perspective to performance evaluation in tennis that goes beyond simple counts of outcomes by quantitatively assessing the decisions players make throughout a point.
Submitted 22 May, 2020;
originally announced May 2020.
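Illustration (not from the paper): step 2 of the recipe above conditions a Gaussian mixture on observed positional data. A minimal sketch of that operation for a finite mixture of bivariate Gaussians with known parameters follows, where x is the observed coordinate and y the unobserved one. The function names and the tuple layout are hypothetical, and the paper's infinite Bayesian GMM over functional trajectory encodings is considerably richer than this.

```python
import math


def normal_pdf(x, mean, var):
    """Density of a univariate normal with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def condition_gmm(x_obs, components):
    """Condition a bivariate Gaussian mixture on x = x_obs.

    components: list of (weight, mu_x, mu_y, var_x, var_y, cov_xy).
    Returns a list of (new_weight, cond_mean, cond_var) describing y | x = x_obs.
    """
    # Each component is re-weighted by how well it explains the observed x.
    unnorm = [w * normal_pdf(x_obs, mx, vx) for w, mx, _, vx, _, _ in components]
    total = sum(unnorm)
    conditioned = []
    for u, (_, mx, my, vx, vy, cov) in zip(unnorm, components):
        cond_mean = my + cov / vx * (x_obs - mx)   # Gaussian conditional mean
        cond_var = vy - cov * cov / vx             # Gaussian conditional variance
        conditioned.append((u / total, cond_mean, cond_var))
    return conditioned


def conditional_mean(x_obs, components):
    """Overall mean of y | x = x_obs under the mixture."""
    return sum(w * m for w, m, _ in condition_gmm(x_obs, components))


# Single-component check: unit variances, covariance 0.5, observed x = 2.
single = [(1.0, 0.0, 0.0, 1.0, 1.0, 0.5)]
weight, mean, var = condition_gmm(2.0, single)[0]
```

The result is again a Gaussian mixture, so the conditioned model can be sampled or summarized with the same tools as the original one.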