Multiple window discrete scan statistic for higher-order Markovian sequences |
| |
Authors: | Deidra A Coleman, Donald EK Martin, Brian J Reich |
| |
Institution: | Department of Statistics, North Carolina State University, Raleigh, NC, USA |
| |
Abstract: | Accurate and efficient methods to detect unusual clusters of abnormal activity are needed in many fields, such as medicine and business. The size of a cluster is often unknown; hence, multiple (variable) window scan statistics are used to identify clusters over a set of candidate cluster sizes. We give an efficient method to compute the exact distribution of multiple window discrete scan statistics for higher-order, multi-state Markovian sequences. We define a Markov chain that keeps track of the probabilities needed to compute p-values for the statistic. The state space of this Markov chain is constructed using a criterion that identifies the strings associated with observing the specified values of the statistic. Using our algorithm, we identify cases where the available approximations do not perform well. We demonstrate our methods by detecting unusual clusters of made free throws by National Basketball Association players during the 2009–2010 regular season. |
| |
Keywords: | one-dimensional scan statistics, p-values, variable windows |
|
|
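The idea in the abstract of tracking probabilities with a Markov chain can be illustrated on a toy case. The sketch below is not the authors' algorithm: it assumes a binary, first-order Markov sequence, and it assumes the multiple window event is "for some window length w, some run of w consecutive trials contains at least k_w ones". It also keeps every recent history as a state rather than using the state-selection criterion described in the paper. The names `scan_hit`, `exact_hit_probability`, `brute_force_hit_probability`, and the `thresholds` mapping are hypothetical and introduced only for this illustration.

```python
from itertools import product


def scan_hit(seq, thresholds):
    """Return True if, for some window length w, some window of w
    consecutive symbols in seq contains at least thresholds[w] ones."""
    for w, k in thresholds.items():
        for i in range(len(seq) - w + 1):
            if sum(seq[i:i + w]) >= k:
                return True
    return False


def exact_hit_probability(n, init, trans, thresholds):
    """Exact P(event occurs within n trials), n >= 1, for a binary
    first-order Markov sequence (assumed model for this sketch).

    init[a]     = P(X_1 = a)
    trans[a][b] = P(X_{t+1} = b | X_t = a)

    States are the most recent symbols of sequences that have not yet
    triggered the event; probability mass that triggers it is dropped,
    which plays the role of an absorbing state.
    """
    keep = max(max(thresholds) - 1, 1)  # history length carried forward
    alive = {}
    for a in (0, 1):
        if init[a] > 0 and not scan_hit((a,), thresholds):
            alive[(a,)] = init[a]
    for _ in range(n - 1):
        nxt = {}
        for hist, p in alive.items():
            for b in (0, 1):
                q = p * trans[hist[-1]][b]
                new = hist + (b,)
                # Only windows ending at the newest symbol can be new hits,
                # and all of them fit inside `new`.
                if q > 0 and not scan_hit(new, thresholds):
                    key = new[-keep:]
                    nxt[key] = nxt.get(key, 0.0) + q
        alive = nxt
    return 1.0 - sum(alive.values())


def brute_force_hit_probability(n, init, trans, thresholds):
    """Same probability by enumerating all 2**n sequences (check only)."""
    total = 0.0
    for seq in product((0, 1), repeat=n):
        p = init[seq[0]]
        for a, b in zip(seq, seq[1:]):
            p *= trans[a][b]
        if scan_hit(seq, thresholds):
            total += p
    return total


if __name__ == "__main__":
    # Illustrative parameters only; they are not taken from the paper.
    init = [0.5, 0.5]
    trans = [[0.7, 0.3], [0.4, 0.6]]
    thresholds = {3: 3, 5: 4}  # window length -> minimum count of ones
    print(exact_hit_probability(10, init, trans, thresholds))
    print(brute_force_hit_probability(10, init, trans, thresholds))
```

Under these assumptions, the returned probability is the p-value of the event "at least one window reaches its threshold". The state-based recursion does work proportional to n times the number of surviving histories, while the enumeration grows as 2^n, which is why a chain-based computation is attractive for long sequences.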