Similar literature
 20 similar articles found (search time: 46 ms)
1.
Statistical modelling of sports data has become more and more popular in recent years and different types of models have been proposed to achieve a variety of objectives: from identifying the key characteristics which lead a team to win or lose to predicting the outcome of a game or the team rankings in national leagues. Although not as popular as football or basketball, volleyball is a team sport with both national and international level competitions in almost every country. However, there is almost no study investigating the prediction of volleyball game outcomes and team rankings in national leagues. We propose a Bayesian hierarchical model for the prediction of the rankings of volleyball national teams, which also allows the results of each match in the league to be estimated. We consider two alternative model specifications of different complexity which are validated using data from the women's volleyball Italian Serie A1 2017–2018 season.

2.
The effectiveness of coach turnover on team performance is widely discussed in the literature due to the indirect impact of a team's performance on a club's revenues. This study examines the effect of coach turnover within a competition season by focusing on the change in team quality and the change in home team advantage under the new coach. The change in team quality or home team advantage can vary according to the team (team specific) or might be an independent quantity (non-team specific). We estimated nine possible regression models, given no change, team-specific change and non-team-specific change in quality or home team advantage. The data are the match results of Belgian male soccer teams playing in the highest national division during seven seasons. Results point to a team-specific effect of a new coach on a team's quality. This article further contributes by evaluating the new coach's success with regard to whether his ability to improve team quality also results in a better position of the team in the final ranking. A new coach will be able to improve the ranking of the team if the improved team quality under the new coach renders a positive team quality.

3.
The scoring and defensive abilities of Australian Rules Football teams change over time as a result of evolving player rosters, tactics and other management factors. We develop a dynamic model based on the Poisson difference (Skellam) distribution which simultaneously models the two different point scoring mechanisms in Australian Rules Football, the motivation for which comes from work on predicting outcomes in soccer matches. Our model is developed in a Bayesian framework and is fitted using the Stan modelling language. Model validation is performed on the 2015 Australian Football League (AFL) home and away season.
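The Skellam distribution named in this abstract is the distribution of the difference of two independent Poisson counts. The following is a minimal sketch of its probability mass function, not the authors' dynamic Stan model: the function names, the truncation length `terms`, and the example rates 2.0 and 1.5 are illustrative assumptions.

```python
import math


def poisson_pmf(k: int, mu: float) -> float:
    """P(X = k) for X ~ Poisson(mu), mu > 0, computed on the log scale."""
    if k < 0:
        return 0.0
    return math.exp(k * math.log(mu) - mu - math.lgamma(k + 1))


def skellam_pmf(k: int, mu1: float, mu2: float, terms: int = 150) -> float:
    """P(X1 - X2 = k) for independent X1 ~ Poisson(mu1), X2 ~ Poisson(mu2).

    The infinite sum over X2 = y is truncated after `terms` values
    (an assumption that is adequate for small rates).
    """
    start = max(0, -k)  # X1 = y + k must be non-negative
    return sum(poisson_pmf(y + k, mu1) * poisson_pmf(y, mu2)
               for y in range(start, start + terms))


def win_probability(mu1: float, mu2: float, max_margin: int = 60) -> float:
    """P(X1 > X2): total Skellam mass on positive margins (truncated)."""
    return sum(skellam_pmf(k, mu1, mu2) for k in range(1, max_margin + 1))


# Hypothetical scoring rates, chosen only for illustration.
print(win_probability(2.0, 1.5))
```

Working with the difference directly is convenient when the quantity of interest is the margin rather than the two individual scores.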

4.
This article attempts to predict home run hitting performance of Major League Baseball players using a Bayesian semiparametric model. Following Berry, Reese and Larkey we include in the model effects for era of birth, season of play, and home ball park. We estimate performance curves for each player using orthonormal quartic polynomials. We use a Dirichlet process prior on the unknown distribution for the coefficients of the polynomials, and parametric priors for the other effects. Dirichlet process priors are useful in prediction for two reasons: (1) an increased probability of obtaining more precise prediction comes with the increased flexibility of the prior specification, and (2) the clustering inherent in the Dirichlet process provides the means to share information across players. Data from 1871 to 2008 were used to fit the model. Data from 2009 to 2016 were used to test the predictive ability of the model. A parametric model was also fit to compare the predictive performance of the models. We used what we called “pure performance” curves to predict future performance for 22 players. The nonparametric method provided superior predictive performance.

5.
Fantasy sports, particularly the daily variety in which new lineups are selected each day, are a rapidly growing industry. The two largest companies in the daily fantasy business, DraftKings and FanDuel, have been valued as high as $2 billion. This research focuses on the development of a complete system for daily fantasy basketball, including both the prediction of player performance and the construction of a team. First, a Bayesian random effects model is used to predict an aggregate measure of daily NBA player performance. The predictions are then used to construct teams under the constraints of the game, typically related to a fictional salary cap and player positions. Permutation-based and K-nearest neighbors approaches are compared in terms of the identification of “successful” teams, those who would be competitive more often than not based on historical data. We demonstrate the efficacy of our system by comparing our predictions to those from a well-known analytics website, and by simulating daily competitions over the course of the 2015–2016 season. Our results show an expected profit of approximately $9,000 on an initial $500 investment using the K-nearest neighbors approach, a 36% increase relative to using the permutation-based approach alone. Supplementary materials for this article are available online.

6.
Summary. When an individual player or team enjoys periods of good form, and when these occur, is a widely observed phenomenon typically called 'streakiness'. It is interesting to assess which team is a streaky team, or who is a streaky player in sports. Such competitors might have a large number of successes during some periods and few or no successes during other periods. Thus, their success rate is not constant over time. We provide a Bayesian binary segmentation procedure for locating changepoints and the associated success rates simultaneously for these competitors. The procedure is based on a series of nested hypothesis tests each using the Bayes factor or the Bayesian information criterion. At each stage, we only need to compare a model with one changepoint with a model based on a constant success rate. Thus, the method circumvents the computational complexity that we would normally face in problems with an unknown number of changepoints. We apply the procedure to data corresponding to sports teams and players from basketball, golf and baseball.
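The staged comparison described in this abstract (one changepoint versus a constant success rate, applied recursively) can be sketched as follows. This is an illustrative simplification under stated assumptions, not the authors' procedure: it uses the Bayesian information criterion with maximum-likelihood success rates, counts the changepoint location as one extra parameter, and the names `find_changepoints` and `min_len` are hypothetical.

```python
import math


def bernoulli_loglik(successes: int, n: int) -> float:
    """Maximised Bernoulli log-likelihood for `successes` out of `n` trials."""
    if n == 0 or successes in (0, n):
        return 0.0  # the fitted rate is 0 or 1, so the likelihood is 1
    p = successes / n
    return successes * math.log(p) + (n - successes) * math.log(1 - p)


def find_changepoints(data, offset=0, min_len=5):
    """Recursive binary segmentation of a 0/1 sequence using BIC.

    Returns the sorted indices at which a new success rate starts.
    """
    n = len(data)
    if n < 2 * min_len:
        return []
    total = sum(data)
    # Constant-rate model: one parameter.
    best_bic = -2 * bernoulli_loglik(total, n) + math.log(n)
    best_split = None
    left = sum(data[:min_len - 1])  # running count of successes left of the split
    for split in range(min_len, n - min_len + 1):
        left += data[split - 1]
        loglik = (bernoulli_loglik(left, split)
                  + bernoulli_loglik(total - left, n - split))
        # One-changepoint model: two rates plus the changepoint location.
        bic = -2 * loglik + 3 * math.log(n)
        if bic < best_bic:
            best_bic, best_split = bic, split
    if best_split is None:
        return []  # the constant-rate model is preferred; stop splitting
    return (find_changepoints(data[:best_split], offset, min_len)
            + [offset + best_split]
            + find_changepoints(data[best_split:], offset + best_split, min_len))


# Hypothetical record: 40 failures followed by 40 successes.
print(find_changepoints([0] * 40 + [1] * 40))
```

Because each stage only compares two models, the cost grows with the number of segments found rather than with the number of possible changepoint configurations.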

7.
The problem of modelling football data has become increasingly popular in the last few years and many different models have been proposed with the aim of estimating the characteristics that bring a team to lose or win a game, or to predict the score of a particular match. We propose a Bayesian hierarchical model to fulfil both these aims and test its predictive strength based on data about the Italian Serie A 1991–1992 championship. To overcome the issue of overshrinkage produced by the Bayesian hierarchical model, we specify a more complex mixture model that results in a better fit to the observed data. We test its performance using an example of the Italian Serie A 2007–2008 championship.

8.
In this paper, we provide probabilistic predictions for soccer games of the 2010 FIFA World Cup modelling the number of goals scored in a game by each team. We use a Poisson distribution for the number of goals for each team in a game, where the scoring rate is considered unknown. We use a Gamma distribution for the scoring rate and the Gamma parameters are chosen using historical data and differences among teams defined by a strength factor for each team. The strength factor is a measure of discrimination among the national teams obtained from their memberships to fuzzy clusters. The clusters are obtained with the use of the Fuzzy C-means algorithm applied to a vector of variables, most of them available on the official FIFA website. Static and dynamic models were used to predict the World Cup outcomes and the performance of our predictions was evaluated using two comparison methods.
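When goals are Poisson with an unknown rate and the rate has a Gamma distribution, integrating out the rate gives a negative binomial predictive distribution for the number of goals. The sketch below illustrates only that step. The Gamma parameters are hypothetical placeholders (the abstract derives them from historical data and fuzzy-cluster strength factors, which are not reproduced here), and the two teams' goal counts are assumed independent.

```python
import math


def goals_pmf(k: int, shape: float, rate: float) -> float:
    """Predictive P(goals = k) for goals ~ Poisson(lam), lam ~ Gamma(shape, rate).

    Integrating out lam gives a negative binomial distribution.
    """
    log_p = (math.lgamma(shape + k) - math.lgamma(shape) - math.lgamma(k + 1)
             + shape * math.log(rate / (rate + 1.0))
             - k * math.log(rate + 1.0))
    return math.exp(log_p)


def outcome_probabilities(team_a, team_b, max_goals=40):
    """Return (P(A wins), P(draw), P(B wins)) for (shape, rate) pairs.

    The score grid is truncated at `max_goals` goals per team.
    """
    pmf_a = [goals_pmf(k, *team_a) for k in range(max_goals + 1)]
    pmf_b = [goals_pmf(k, *team_b) for k in range(max_goals + 1)]
    win = draw = loss = 0.0
    for i, pa in enumerate(pmf_a):
        for j, pb in enumerate(pmf_b):
            if i > j:
                win += pa * pb
            elif i == j:
                draw += pa * pb
            else:
                loss += pa * pb
    return win, draw, loss


# Hypothetical Gamma parameters (shape, rate) for two teams.
print(outcome_probabilities((3.0, 2.0), (2.0, 2.0)))
```

The prior mean scoring rate is `shape / rate`, so a stronger team can be represented by a larger shape or a smaller rate.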

9.
The paper examines to what extent a player's market value depends on his skills. To this end, a data set covering 28 performance measures and the market values of about 493 players from the German 1. and 2. Bundesliga is analysed. Applying robust analysis techniques, we are able to robustly estimate market values of soccer players. The results show (1) that there are significantly underrated and overrated players and (2) that a player's affiliation to a certain team may contribute to his market value. We conclude that a club's reputation affects the market values of its players and that star players tend to be overrated.

10.
Numerous statistics have been proposed to evaluate offensive performance in baseball. Each statistic or estimator can be related to an explicit run production model. An evaluation of ten run production models is presented based on major league team run production in the period 1969–1976. The results are applied to evaluating individual players in both the American and National baseball leagues.

11.
Recent changes in European family dynamics are often linked to common latent trends of economic and ideational change. Using Bayesian factor analysis, we extract three latent variables from eight socio-demographic indicators related to family formation, dissolution, and gender system and collected on 19 European countries within four periods (1970, 1980, 1990, 1998). The flexibility of the Bayesian approach allows us to introduce an innovative temporal factor model, adding the temporal dimension to the traditional factorial analysis. The underlying structure of the Bayesian factor model proposed reflects our idea of an autoregressive pattern in the latent variables relative to adjacent time periods. The results we obtain are consistent with current interpretations in European demographic trends.

12.
Models in which the number of goals scored by a team in a soccer match follow a Poisson distribution, or a closely related one, have been widely discussed. We here consider a soccer match as an experiment to assess which of two teams is superior and examine the probability that the outcome of the experiment (match) truly represents the relative abilities of the two teams. Given a final score, it is possible by using a Bayesian approach to quantify the probability that it was or was not the case that ‘the best team won’. For typical scores, the probability of a misleading result is significant. Modifying the rules of the game to increase the typical number of goals scored would improve the situation, but a level of confidence that would normally be regarded as satisfactory could not be obtained unless the character of the game was radically changed.
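One way to make the "best team won" question concrete is sketched below. The prior is an assumption made here for illustration and is not taken from the paper: each team's goals are Poisson with rates λ1 and λ2, and both rates have independent exponential priors with a common rate β. After a score of g1 to g2 the posteriors are Gamma(g1 + 1, β + 1) and Gamma(g2 + 1, β + 1), so λ1 / (λ1 + λ2) follows a Beta(g1 + 1, g2 + 1) distribution and P(λ1 > λ2) reduces to a binomial tail that does not depend on β.

```python
from math import comb


def prob_first_team_better(g1: int, g2: int) -> float:
    """P(lambda1 > lambda2 | score g1-g2) under the assumed exponential priors.

    Uses the identity P(Beta(a, b) > 1/2) = P(Binomial(a + b - 1, 1/2) <= a - 1)
    for integer a = g1 + 1 and b = g2 + 1.
    """
    n = g1 + g2 + 1
    return sum(comb(n, j) for j in range(g1 + 1)) / 2 ** n


# Probability that a 1-0 winner really has the higher scoring rate.
print(prob_first_team_better(1, 0))
```

Under these assumptions a 1-0 win leaves a probability of 0.25 that the losing side has the higher scoring rate, which illustrates why low-scoring results carry limited information.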

13.
Using play-by-play data from the very beginning of the professional football league in Turkey, a semi-Markov model is presented for describing the performance of football teams. The official match results of the selected teams during 55 football seasons are used and winning, drawing and losing are considered as Markov states. The semi-Markov model is constructed with transition rates inferred from the official match results. The duration between the last match of a season and the very first match of the following season is much longer than any other duration during the season. Therefore these values are considered as missing values and estimated using the expectation–maximization algorithm. The effect of the sojourn time in a state on the performance of a team is discussed, and mean sojourn times after losing and winning are estimated. The limiting probabilities of winning, drawing and losing are calculated. Some insights about the performance of the selected teams are presented.
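The idea of treating win, draw and loss as states and computing limiting probabilities can be sketched with an ordinary discrete-time Markov chain. This is a deliberate simplification: it ignores the sojourn times and the expectation–maximization step of the semi-Markov model in the abstract, and the result string and function names are hypothetical.

```python
STATES = "WDL"  # win, draw, loss


def transition_matrix(results: str):
    """Estimate transition probabilities from consecutive match results."""
    counts = {s: {t: 0 for t in STATES} for s in STATES}
    for prev, nxt in zip(results, results[1:]):
        counts[prev][nxt] += 1
    matrix = []
    for s in STATES:
        row_total = sum(counts[s].values())
        if row_total == 0:
            raise ValueError(f"no transitions observed out of state {s!r}")
        matrix.append([counts[s][t] / row_total for t in STATES])
    return matrix


def limiting_probabilities(matrix, iterations=1000):
    """Approximate the stationary distribution by repeated multiplication.

    Assumes the chain is irreducible and aperiodic so that the limit exists.
    """
    n = len(matrix)
    pi = [1.0 / n] * n
    for _ in range(iterations):
        pi = [sum(pi[i] * matrix[i][j] for i in range(n)) for j in range(n)]
    return pi


# Hypothetical sequence of results for one team.
P = transition_matrix("WWDLWDWLLWDWWLDW")
print(dict(zip(STATES, limiting_probabilities(P))))
```

The limiting probabilities describe the long-run share of wins, draws and losses implied by the estimated transitions.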

14.
The living hours data of individuals' time spent on daily activities are compositional and include many zeros because individuals do not pursue all activities every day. Thus, we should exercise caution in using such data for empirical analyses. The Bayesian method offers several advantages in analyzing compositional data. In this study, we analyze the time allocation of Japanese married couples using the Bayesian model. Based on the Bayes factors, we compare models that consider and do not consider the correlations between married couples' time use data. The model that considers the correlation shows superior performance. We show that the Bayesian method can adequately take into account the correlations of wives' and husbands' living hours, facilitating the calculation of partial effects that their activities' variables have on living hours. The partial effects of the model that considers the correlations between the couples' time use are easily calculated from the posterior results.

15.
Measuring players' performance in team sports is fundamental since managers need to evaluate players with respect to the ability to score during crucial moments of the game. Using Classification and Regression Trees (CART) and play-by-play basketball data, we estimate the probability of scoring a shot with respect to a selection of game covariates related to game pressure. We use scoring probabilities to develop a player-specific shooting performance index that takes into account the difficulty of scoring different types of shots. By applying this procedure to a large sample of 2016–2017 Basketball Champions League (BCL) and 2017–2018 National Basketball Association (NBA) games, we compare the factors affecting shooting performance in Europe and in the United States and we evaluate a selection of players in terms of the proposed shooting performance index with the final aim of providing useful guidelines for the team strategy.

16.
SUMMARY Using a common framework, this paper presents a survey of the major world sports rating systems (WSRSs) in skiing (sponsored by the International Skiing Federation (FIS)), men's tennis (Association of Tennis Professionals (ATP)), women's tennis (Women's Tennis Association (WTA)), soccer (Federation of International Football Associations (FIFA)) and golf (Royal and Ancient Golf Club of St Andrews). These systems are not otherwise available in the literature. Each of the WSRSs has three phases: first, the observed results are weighted to provide points for each competition; second, these points are combined to provide a seasonal value; third, the seasonal values are combined to provide a rating. The final result or placement (and not the score or time) is the most important factor in determining points for a given competition. In skiing, men's tennis and women's tennis, the rating is calculated from results over one season, while three seasons are used in golf and six seasons are used in soccer. In cross-country skiing and men's tennis, the seasonal value is calculated from the sum of the best values from that season's competitions. In alpine skiing and women's tennis, the sum of all values from that season's competitions is used. In golf and soccer, an averaging process is used. Besides potentially encouraging more entries, a 'best' system and one using all values also generate simple integer ratings rather than decimal ratings as are obtained with an averaging system. The simplest system is that of FIS in skiing, where one table of points is used for all alpine and cross-country disciplines. In contrast, considering that soccer (as a sport) prides itself on the simplicity of the game, it is surprising that FIFA's system is so complex. It is also surprising in soccer that a 'friendly' (often a pick-up exhibition used for player development) counts two-thirds as much as does a World Cup final played before a worldwide TV audience.
It is hoped that this survey will serve as a valuable resource for those studying sports rating systems.

17.
In a previous paper, it was demonstrated that distinctly different prediction methods when applied to 2435 American college and professional football games resulted in essentially the same fraction of correct selections of the winning team and essentially the same average absolute error for predicting the margin of victory. These results are now extended to 1446 Australian rules football games. Two distinctly different prediction methods are applied. A least-squares method provides a set of ratings. The predicted margin of victory in the next contest is less than the rating difference, corrected for home-ground advantage, while a 0.75 power method shrinks the ratings compared with those found by the least-squares technique and then performs predictions based on the rating difference and home-ground advantage. Both methods operate upon past margins of victory corrected for home advantage to obtain the ratings. It is shown that both methods perform similarly, based on the fraction of correct selections of the winning team and the average absolute error for predicting the margin of victory. That is, differing predictors using the same information tend to converge to a limiting level of accuracy. The least-squares approach also provides estimates of the accuracy of each prediction. The home advantage is evaluated for all teams collectively and also for individual teams. The data permit comparisons with other sports in other countries. The home team appears to have an advantage (the visiting team has a disadvantage) due to three factors: the visiting team suffers from travel fatigue; crowd intimidation by the home team fans; and lack of familiarity with the playing conditions.

18.
This paper uses a new bivariate negative binomial distribution to model scores in the 1996 Australian Rugby League competition. First, scores are modelled using the home ground advantage but ignoring the actual teams playing. Then a bivariate negative binomial regression model is introduced that takes into account the offensive and defensive capacities of each team. Finally, the 1996 season is simulated using the latter model to determine whether or not Manly did indeed deserve to win the competition.

19.
Summary. We model daily catches of fishing boats in the Grand Bank fishing grounds. We use data on catches per species for a number of vessels collected by the European Union in the context of the Northwest Atlantic Fisheries Organization. Many variables can be thought to influence the amount caught: a number of ship characteristics (such as the size of the ship, the fishing technique used and the mesh size of the nets) are obvious candidates, but one can also consider the season or the actual location of the catch. Our database leads to 28 possible regressors (arising from six continuous variables and four categorical variables, whose 22 levels are treated separately), resulting in a set of 177 million possible linear regression models for the log-catch. Zero observations are modelled separately through a probit model. Inference is based on Bayesian model averaging, using a Markov chain Monte Carlo approach. Particular attention is paid to the prediction of catches for single and aggregated ships.

20.
Inference in hybrid Bayesian networks using dynamic discretization   Total citations: 1 (self-citations: 0; citations by others: 1)
We consider approximate inference in hybrid Bayesian Networks (BNs) and present a new iterative algorithm that efficiently combines dynamic discretization with robust propagation algorithms on junction trees. Our approach offers a significant extension to Bayesian Network theory and practice by offering a flexible way of modeling continuous nodes in BNs conditioned on complex configurations of evidence and intermixed with discrete nodes as both parents and children of continuous nodes. Our algorithm is implemented in a commercial Bayesian Network software package, AgenaRisk, which allows model construction and testing to be carried out easily. The results from the empirical trials clearly show how our software can deal effectively with different types of hybrid models containing elements of expert judgment as well as statistical inference. In particular, we show how the rapid convergence of the algorithm towards zones of high probability density makes robust inference analysis possible even in situations where, due to the lack of information in both prior and data, robust sampling becomes unfeasible.
