首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In this paper, we provide probabilistic predictions for soccer games of the 2010 FIFA World Cup modelling the number of goals scored in a game by each team. We use a Poisson distribution for the number of goals for each team in a game, where the scoring rate is considered unknown. We use a Gamma distribution for the scoring rate and the Gamma parameters are chosen using historical data and difference among teams defined by a strength factor for each team. The strength factor is a measure of discrimination among the national teams obtained from their memberships to fuzzy clusters. The clusters are obtained with the use of the Fuzzy C-means algorithm applied to a vector of variables, most of them available on the official FIFA website. Static and dynamic models were used to predict the World Cup outcomes and the performance of our predictions was evaluated using two comparison methods.  相似文献   

2.
ABSTRACT

In successive sampling some recent works depict the use of super-population models where information on stable auxiliary variable over occasions has been utilized. Stability character of auxiliary variable may not sustain, if the duration between occasions is large. To cope with such situations, the present work is an attempt to develop some estimation procedures by utilizing the information on two independent auxiliary variables through a linear super-population model. Some estimators are proposed to estimate the current population mean in two occasions successive (rotation) sampling. Optimum replacement strategies are formulated and performances of the proposed estimators have been discussed. Results are interpreted through empirical studies.  相似文献   

3.
The problem of simultaneously estimating p Gamma means is investigated when the means are believed a priori to satisfy an r-dimensional generalized linear model. Using a Bayesian hierarchical model to reflect the uncertainty in the linear model, approximate methods are proposed to compute the posterior densities. The resulting estimator shrinks the usual estimator toward a prior estimator where the size of the shrinkage depends upon the agreement of the observed data with the proposed generalized linear model.  相似文献   

4.
In this paper we determine the Gauss–Markov predictor of the nonobservable part of a random vector satisfying the linear model under a linear constraint.  相似文献   

5.
Zhijun Liu 《Statistics》2013,47(2):109-119
In this paper, the robustness of the least distances (LD) estimate in multivariate linear models, as defined by Bai, Chen, Miao and Rao (1990), is discussed in terms of the influence function as well as the breakdown point. The LD estimate is shown to be more robust than the least squares (LS) estimate. The robustness of the LD is similar to that of the least absolute deviations (LAD) estimate, a well studied robust estimate in the univariate case. In particular, if there are no outliers in the design matrices, the breakdown point of the LD estimate reaches the highest value, 1/2.  相似文献   

6.
Semi-parametric modelling of interval-valued data is of great practical importance, as exampled by applications in economic and financial data analysis. We propose a flexible semi-parametric modelling of interval-valued data by integrating the partial linear regression model based on the Center & Range method, and investigate its estimation procedure. Furthermore, we introduce a test statistic that allows one to decide between a parametric linear model and a semi-parametric model, and approximate its null asymptotic distribution based on wild Bootstrap method to obtain the critical values. Extensive simulation studies are carried out to evaluate the performance of the proposed methodology and the new test. Moreover, several empirical data sets are analysed to document its practical applications.  相似文献   

7.
This work introduces specific tools based on phi-divergences to select and check generalized linear models with binary data. A backward selection criterion that helps to reduce the number of explanatory variables is considered. Diagnostic methods based on divergence measures such as a new measure to detect leverage points and two indicators to detect influential points are introduced. As an illustration, the diagnostics are applied to human psychology data.  相似文献   

8.
The geometric approach to the general linear model is in accessible to the majorityof statistics students be cause the computations require matrix algebra.This article presents the geometric approach for the special case of the bivariate linear model,for which the only tool require dis the in ner product.The geometric approach is introduced by showing the dual2-dimensional and5-dimensional representations of several bivariate samples x of size5.The assumptions of the bivariate model are stated geometrically,and the distributions of the regression coefficient sare derived.Theanalysis of variance(ANOVA)right triangle is pictured and the sides of the triang leare associated with their corresponding entries in the ANOVA table.  相似文献   

9.
Factor screening designs for searching two and three effective factors using the search linear model are discussed. The construction of such factor screening designs involved finding a fraction with small number of treatments of a 2m factorial experiment having the property P2t (no 2t columns are linearly dependent) for t=2 and 3. A ‘Packing Problem’ is introduced in this connection. A complete solution of the problem in one case and partial solutions for the other cases are presented. Many practically useful new designs are listed.  相似文献   

10.
Physical activity measurements derived from self-report surveys are prone to measurement errors. Monitoring devices like accelerometers offer more objective measurements of physical activity, but are impractical for use in large-scale surveys. A model capable of predicting objective measurements of physical activity from self-reports would offer a practical alternative to obtaining measurements directly from monitoring devices. Using data from National Health and Nutrition Examination Survey 2003–2006, we developed and validated models for predicting objective physical activity from self-report variables and other demographic characteristics. The prediction intervals produced by the models were large, suggesting that the ability to predict objective physical activity for individuals from self-reports is limited.  相似文献   

11.
Loss reserving is an important subject of actuarial mathematics. It aims at the prediction of future losses caused by claims which have incurred in the past but have not yet been closed. The problem of predicting such losses is particularly important in liability insurance. In the present paper we study conjoint prediction of paid and incurred losses in a linear model with a linear constraint which is intended to reduce the gap between the predictors of ultimate paid and incurred losses. We thus present an application to actuarial mathematics of the general result established by Kloberdanz and Schmidt (AStA Adv. Stat. Anal. 92:207–215, 2008).  相似文献   

12.
In this paper we present a Wald or distance test for testing the stability of a linear dynamic model. Stability requires that all latent roots of the system simultaneously satisfy inequality restrictions. Unlike previous tests proposed in the literature our procedure is capable of testing the restrictions simultaneously. Therefore, the test asymptotically has the correct size. The procedure can be applied in practice if stability is not a requirement for identification of the dynamic model.  相似文献   

13.
The demand for reliable statistics in subpopulations, when only reduced sample sizes are available, has promoted the development of small area estimation methods. In particular, an approach that is now widely used is based on the seminal work by Battese et al. [An error-components model for prediction of county crop areas using survey and satellite data, J. Am. Statist. Assoc. 83 (1988), pp. 28–36] that uses linear mixed models (MM). We investigate alternatives when a linear MM does not hold because, on one side, linearity may not be assumed and/or, on the other, normality of the random effects may not be assumed. In particular, Opsomer et al. [Nonparametric small area estimation using penalized spline regression, J. R. Statist. Soc. Ser. B 70 (2008), pp. 265–283] propose an estimator that extends the linear MM approach to the case in which a linear relationship may not be assumed using penalized splines regression. From a very different perspective, Chambers and Tzavidis [M-quantile models for small area estimation, Biometrika 93 (2006), pp. 255–268] have recently proposed an approach for small-area estimation that is based on M-quantile (MQ) regression. This allows for models robust to outliers and to distributional assumptions on the errors and the area effects. However, when the functional form of the relationship between the qth MQ and the covariates is not linear, it can lead to biased estimates of the small area parameters. Pratesi et al. [Semiparametric M-quantile regression for estimating the proportion of acidic lakes in 8-digit HUCs of the Northeastern US, Environmetrics 19(7) (2008), pp. 687–701] apply an extended version of this approach for the estimation of the small area distribution function using a non-parametric specification of the conditional MQ of the response variable given the covariates [M. Pratesi, M.G. Ranalli, and N. Salvati, Nonparametric m-quantile regression using penalized splines, J. Nonparametric Stat. 21 (2009), pp. 287–304]. We will derive the small area estimator of the mean under this model, together with its mean-squared error estimator and compare its performance to the other estimators via simulations on both real and simulated data.  相似文献   

14.
The problem of testing suspected outliers from a linear model with constant intraclass correlation is considered from a Bayesian viewpoint. The main objective of this paper is to develop an outlier test procedure based on the predictive distribution of suspected outlier observations given a set of existing inlier observations. The test procedure is easily performed with the usual F and t distributions.  相似文献   

15.
We provide an application of a variety of predicting densities to quality control involving multivariate normal linear models. We produce optimal control designs for single muleivaiiate future observations using predicting densities employing estimative, profile likelihood, Hinkley-Lauritzen, Butler, Bayesian, and Parametric Bootstrap methodologies. The decision-theoretic optimality criterion is an intuitively appealing quadratic consumer-producer risk function. The optimal control design arising from an optimal Kullback-Leibler frequentist prediction density is shown to coincide with that arising from an optimal Kullback-Leibler Bayesian predictive density. An example involving EVOP is provided to illustrate the methodology and to raise questions concerning the relative merics of the variety of predictive approaches in the quality control context.  相似文献   

16.
A generalized linear empirical Bayes model is developed for empirical Bayes analysis of several means in natural exponential families. A unified approach is presented for all natural exponential families with quadratic variance functions (the Normal, Poisson, Binomial, Gamma, and two others.) The hyperparameters are estimated using the extended quasi-likelihood of Nelder and Pregibon (1987), which is easily implemented via the GLIM package. The accuracy of these estimates is developed by asymptotic approximation of the variance. Two data examples are illustrated.  相似文献   

17.
The present paper considers the weighted mixed regression estimation of the coefficient vector in a linear regression model with stochastic linear restrictions binding the regression coefficients. We introduce a new two-parameter-weighted mixed estimator (TPWME) by unifying the weighted mixed estimator of Schaffrin and Toutenburg [1] and the two-parameter estimator (TPE) of Özkale and Kaç?ranlar [2]. This new estimator is a general estimator which includes the weighted mixed estimator, the TPE and the restricted two-parameter estimator (RTPE) proposed by Özkale and Kaç?ranlar [2] as special cases. Furthermore, we compare the TPWME with the weighted mixed estimator and the TPE with respect to the matrix mean square error criterion. A numerical example and a Monte Carlo simulation experiment are presented by using different estimators of the biasing parameters to illustrate some of the theoretical results.  相似文献   

18.
A simple method of setting linear hypotheses testable by F-tests in a general linear model when the covariance matrix has a general form and is completely unknown, is provided. With some additional conditions imposed on the covariance matrix, there exist the UMP invariant tests of certain linear hypotheses. We derive them to compare the powers with those of F-tests obtained under no restrictions on the covariance matrix. The results are illustrated in a multiple regression model with some examples.  相似文献   

19.
The Cash statistic, also known as the C statistic, is commonly used for the analysis of low-count Poisson data, including data with null counts for certain values of the independent variable. The use of this statistic is especially attractive for low-count data that cannot be combined, or re-binned, without loss of resolution. This paper presents a new maximum-likelihood solution for the best-fit parameters of a linear model using the Poisson-based Cash statistic. The solution presented in this paper provides a new and simple method to measure the best-fit parameters of a linear model for any Poisson-based data, including data with null counts. In particular, the method enforces the requirement that the best-fit linear model be non-negative throughout the support of the independent variable. The method is summarized in a simple algorithm to fit Poisson counting data of any size and counting rate with a linear model, by-passing entirely the use of the traditional χ2 statistic.  相似文献   

20.
ABSTRACT

In this paper, we consider the best linear unbiased estimators (BLUEs) based on double ranked set sampling (DRSS) and ordered DRSS (ODRSS) schemes for the simple linear regression model with replicated observations. We assume three symmetric distributions for the random error term, i.e., normal, Laplace and some scale contaminated normal distributions. The proposed BLUEs under DRSS (BLUEs-DRSS) and ODRSS (BLUEs-ODRSS) are compared with the BLUEs based on ordered simple random sampling (OSRS), ranked set sampling (RSS), and ordered RSS (ORSS) schemes. These estimators are compared in terms of relative efficiency (RE), RE of determinant (RED), and RE of trace (RET). It is found that the BLUEs-ODRSS are uniformly better than the BLUEs based on OSRS, RSS, ORSS, and DRSS schemes. We also compare the estimators based on imperfect RSS (IRSS) schemes. It is worth mentioning here that the BLUEs under ordered imperfect DRSS (OIDRSS) are better than their counterparts based on IRSS, ordered IRSS (OIRSS), and imperfect DRSS (IDRSS) methods. Moreover, for sensitivity analysis of the BLUEs, we calculate REs and REDs of the BLUEs under the assumption of normality when in fact the parent distribution follows a non normal symmetric distribution. It turns out that even under violation of normality assumptions, BLUEs of the intercept and the slope parameters are found to be unbiased with equal REs under each sampling scheme. It is also observed that the BLUEs under ODRSS are more efficient than the existing BLUEs.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号