Found 20 similar articles; search took 15 ms
1.
D. Firth & K. E. Bennett 《Journal of the Royal Statistical Society. Series B, Statistical methodology》1998,60(1):3-21
In the estimation of a population mean or total from a random sample, certain methods based on linear models are known to be automatically design consistent, regardless of how well the underlying model describes the population. A sufficient condition is identified for this type of robustness to model failure; the condition, which we call 'internal bias calibration', relates to the combination of a model and the method used to fit it. Included among the internally bias-calibrated models, in addition to the aforementioned linear models, are certain canonical link generalized linear models and nonparametric regressions constructed from them by a particular style of local likelihood fitting. Other models can often be made robust by using a suboptimal fitting method. Thus the class of model-based, but design consistent, analyses is enlarged to include more realistic models for certain types of survey variable such as binary indicators and counts. Particular applications discussed are the estimation of the size of a population subdomain, as arises in tax auditing for example, and the estimation of a bootstrap tail probability.
2.
《Journal of statistical planning and inference》1997,57(2):233-244
In this paper we review existing work on robust estimation for simultaneous equations models. Then we sketch three strategies for obtaining estimators with a high breakdown point and a controllable efficiency: (a) robustifying three-stage least squares, (b) robustifying the full information maximum likelihood method by minimizing the determinant of a robust covariance matrix of residuals, and (c) generalizing multivariate tau-estimators (Lopuhaä, 1992, Can. J. Statist., 19, 307–321) to these models. They have the same order of computational complexity as high breakdown point multivariate estimators. The latter seems the most promising approach.
3.
The growth curve model introduced by Potthoff and Roy (1964) is a general statistical model which includes as special cases regression models and both univariate and multivariate analysis of variance models. The methods currently available for estimating the parameters of this model assume an underlying multivariate normal distribution of errors. In this paper, we discuss two robust estimators of the growth curve location and scatter parameters based upon M-estimation techniques and the work done by Maronna (1976). The asymptotic distributions of these robust estimators are discussed and a numerical example is given.
4.
《Journal of statistical planning and inference》1996,55(2):205-217
Robust tests for testing subhypotheses in nonlinear models are developed. These are drop-in-dispersion testing procedures, score-type and Wald-type testing procedures. The asymptotic properties and influence functions are obtained. Robust tests that perform well in the presence of heteroscedasticity are also developed. Simulation results are provided to illustrate these procedures.
5.
Model misspecification in generalized linear models (GLMs) occurs usually when the linear predictor and/or the link function assumed are incorrect. This article discusses the effect of such misspecification on design selection for multinomial GLMs and proposes the use of quantile dispersion graphs to select robust designs. Due to misspecification in the model, parameter estimates are usually biased and the designs are compared on the basis of their mean squared error of prediction. Several numerical examples including a real data set are presented to illustrate the proposed methodology.
6.
In this paper we present two robust estimates for GARCH models. The first is defined by the minimization of a conveniently modified likelihood and the second is similarly defined, but includes an additional mechanism for restricting the propagation of the effect of one outlier on the next estimated conditional variances. We study the asymptotic properties of our estimates proving consistency and asymptotic normality. A Monte Carlo study shows that the proposed estimates compare favorably with respect to other robust estimates. Moreover, we consider some real examples with financial data that illustrate the behavior of these estimates.
7.
Ross H. Taplin 《Revue canadienne de statistique》1999,27(2):361-371
The author presents a robust F-test for comparing nested linear models. It is suggested that the approach will be attractive to practitioners because it is based on the familiar F-statistic and corresponds to the common practice of reporting F-statistics after removing obvious outliers. It is calibrated in terms of a real parameter that can be directly interpreted as the willingness of the data analyst to remove observations, and the sensitivity of the F-statistic to this parameter is easily examined. The procedure is evaluated with a simulation study where a scale mixture distribution is used to generate outliers. The procedure is also applied to some data where the occurrence of an outlier is confounded with the significance of a regression term. This provides a comparison of two competing models for the data: one removing an outlier and the other including an additional regression term instead.
8.
Kelvin K. W. Yau & Anthony Y. C. Kuk 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2002,64(1):101-117
Generalized linear mixed models (GLMMs) are widely used to analyse non-normal response data with extra-variation, but non-robust estimators are still routinely used. We propose robust methods for maximum quasi-likelihood and residual maximum quasi-likelihood estimation to limit the influence of outlying observations in GLMMs. The estimation procedure parallels the development of robust estimation methods in linear mixed models, but with adjustments in the dependent variable and the variance component. The methods proposed are applied to three data sets and a comparison is made with the nonparametric maximum likelihood approach. When applied to a set of epileptic seizure data, the methods proposed have the desired effect of limiting the influence of outlying observations on the parameter estimates. Simulation shows that one of the residual maximum quasi-likelihood proposals has a smaller bias than those of the other estimation methods. We further discuss the equivalence of two GLMM formulations when the response variable follows an exponential family. Their extensions to robust GLMMs and their comparative advantages in modelling are described. Some possible modifications of the robust GLMM estimation methods are given to provide further flexibility for applying the method.
9.
Lifetime Data Analysis - The accelerated failure time model is widely used for analyzing censored survival times often observed in clinical studies. It is well-known that the ordinary maximum...
10.
11.
Wolfgang Polasek & Klaus Pötzelberger 《Journal of statistical planning and inference》1994,40(2-3):295-311
A robust Bayesian analysis in a conjugate normal framework for the simple ANOVA model is suggested. By fixing the prior mean and varying the prior covariance matrix over a restricted class, we obtain the so-called HiFi and core region, a union and intersection of HPD regions. Based on these robust HPD regions we develop the concept of a ‘robust Bayesian judgement’ procedure. We apply this approach to the simple analysis of variance model with orthogonal designs. The example analyses the costs of an asthma medication obtained by a two-way cross-over study.
12.
Statistical problems in modelling personal-income distributions include estimation procedures, testing, and model choice. Typically, the parameters of a given model are estimated by classical procedures such as maximum-likelihood and least-squares estimators. Unfortunately, the classical methods are very sensitive to model deviations such as gross errors in the data, grouping effects, or model misspecifications. These deviations can ruin the values of the estimators and inequality measures and can produce false information about the distribution of the personal income in a country. In this paper we discuss the use of robust techniques for the estimation of income distributions. These methods behave like the classical procedures at the model but are less influenced by model deviations and can be applied to general estimation problems.
13.
We develop criteria that generate robust designs and use such criteria for the construction of designs that insure against possible misspecifications in logistic regression models. The design criteria we propose are different from the classical in that we do not focus on sampling error alone. Instead we use design criteria that account as well for error due to bias engendered by the model misspecification. Our robust designs optimize the average of a function of the sampling error and bias error over a specified misspecification neighbourhood. Examples of robust designs for logistic models are presented, including a case study implementing the methodologies using beetle mortality data.
14.
《Journal of Statistical Computation and Simulation》2012,82(1-3):165-175
In this paper we consider the problem of estimating the locations of several normal populations when an order relation between them is known to be true. We compare the maximum likelihood estimator, the M-estimators based on Huber’s ψ function, a robust weighted likelihood estimator, the Gastwirth estimator and the trimmed mean estimator. A Monte-Carlo study illustrates the performance of the methods considered.
15.
Matthias Kohl, Peter Ruckdeschel & Helmut Rieder 《Statistical Methods and Applications》2010,19(3):333-354
The aim of the paper is to give a coherent account of the robustness approach based on shrinking neighborhoods in the case of i.i.d. observations, and add some theoretical complements. An important aspect of the approach is that it does not require any particular model structure but covers arbitrary parametric models if only smoothly parametrized. In the meantime, equal generality has been achieved by object-oriented implementation of the optimally robust estimators. Exponential families constitute the main examples in this article. Not pretending a complete data analysis, we evaluate the robust estimates on real datasets from literature by means of our R packages ROptEst and RobLox.
16.
The second-order least-squares estimator (SLSE) was proposed by Wang (Statistica Sinica 13:1201–1210, 2003) for measurement error models. It was extended and applied to linear and nonlinear regression models by Abarin and Wang (Far East J Theor Stat 20:179–196, 2006) and Wang and Leblanc (Ann Inst Stat Math 60:883–900, 2008). The SLSE is asymptotically more efficient than the ordinary least-squares estimator if the error distribution has a nonzero third moment. However, it lacks robustness against outliers in the data. In this paper, we propose a robust second-order least squares estimator (RSLSE) against X-outliers. The RSLSE is highly efficient with high breakdown point and is asymptotically normally distributed. We compare the RSLSE with other estimators through a simulation study. Our results show that the RSLSE performs very well.
17.
Ruben Crevits 《Communications in Statistics - Simulation and Computation》2019,48(6):1694-1705
The model parameters of linear state space models are typically estimated with maximum likelihood estimation, where the likelihood is computed analytically with the Kalman filter. Outliers can deteriorate the estimation. Therefore we propose an alternative estimation method. The Kalman filter is replaced by a robust version and the maximum likelihood estimator is robustified as well. The performance of the robust estimator is investigated in a simulation study. Robust estimation of time varying parameter regression models is considered as a special case. Finally, the methodology is applied to real data.
18.
Douglas P. Wiens 《Revue canadienne de statistique》1996,24(1):67-79
We consider the problem of the sequential choice of design points in an approximately linear model. It is assumed that the fitted linear model is only approximately correct, in that the true response function contains a nonrandom, unknown term orthogonal to the fitted response. We also assume that the parameters are estimated by M-estimation. The goal is to choose the next design point in such a way as to minimize the resulting integrated squared bias of the estimated response, to order n^{-1}. Explicit applications to analysis of variance and regression are given. In a simulation study the sequential designs compare favourably with some fixed-sample-size designs which are optimal for the true response to which the sequential designs must adapt.
19.
《Journal of statistical planning and inference》2003,117(2):305-321
In this article, we consider robust designs for approximate polynomial regression models, by applying the theory of canonical moments. The design criterion, first given in Liu and Wiens (J. Statist. Planning Inference 64 (1997) 369), is to maximize the determinant of the information matrix subject to a side condition of bounding the bias arising from model misspecification. We give a new proof of, and extend, the main theorem in Liu and Wiens (op. cit.); in so doing we shed new light on the structure of this problem. New designs, with the further property of minimizing the generalized variance of the additional regression coefficients when an enlarged model is fitted, are derived and assessed. These provide additional robustness against uncertainty regarding the proper degree of the fitted polynomial response.
20.
《Journal of statistical planning and inference》2005,131(2):297-311
This article introduces adaptive weighted maximum likelihood estimators for binary regression models. The asymptotic distribution under the model is established, and asymptotic confidence intervals are derived. Finite-sample properties are studied by simulation. For clean datasets, the proposed adaptive estimators are more efficient than the non-adaptive ones even for moderate sample sizes, and for outlier-contaminated datasets they show a comparable robustness. As for the asymptotic confidence intervals, the actual coverage levels under the model are very close to the nominal levels (even for moderate sample sizes), and they are reasonably stable under contamination.