首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
This article describes an algorithm for the identification of outliers in multivariate data based on the asymptotic theory for location estimation as described typically for the trimmed likelihood estimator and in particular for the minimum covariance determinant estimator. The strategy is to choose a subset of the data which minimizes an appropriate measure of the asymptotic variance of the multivariate location estimator. Observations not belonging to this subset are considered potential outliers which should be trimmed. For α less than about 0.5, the correct trimming proportion is taken to be that α > 0 for which the minimum of any minima of this measure of the asymptotic variance occurs. If no minima occur for an α > 0 then the data set will be considered outlier free.  相似文献   

2.
This paper discusses the large sample theory of the two-stage Welsh's trimmed mean for the limited information simultaneous equations model. Besides having asymptotic normality, this trimmed mean, as the two-stage least squares estimator, is a generalized least squares estimator. It also acts as a robust Aitken estimator for the simultaneous equations model. Examples illustrate real data analysis and large sample inferences based on this trimmed mean.  相似文献   

3.
An asymptotic normality result is given for an adaptive trimmed likelihood estimator of location, which parallels the asymptotic normality result for the adaptive trimmed mean. The new result comes out of studying the adaptive trimmed likelihood estimator modelled parametrically by a normal family but then examining the behavior when the underlying distribution is in fact some F different from normal. The asymptotic variance of the adaptive estimator is equal to the asymptotic variance of the trimmed likelihood estimator at the optimal trimming proportion for the distribution F, subject to that trimming proportion being positive and F being suitably smooth.  相似文献   

4.
A robust test for the one-way ANOVA model under heteroscedasticity is developed in this paper. The data are assumed to be symmetrically distributed, apart from some outliers, although the assumption of normality may be violated. The test statistic to be used is a weighted sum of squares similar to the Welch [1951. On the comparison of several mean values: an alternative approach. Biometrika 38, 330-336.] test statistic, but any of a variety of robust measures of location and scale for the populations of interest may be used instead of the usual mean and standard deviation. Under the commonly occurring condition that the robust measures of location and scale are asymptotically normal, we derive approximations to the distribution of the test statistic under the null hypothesis and to its distribution under alternative hypotheses. An expression for relative efficiency is derived, thus allowing comparison of the efficiency of the test as a function of the choice of the location and scale estimators used in the test statistic. As an illustration of the theory presented here, we apply it to three commonly used robust location–scale estimator pairs: the trimmed mean with the Winsorized standard deviation; the Huber Proposal 2 estimator pair; and the Hampel robust location estimator with the median absolute deviation.  相似文献   

5.
The most popular and perhaps universal estimator of location and scale in robust estimation, where the population is normal with possible small departures, is Huber's Proposal‐2 M‐estimator. This paper gives the first‐order small sample bias correction for the scale estimator, verifying the calculation through theory and simulation. Other ways of reducing small sample bias, say by jackknifing or bootstrapping, can be computationally intensive, and would not be routinely used with this iteratively derived estimator. It is suggested that bias‐reduced estimates of scale are most useful when forming confidence intervals for location and/or scale based on the asymptotic distribution.  相似文献   

6.
A class of trimmed linear conditional estimators based on regression quantiles for the linear regression model is introduced. This class serves as a robust analogue of non-robust linear unbiased estimators. Asymptotic analysis then shows that the trimmed least squares estimator based on regression quantiles ( Koenker and Bassett ( 1978 ) ) is the best in this estimator class in terms of asymptotic covariance matrices. The class of trimmed linear conditional estimators contains the Mallows-type bounded influence trimmed means ( see De Jongh et al ( 1988 ) ) and trimmed instrumental variables estimators. A large sample methodology based on trimmed instrumental variables estimator for confidence ellipsoids and hypothesis testing is also provided.  相似文献   

7.
Simultaneous robust estimates of location and scale parameters are derived from a class of M-estimating equations. A coefficient p ( p > 0), which plays a role similar to that of a tuning constant in the theory of M-estimation, determines the estimating equations. These estimating equations may be obtained as the gradient of a strictly convex criterion function. This article shows that the estimators are uniquely defined, asymptotically bi-variate normal and have positive breakdown for some choices of p . When p = 0.12 and p = 0.3, the estimators are almost fully efficient for normal and exponential distributions: efficiencies with respect to the maximum likelihood estimators are 1.00 and 0.99, respectively. It is shown that the location estimator for known scale has the maximum breakdown point 0.5 independent of p , when the target model is symmetric. Also it is shown that the scale estimator has a positive breakdown point which depends on the choice of p . A simulation study finds that the proposed location estimator has smaller variance than the Hodges–Lehmann estimator, Huber's minimax and bisquare M-estimators.  相似文献   

8.
The trimmed mean is well‐known in literature for being more robust and for having better efficiency than the sample mean when data is generated from heavy‐tailed distributions. In this article, the trimmed mean in the isotonic regression setup is proposed, and the asymptotic as well as the robustness properties of the estimator are studied. The usefulness of the proposed estimator is illustrated using different real and simulated data. Further, the performance of the estimator is compared with that of the mean and the median isotonic regression estimators.  相似文献   

9.
For the lifetime (or negative) exponential distribution, the trimmed likelihood estimator has been shown to be explicit in the form of a β‐trimmed mean which is representable as an estimating functional that is both weakly continuous and Fréchet differentiable and hence qualitatively robust at the parametric model. It also has high efficiency at the model. The robustness is in contrast to the maximum likelihood estimator (MLE) involving the usual mean which is not robust to contamination in the upper tail of the distribution. When there is known right censoring, it may be perceived that the MLE which is the most asymptotically efficient estimator may be protected from the effects of ‘outliers’ due to censoring. We demonstrate that this is not the case generally, and in fact, based on the functional form of the estimators, suggest a hybrid defined estimator that incorporates the best features of both the MLE and the β‐trimmed mean. Additionally, we study the pure trimmed likelihood estimator for censored data and show that it can be easily calculated and that the censored observations are not always trimmed. The different trimmed estimators are compared by a modest simulation study.  相似文献   

10.
In this paper, we study the robustness properties of several procedures for the joint estimation of shape and scale in a generalized Pareto model. The estimators that we primarily focus upon, most bias robust estimator (MBRE) and optimal MSE-robust estimator (OMSE), are one-step estimators distinguished as optimally robust in the shrinking neighbourhood setting; that is, they minimize the maximal bias, respectively, on such a specific neighbourhood, the maximal mean squared error (MSE). For their initialization, we propose a particular location–dispersion estimator, MedkMAD, which matches the population median and kMAD (an asymmetric variant of the median of absolute deviations) against the empirical counterparts. These optimally robust estimators are compared to the maximum-likelihood, skipped maximum-likelihood, Cramér–von-Mises minimum distance, method-of-medians, and Pickands estimators. To quantify their deviation from robust optimality, for each of these suboptimal estimators, we determine the finite-sample breakdown point and the influence function, as well as the statistical accuracy measured by asymptotic bias, variance, and MSE – all evaluated uniformly on shrinking neighbourhoods. These asymptotic findings are complemented by an extensive simulation study to assess the finite-sample behaviour of the considered procedures. The applicability of the procedures and their stability against outliers are illustrated for the Danish fire insurance data set from the package evir.  相似文献   

11.
To perform regression analysis in high dimensions, lasso or ridge estimation are a common choice. However, it has been shown that these methods are not robust to outliers. Therefore, alternatives as penalized M-estimation or the sparse least trimmed squares (LTS) estimator have been proposed. The robustness of these regression methods can be measured with the influence function. It quantifies the effect of infinitesimal perturbations in the data. Furthermore, it can be used to compute the asymptotic variance and the mean-squared error (MSE). In this paper we compute the influence function, the asymptotic variance and the MSE for penalized M-estimators and the sparse LTS estimator. The asymptotic biasedness of the estimators make the calculations non-standard. We show that only M-estimators with a loss function with a bounded derivative are robust against regression outliers. In particular, the lasso has an unbounded influence function.  相似文献   

12.
The bias bound function of an estimator is an important quantity in order to perform globally robust inference. We show how to evaluate the exact bias bound for the minimax estimator of the location parameter for a wide class of unimodal symmetric location and scale family. We show, by an example, how to obtain an upper bound of the bias bound for a unimodal asymmetric location and scale family. We provide the exact bias bound of the minimum distance/disparity estimators under a contamination neighborhood generated from the same distance.  相似文献   

13.
Outlier detection algorithms are intimately connected with robust statistics that down‐weight some observations to zero. We define a number of outlier detection algorithms related to the Huber‐skip and least trimmed squares estimators, including the one‐step Huber‐skip estimator and the forward search. Next, we review a recently developed asymptotic theory of these. Finally, we analyse the gauge, the fraction of wrongly detected outliers, for a number of outlier detection algorithms and establish an asymptotic normal and a Poisson theory for the gauge.  相似文献   

14.
In this paper we consider the problem of estimating the locations of several normal populations when an order relation between them is known to be true. We compare the maximum likelihood estimator, the M-estimators based on Huber’s ψ function, a robust weighted likelihood estimator, the Gastworth estimator and the trimmed mean estimator. A Monte-Carlo study illustrates the performance of the methods considered.  相似文献   

15.
We consider the problem of choosing among a class of possible estimators by selecting the estimator with the smallest bootstrap estimate of finite sample variance. This is an alternative to using cross-validation to choose an estimator adaptively. The problem of a confidence interval based on such an adaptive estimator is considered. We illustrate the ideas by applying the method to the problem of choosing the trimming proportion of an adaptive trimmed mean. It is shown that a bootstrap adaptive trimmed mean is asymptotically normal with an asymptotic variance equal to the smallest among trimmed means. The asymptotic coverage probability of a bootstrap confidence interval based on such adaptive estimators is shown to have the nominal level. The intervals based on the asymptotic normality of the estimator share the same asymptotic result, but have poor small-sample properties compared to the bootstrap intervals. A small-sample simulation demonstrates that bootstrap adaptive trimmed means adapt themselves rather well even for samples of size 10.  相似文献   

16.
Simultaneous robust estimates of location and scale parameters are derived from minimizing a minimum-distance criterion function. The criterion function measures the squared distance between the pth power (p > 0) of the empirical distribution function and the pth power of the imperfectly determined model distribution function over the real line. We show that the estimator is uniquely defined, is asymptotically bivariate normal and for p > 0.3 has positive breakdown. If the scale parameter is known, when p = 0.9 the asymptotic variance (1.0436) of the location estimator for the normal model is smaller than the asymptotic variance of the Hodges-Lehmann (HL)estimator (1.0472). Efficiencies with respect to HL and maximum-likelihood estimators (MLE) are 1.0034 and 0.9582, respectively. Similarly, if the location parameter is known, when p = 0.97 the asymptotic variance (0.6158) of the scale estimator is minimum. The efficiency with respect to the MLE is 0.8119. We show that the estimator can tolerate more corrupted observations at oo than at – for p < 1, and vice versa for p > 1.  相似文献   

17.
This paper considers the problem of estimating the probability P = Pr(X < Y) when X and Y are independent exponential random variables with unequal scale parameters and a common location parameter. Uniformly minimum variance unbiased estimator of P is obtained. The asymptotic distribution of the maximum likelihood estimator is obtained and then the asymptotic equivalence of the two estimators is established. Performance of the two estimators for moderate sample sizes is studied by Monte Carlo simulation. An approximate interval estimator is also obtained.  相似文献   

18.
We study robustness properties of two types of M-estimators of scale when both location and scale parameters are unknown: (i) the scale estimator arising from simultaneous M-estimation of location and scale; and (ii) its symmetrization about the sample median. The robustness criteria considered are maximal asymptotic bias and maximal asymptotic variance when the known symmetric unimodal error distribution is subject to unknown, possibly asymmetric, £-con-tamination. Influence functions and asymptotic variance functionals are derived, and computations of asymptotic biases and variances, under the normal distribution with ε-contamination at oo, are presented for the special subclass arising from Huber's Proposal 2 and its symmetrized version. Symmetrization is seen to reduce both asymptotic bias and variance. Some complementary theoretical results are obtained, and the tradeoff between asymptotic bias and variance is discussed.  相似文献   

19.
This paper considers quantile regression for a wide class of time series models including autoregressive and moving average (ARMA) models with asymmetric generalized autoregressive conditional heteroscedasticity errors. The classical mean‐variance models are reinterpreted as conditional location‐scale models so that the quantile regression method can be naturally geared into the considered models. The consistency and asymptotic normality of the quantile regression estimator is established in location‐scale time series models under mild conditions. In the application of this result to ARMA‐generalized autoregressive conditional heteroscedasticity models, more primitive conditions are deduced to obtain the asymptotic properties. For illustration, a simulation study and a real data analysis are provided.  相似文献   

20.
In this paper, a new estimator combined estimator (CE) is proposed for estimating the finite population mean ¯ Y N in simple random sampling assuming a long-tailed symmetric super-population model. The efficiency and robustness properties of the CE is compared with the widely used and well-known estimators of the finite population mean ¯ Y N by Monte Carlo simulation. The parameter estimators considered in this study are the classical least squares estimator, trimmed mean, winsorized mean, trimmed L-mean, modified maximum-likelihood estimator, Huber estimator (W24) and the non-parametric Hodges–Lehmann estimator. The mean square error criteria are used to compare the performance of the estimators. We show that the CE is overall more efficient than the other estimators. The CE is also shown to be more robust for estimating the finite population mean ¯ Y N , since it is insensitive to outliers and to misspecification of the distribution. We give a real life example.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号