期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

AN ADAPTIVE TRIMMED LIKELIHOOD ALGORITHM FOR IDENTIFICATION OF MULTIVARIATE OUTLIERS

Brenton R. Clarke Daniel D. Schubert 《Australian & New Zealand Journal of Statistics》2006,48(3):353-371

This article describes an algorithm for the identification of outliers in multivariate data based on the asymptotic theory for location estimation as described typically for the trimmed likelihood estimator and in particular for the minimum covariance determinant estimator. The strategy is to choose a subset of the data which minimizes an appropriate measure of the asymptotic variance of the multivariate location estimator. Observations not belonging to this subset are considered potential outliers which should be trimmed. For α less than about 0.5, the correct trimming proportion is taken to be that α > 0 for which the minimum of any minima of this measure of the asymptotic variance occurs. If no minima occur for an α > 0 then the data set will be considered outlier free. 相似文献

2.

Two-Stage Welsh's Trimmed Mean for the Simultaneous Equations Model

Lin-An Chen Kuo-Yuan Liang & Chwen-Chi Liu 《Australian & New Zealand Journal of Statistics》2001,43(4):481-492

This paper discusses the large sample theory of the two-stage Welsh's trimmed mean for the limited information simultaneous equations model. Besides having asymptotic normality, this trimmed mean, as the two-stage least squares estimator, is a generalized least squares estimator. It also acts as a robust Aitken estimator for the simultaneous equations model. Examples illustrate real data analysis and large sample inferences based on this trimmed mean. 相似文献

3.

Asymptotics for an Adaptive Trimmed Likelihood Location Estimator

Tadeusz Bednarski Brenton R. Clarke 《Statistics》2013,47(1):1-8

An asymptotic normality result is given for an adaptive trimmed likelihood estimator of location, which parallels the asymptotic normality result for the adaptive trimmed mean. The new result comes out of studying the adaptive trimmed likelihood estimator modelled parametrically by a normal family but then examining the behavior when the underlying distribution is in fact some F different from normal. The asymptotic variance of the adaptive estimator is equal to the asymptotic variance of the trimmed likelihood estimator at the optimal trimming proportion for the distribution F, subject to that trimming proportion being positive and F being suitably smooth. 相似文献

4.

Robust weighted one-way ANOVA: Improved approximation and efficiency

Elena Kulinskaya Michael B. Dollinger 《Journal of statistical planning and inference》2007

A robust test for the one-way ANOVA model under heteroscedasticity is developed in this paper. The data are assumed to be symmetrically distributed, apart from some outliers, although the assumption of normality may be violated. The test statistic to be used is a weighted sum of squares similar to the Welch [1951. On the comparison of several mean values: an alternative approach. Biometrika 38, 330-336.] test statistic, but any of a variety of robust measures of location and scale for the populations of interest may be used instead of the usual mean and standard deviation. Under the commonly occurring condition that the robust measures of location and scale are asymptotically normal, we derive approximations to the distribution of the test statistic under the null hypothesis and to its distribution under alternative hypotheses. An expression for relative efficiency is derived, thus allowing comparison of the efficiency of the test as a function of the choice of the location and scale estimators used in the test statistic. As an illustration of the theory presented here, we apply it to three commonly used robust location–scale estimator pairs: the trimmed mean with the Winsorized standard deviation; the Huber Proposal 2 estimator pair; and the Hampel robust location estimator with the median absolute deviation. 相似文献

5.

SMALL SAMPLE BIAS CORRECTION FOR HUBER'S PROPOSAL-2 SCALE M-ESTIMATOR

Brenton R. Clarke Christopher J. Milne 《Australian & New Zealand Journal of Statistics》2004,46(4):649-656

The most popular and perhaps universal estimator of location and scale in robust estimation, where the population is normal with possible small departures, is Huber's Proposal‐2 M‐estimator. This paper gives the first‐order small sample bias correction for the scale estimator, verifying the calculation through theory and simulation. Other ways of reducing small sample bias, say by jackknifing or bootstrapping, can be computationally intensive, and would not be routinely used with this iteratively derived estimator. It is suggested that bias‐reduced estimates of scale are most useful when forming confidence intervals for location and/or scale based on the asymptotic distribution. 相似文献

6.

Trimmed least squares estimator as best trimmed linear conditional estimator for linear regression model

Lin-An Chen Peter Thompson 《统计学通讯:理论与方法》2013,42(7):1835-1849

A class of trimmed linear conditional estimators based on regression quantiles for the linear regression model is introduced. This class serves as a robust analogue of non-robust linear unbiased estimators. Asymptotic analysis then shows that the trimmed least squares estimator based on regression quantiles ( Koenker and Bassett ( 1978 ) ) is the best in this estimator class in terms of asymptotic covariance matrices. The class of trimmed linear conditional estimators contains the Mallows-type bounded influence trimmed means ( see De Jongh et al ( 1988 ) ) and trimmed instrumental variables estimators. A large sample methodology based on trimmed instrumental variables estimator for confidence ellipsoids and hypothesis testing is also provided. 相似文献

7.

A Robust and Almost Fully Efficient M-Estimator

Ömer Öztürk 《Australian & New Zealand Journal of Statistics》1998,40(4):415-424

Simultaneous robust estimates of location and scale parameters are derived from a class of M-estimating equations. A coefficient p ( p > 0), which plays a role similar to that of a tuning constant in the theory of M-estimation, determines the estimating equations. These estimating equations may be obtained as the gradient of a strictly convex criterion function. This article shows that the estimators are uniquely defined, asymptotically bi-variate normal and have positive breakdown for some choices of p . When p = 0.12 and p = 0.3, the estimators are almost fully efficient for normal and exponential distributions: efficiencies with respect to the maximum likelihood estimators are 1.00 and 0.99, respectively. It is shown that the location estimator for known scale has the maximum breakdown point 0.5 independent of p , when the target model is symmetric. Also it is shown that the scale estimator has a positive breakdown point which depends on the choice of p . A simulation study finds that the proposed location estimator has smaller variance than the Hodges–Lehmann estimator, Huber's minimax and bisquare M-estimators. 相似文献

8.

Trimmed Mean Isotonic Regression

下载免费PDF全文

Subhra sankar Dhar 《Scandinavian Journal of Statistics》2016,43(1):202-212

The trimmed mean is well‐known in literature for being more robust and for having better efficiency than the sample mean when data is generated from heavy‐tailed distributions. In this article, the trimmed mean in the isotonic regression setup is proposed, and the asymptotic as well as the robustness properties of the estimator are studied. The usefulness of the proposed estimator is illustrated using different real and simulated data. Further, the performance of the estimator is compared with that of the mean and the median isotonic regression estimators. 相似文献

9.

Investigation of the performance of trimmed estimators of life time distributions with censoring

下载免费PDF全文

Brenton R. Clarke Alexandra Höller Christine H. Müller Karuru Wamahiu 《Australian & New Zealand Journal of Statistics》2017,59(4):513-525

For the lifetime (or negative) exponential distribution, the trimmed likelihood estimator has been shown to be explicit in the form of a β‐trimmed mean which is representable as an estimating functional that is both weakly continuous and Fréchet differentiable and hence qualitatively robust at the parametric model. It also has high efficiency at the model. The robustness is in contrast to the maximum likelihood estimator (MLE) involving the usual mean which is not robust to contamination in the upper tail of the distribution. When there is known right censoring, it may be perceived that the MLE which is the most asymptotically efficient estimator may be protected from the effects of ‘outliers’ due to censoring. We demonstrate that this is not the case generally, and in fact, based on the functional form of the estimators, suggest a hybrid defined estimator that incorporates the best features of both the MLE and the β‐trimmed mean. Additionally, we study the pure trimmed likelihood estimator for censored data and show that it can be easily calculated and that the censored observations are not always trimmed. The different trimmed estimators are compared by a modest simulation study. 相似文献

10.

Optimally robust estimators in generalized Pareto models

Peter Ruckdeschel Nataliya Horbenko 《Statistics》2013,47(4):762-791

In this paper, we study the robustness properties of several procedures for the joint estimation of shape and scale in a generalized Pareto model. The estimators that we primarily focus upon, most bias robust estimator (MBRE) and optimal MSE-robust estimator (OMSE), are one-step estimators distinguished as optimally robust in the shrinking neighbourhood setting; that is, they minimize the maximal bias, respectively, on such a specific neighbourhood, the maximal mean squared error (MSE). For their initialization, we propose a particular location–dispersion estimator, MedkMAD, which matches the population median and kMAD (an asymmetric variant of the median of absolute deviations) against the empirical counterparts. These optimally robust estimators are compared to the maximum-likelihood, skipped maximum-likelihood, Cramér–von-Mises minimum distance, method-of-medians, and Pickands estimators. To quantify their deviation from robust optimality, for each of these suboptimal estimators, we determine the finite-sample breakdown point and the influence function, as well as the statistical accuracy measured by asymptotic bias, variance, and MSE – all evaluated uniformly on shrinking neighbourhoods. These asymptotic findings are complemented by an extensive simulation study to assess the finite-sample behaviour of the considered procedures. The applicability of the procedures and their stability against outliers are illustrated for the Danish fire insurance data set from the package evir. 相似文献

11.

The influence function of penalized regression estimators

Viktoria Öllerer Christophe Croux Andreas Alfons 《Statistics》2015,49(4):741-765

To perform regression analysis in high dimensions, lasso or ridge estimation are a common choice. However, it has been shown that these methods are not robust to outliers. Therefore, alternatives as penalized M-estimation or the sparse least trimmed squares (LTS) estimator have been proposed. The robustness of these regression methods can be measured with the influence function. It quantifies the effect of infinitesimal perturbations in the data. Furthermore, it can be used to compute the asymptotic variance and the mean-squared error (MSE). In this paper we compute the influence function, the asymptotic variance and the MSE for penalized M-estimators and the sparse LTS estimator. The asymptotic biasedness of the estimators make the calculations non-standard. We show that only M-estimators with a loss function with a bounded derivative are robust against regression outliers. In particular, the lasso has an unbounded influence function. 相似文献

12.

Bias bound for the minimax estimator

Claudio Agostinelli 《Journal of statistical planning and inference》2009

The bias bound function of an estimator is an important quantity in order to perform globally robust inference. We show how to evaluate the exact bias bound for the minimax estimator of the location parameter for a wide class of unimodal symmetric location and scale family. We show, by an example, how to obtain an upper bound of the bias bound for a unimodal asymmetric location and scale family. We provide the exact bias bound of the minimum distance/disparity estimators under a contamination neighborhood generated from the same distance. 相似文献

13.

Asymptotic Theory of Outlier Detection Algorithms for Linear Time Series Regression Models

Søren Johansen Bent Nielsen 《Scandinavian Journal of Statistics》2016,43(2):321-348

Outlier detection algorithms are intimately connected with robust statistics that down‐weight some observations to zero. We define a number of outlier detection algorithms related to the Huber‐skip and least trimmed squares estimators, including the one‐step Huber‐skip estimator and the forward search. Next, we review a recently developed asymptotic theory of these. Finally, we analyse the gauge, the fraction of wrongly detected outliers, for a number of outlier detection algorithms and establish an asymptotic normal and a Poisson theory for the gauge. 相似文献

14.

Robust estimates of ordered means in normal models

《Journal of Statistical Computation and Simulation》2012,82(1-3):165-175

In this paper we consider the problem of estimating the locations of several normal populations when an order relation between them is known to be true. We compare the maximum likelihood estimator, the M-estimators based on Huber’s ψ function, a robust weighted likelihood estimator, the Gastworth estimator and the trimmed mean estimator. A Monte-Carlo study illustrates the performance of the methods considered. 相似文献

15.

Bootstrap adaptive estimation: The trimmed-mean example

Christian Lger Joseph P. Romano 《Revue canadienne de statistique》1990,18(4):297-314

We consider the problem of choosing among a class of possible estimators by selecting the estimator with the smallest bootstrap estimate of finite sample variance. This is an alternative to using cross-validation to choose an estimator adaptively. The problem of a confidence interval based on such an adaptive estimator is considered. We illustrate the ideas by applying the method to the problem of choosing the trimming proportion of an adaptive trimmed mean. It is shown that a bootstrap adaptive trimmed mean is asymptotically normal with an asymptotic variance equal to the smallest among trimmed means. The asymptotic coverage probability of a bootstrap confidence interval based on such adaptive estimators is shown to have the nominal level. The intervals based on the asymptotic normality of the estimator share the same asymptotic result, but have poor small-sample properties compared to the bootstrap intervals. A small-sample simulation demonstrates that bootstrap adaptive trimmed means adapt themselves rather well even for samples of size 10. 相似文献

16.

Simultaneous robust estimation of location and scale parameters: A minimum-distance approach

mer ztürk Thomas P. Hettmansperger 《Revue canadienne de statistique》1998,26(2):217-229

Simultaneous robust estimates of location and scale parameters are derived from minimizing a minimum-distance criterion function. The criterion function measures the squared distance between the pth power (p > 0) of the empirical distribution function and the pth power of the imperfectly determined model distribution function over the real line. We show that the estimator is uniquely defined, is asymptotically bivariate normal and for p > 0.3 has positive breakdown. If the scale parameter is known, when p = 0.9 the asymptotic variance (1.0436) of the location estimator for the normal model is smaller than the asymptotic variance of the Hodges-Lehmann (HL)estimator (1.0472). Efficiencies with respect to HL and maximum-likelihood estimators (MLE) are 1.0034 and 0.9582, respectively. Similarly, if the location parameter is known, when p = 0.97 the asymptotic variance (0.6158) of the scale estimator is minimum. The efficiency with respect to the MLE is 0.8119. We show that the estimator can tolerate more corrupted observations at oo than at – for p < 1, and vice versa for p > 1. 相似文献

17.

Estimation of Pr( x<y ) in the exponential case with common location parameter

D.S. Bai Y.W. Hong 《统计学通讯:理论与方法》2013,42(1):269-282

This paper considers the problem of estimating the probability P = Pr(X < Y) when X and Y are independent exponential random variables with unequal scale parameters and a common location parameter. Uniformly minimum variance unbiased estimator of P is obtained. The asymptotic distribution of the maximum likelihood estimator is obtained and then the asymptotic equivalence of the two estimators is established. Performance of the two estimators for moderate sample sizes is studied by Monte Carlo simulation. An approximate interval estimator is also obtained. 相似文献

18.

Comparisons of asymptotic biases and variances of m-estimators of scale under asymmetric contamination

John R Collins Boll Wu 《统计学通讯:理论与方法》2013,42(7):1791-1810

We study robustness properties of two types of M-estimators of scale when both location and scale parameters are unknown: (i) the scale estimator arising from simultaneous M-estimation of location and scale; and (ii) its symmetrization about the sample median. The robustness criteria considered are maximal asymptotic bias and maximal asymptotic variance when the known symmetric unimodal error distribution is subject to unknown, possibly asymmetric, £-con-tamination. Influence functions and asymptotic variance functionals are derived, and computations of asymptotic biases and variances, under the normal distribution with ε-contamination at oo, are presented for the special subclass arising from Huber's Proposal 2 and its symmetrized version. Symmetrization is seen to reduce both asymptotic bias and variance. Some complementary theoretical results are obtained, and the tradeoff between asymptotic bias and variance is discussed. 相似文献

19.

Quantile Regression for Location‐Scale Time Series Models with Conditional Heteroscedasticity

Jungsik Noh Sangyeol Lee 《Scandinavian Journal of Statistics》2016,43(3):700-720

This paper considers quantile regression for a wide class of time series models including autoregressive and moving average (ARMA) models with asymmetric generalized autoregressive conditional heteroscedasticity errors. The classical mean‐variance models are reinterpreted as conditional location‐scale models so that the quantile regression method can be naturally geared into the considered models. The consistency and asymptotic normality of the quantile regression estimator is established in location‐scale time series models under mild conditions. In the application of this result to ARMA‐generalized autoregressive conditional heteroscedasticity models, more primitive conditions are deduced to obtain the asymptotic properties. For illustration, a simulation study and a real data analysis are provided. 相似文献

20.

Comparison of estimation methods for the finite population mean in simple random sampling: symmetric super-populations

Arzu Altin Yavuz Birdal Senoglu 《Journal of applied statistics》2011,38(6):1277-1288

In this paper, a new estimator combined estimator (CE) is proposed for estimating the finite population mean ¯ Y _N in simple random sampling assuming a long-tailed symmetric super-population model. The efficiency and robustness properties of the CE is compared with the widely used and well-known estimators of the finite population mean ¯ Y _N by Monte Carlo simulation. The parameter estimators considered in this study are the classical least squares estimator, trimmed mean, winsorized mean, trimmed L-mean, modified maximum-likelihood estimator, Huber estimator (W24) and the non-parametric Hodges–Lehmann estimator. The mean square error criteria are used to compare the performance of the estimators. We show that the CE is overall more efficient than the other estimators. The CE is also shown to be more robust for estimating the finite population mean ¯ Y _N, since it is insensitive to outliers and to misspecification of the distribution. We give a real life example. 相似文献