期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Robust estimation of the mean vector for high-dimensional data set using robust clustering

Hamid Shahriari 《Journal of applied statistics》2015,42(6):1183-1205

The first step in statistical analysis is the parameter estimation. In multivariate analysis, one of the parameters of interest to be estimated is the mean vector. In multivariate statistical analysis, it is usually assumed that the data come from a multivariate normal distribution. In this situation, the maximum likelihood estimator (MLE), that is, the sample mean vector, is the best estimator. However, when outliers exist in the data, the use of sample mean vector will result in poor estimation. So, other estimators which are robust to the existence of outliers should be used. The most popular robust multivariate estimator for estimating the mean vector is S-estimator with desirable properties. However, computing this estimator requires the use of a robust estimate of mean vector as a starting point. Usually minimum volume ellipsoid (MVE) is used as a starting point in computing S-estimator. For high-dimensional data computing, the MVE takes too much time. In some cases, this time is so large that the existing computers cannot perform the computation. In addition to the computation time, for high-dimensional data set the MVE method is not precise. In this paper, a robust starting point for S-estimator based on robust clustering is proposed which could be used for estimating the mean vector of the high-dimensional data. The performance of the proposed estimator in the presence of outliers is studied and the results indicate that the proposed estimator performs precisely and much better than some of the existing robust estimators for high-dimensional data. 相似文献

2.

Robust bootstrap estimates in heteroscedastic semi-varying coefficient models and applications in analyzing Australia CPI data

Yan-Yong Zhao Hong-Xia Wang 《统计学通讯:模拟与计算》2017,46(4):2638-2653

This article deals with the estimation of the parametric component, which is of primary interest, in the heteroscedastic semi-varying coefficient models. Based on the bootstrap technique, we present a procedure for estimating the parameters, which can provide a reliable approximation to the asymptotic distribution of the profile least-square (PLS) estimator. Furthermore, a bootstrap-type estimator of covariance matrix is developed, which is proved to be a consistent estimator of the covariance matrix. Moreover, some simulation experiments are conducted to evaluate the finite sample performance for the proposed methodology. Finally, the Australia CPI dataset is analyzed to demonstrate the application of the methods. 相似文献

3.

Control charts based on quasi-likelihood estimation for monitoring profiles

Chung-I Li 《Journal of Statistical Computation and Simulation》2018,88(3):457-470

In some applications, the quality of the process or product is characterized and summarized by a functional relationship between a response variable and one or more explanatory variables. Profile monitoring is a technique for checking the stability of the relationship over time. Existing linear profile monitoring methods usually assumed the error distribution to be normal. However, this assumption may not always be true in practice. To address this situation, we propose a method for profile monitoring under the framework of generalized linear models when the relationship between the mean and variance of the response variable is known. Two multivariate exponentially weighted moving average control schemes are proposed based on the estimated profile parameters obtained using a quasi-likelihood approach. The performance of the proposed methods is evaluated by simulation studies. Furthermore, the proposed method is applied to a real data set, and the R code for profile monitoring is made available to users. 相似文献

4.

Robust estimation of AR coefficients under simultaneously influencing outliers and missing values

Yuriy S. Kharin Valeriy A. Voloshko 《Journal of statistical planning and inference》2011,141(9):3276-3288

A family of robust estimators for coefficients of Gaussian AR(p) time series under simultaneously influencing distortions of two types: outliers and missing values, is proposed. The estimators are based on special properties of the Cauchy probability distribution; consistency and the asymptotic normality of these estimators are proven. An approximate solution of the problem of minimization of the asymptotic variance within the proposed family of estimators is found. Performance of the proposed estimators is illustrated for simulated time series and for real data sets. 相似文献

5.

Robust estimation for zero-inflated poisson autoregressive models based on density power divergence

Byungsoo Kim 《Journal of Statistical Computation and Simulation》2017,87(15):2981-2996

In this study, we consider a robust estimation for zero-inflated Poisson autoregressive models using the minimum density power divergence estimator designed by Basu et al. [Robust and efficient estimation by minimising a density power divergence. Biometrika. 1998;85:549–559]. We show that under some regularity conditions, the proposed estimator is strongly consistent and asymptotically normal. The performance of the estimator is evaluated through Monte Carlo simulations. A real data analysis using New South Wales crime data is also provided for illustration. 相似文献

6.

Robust estimation of the ratio of scale parameters for positive random variables

Russell F. Kappenman 《统计学通讯:理论与方法》2013,42(10):2983-2996

An estimator of the ratio of scale parameters of the distributions of two positive random variables is developed for the case where the only difference between the distributions is a difference in scale. Simulation studies demonstrate that the estimator performs much better, in terms of mean squared error, than the most popular one among those estimators currently available. 相似文献

7.

Robust online scale estimation in time series: A model-free approach

Sarah Gelper Karen Schettlinger Christophe Croux Ursula Gather 《Journal of statistical planning and inference》2009

This paper presents variance extraction procedures for univariate time series. The volatility of a times series is monitored allowing for non-linearities, jumps and outliers in the level. The volatility is measured using the height of triangles formed by consecutive observations of the time series. This idea was proposed by Rousseeuw and Hubert [1996. Regression-free and robust estimation of scale for bivariate data. Comput. Statist. Data Anal. 21, 67–85] in the bivariate setting. This paper extends their procedure to apply for online scale estimation in time series analysis. The statistical properties of the new methods are derived and finite sample properties are given. A financial and a medical application illustrate the use of the procedures. 相似文献

8.

Robust estimation for the coefficient of a first order autoregressive process

Jiin-Huarng Guo 《统计学通讯:理论与方法》2013,42(1):55-66

Nonparametric methods, Theil's method and Hussain's method have been applied to simple linear regression problems for estimating the slope of the regression line.We extend these methods and propose a robust estimator to estimate the coefficient of a first order autoregressive process under various distribution shapes, A simulation study to compare Theil's estimator, Hus-sain's estimator, the least squares estimator, and the proposed estimator is also presented. 相似文献

9.

Robust Estimation for Parameters of the Extended Burr Type III Distribution

Yeliz Mert Kantar Vural Yildirim 《统计学通讯:模拟与计算》2015,44(7):1901-1930

We consider various robust estimators for the extended Burr Type III (EBIII) distribution for complete data with outliers. The considered robust estimators are M-estimators, least absolute deviations, Theil, Siegel's repeated median, least trimmed squares, and least median of squares. Before we perform the aforementioned estimators for the EBIII, we adapt the quantiles method to the estimation of the shape parameter k of the EBIII. The simulation results show that the considered robust estimators generally outperform the existing estimation approaches for data with upper outliers, with certain of them retaining a relatively high degree of efficiency for small sample sizes. 相似文献

10.

Phase I monitoring and change point estimation of autocorrelated poisson regression profiles

M. R. Maleki A. R. Taheriyoun P. Castagliola 《统计学通讯:理论与方法》2018,47(24):5885-5903

The objective of this paper is to study the Phase I monitoring and change point estimation of autocorrelated Poisson profiles where the response values within each profile are autocorrelated. Two charts, the SLRT and the Hotelling's T², are proposed along with an algorithm for parameter estimation. The detecting power of the proposed charts is compared using simulations in terms of the signal probability criterion. The performance of the SLRT method in estimating the change point in the regression parameters is also evaluated. Moreover, a real data example is presented to illustrate the application of the methods. 相似文献

11.

Robust estimates of ordered means in normal models

《Journal of Statistical Computation and Simulation》2012,82(1-3):165-175

In this paper we consider the problem of estimating the locations of several normal populations when an order relation between them is known to be true. We compare the maximum likelihood estimator, the M-estimators based on Huber’s ψ function, a robust weighted likelihood estimator, the Gastworth estimator and the trimmed mean estimator. A Monte-Carlo study illustrates the performance of the methods considered. 相似文献

12.

Robust hierarchical Bayes estimation of exchangeable means

Jean-Franois Angers James O. Berger 《Revue canadienne de statistique》1991,19(1):39-56

Estimation of the mean of a multivariate normal distribution is considered. The components of the mean vector θ are assumed to be exchangeable; this is modelled in a hierarchical fashion with independent Cauchy distributions as the first-stage prior. The resulting generalized Bayes estimator is calculated and shown to be robust with respect to the presence of outlying means. Alternative estimators that have similar behaviour but are cheaper to compute are also derived. 相似文献

13.

基于稳健MM估计的统计数据质量评估方法 总被引：1，自引：1，他引：1

下载免费PDF全文

卢二坡黄炳艺《统计研究》2010,27(12):16-22

政府统计数据质量是当前各界关注的热点问题,如何采用严谨的诊断方法,对我国统计数据进行科学的评估具有重要的现实意义。稳健回归方法可使求出的回归估计不受异常值的强烈影响,并且能更好的识别异常点。本文首次运用基于稳健MM估计的异常值诊断方法,在生产函数模型的框架下,分别使用两种不同的劳动投入数据,对改革以来我国GDP数据质量进行了评估。结果表明,基于稳健MM估计的异常值诊断方法可有效的解决传统方法容易出现的多个异常点的掩盖现象,改革以来我国的GDP数据是相对可靠的。相似文献

14.

Phase I monitoring of generalized linear model-based regression profiles

《Journal of Statistical Computation and Simulation》2012,82(14):2839-2859

In some industrial applications, the quality of a process or product is characterized by a relationship between the response variable and one or more independent variables which is called as profile. There are many approaches for monitoring different types of profiles in the literature. Most researchers assume that the response variable follows a normal distribution. However, this assumption may be violated in many cases. The most likely situation is when the response variable follows a distribution from generalized linear models (GLMs). For example, when the response variable is the number of defects in a certain area of a product, the observations follow Poisson distribution and ignoring this fact will cause misleading results. In this paper, three methods including a T²-based method, likelihood ratio test (LRT) method and F method are developed and modified in order to be applied in monitoring GLM regression profiles in Phase I. The performance of the proposed methods is analysed and compared for the special case that the response variable follows Poisson distribution. A simulation study is done regarding the probability of the signal criterion. Results show that the LRT method performs better than two other methods and the F method performs better than the T²-based method in detecting either small or large step shifts as well as drifts. Moreover, the F method performs better than the other two methods, and the LRT method performs poor in comparison with the F and T²-based methods in detecting outliers. A real case, in which the size and number of agglomerates ejected from a volcano in successive days form the GLM profile, is illustrated and the proposed methods are applied to determine whether the number of agglomerates of each size is under statistical control or not. Results showed that the proposed methods could handle the mentioned situation and distinguish the out-of-control conditions. 相似文献

15.

Robust estimation of error scale in nonparametric regression models

Isabella Rodica Ghement Marcelo Ruiz Ruben Zamar 《Journal of statistical planning and inference》2008

相似文献

16.

Robust estimation for order of hidden Markov models based on density power divergences

《Journal of Statistical Computation and Simulation》2012,82(5):503-512

In this paper, we study the robust estimation for the order of hidden Markov model (HMM) based on a penalized minimum density power divergence estimator, which is obtained by utilizing the finite mixture marginal distribution of HMM. For this task, we adopt the locally conic parametrization method used in [D. Dacunha-Castelle and E. Gassiate, Testing in locally conic models and application to mixture models. ESAIM Probab. Stat. (1997), pp. 285–317; D. Dacunha-Castelle and E. Gassiate, Testing the order of a model using locally conic parametrization: population mixtures and stationary arma processes, Ann. Statist. 27 (1999), pp. 1178–1209; T. Lee and S. Lee, Robust and consistent estimation of the order of finite mixture models based on the minimizing a density power divergence estimator, Metrika 68 (2008), pp. 365–390] to avoid the difficulties that arise in handling mixture marginal models, such as the non-identifiability of the parameter space and the singularity problem with the asymptotic variance. We verify that the estimated order is consistent and simulation results are provided for illustration. 相似文献

17.

On the choice of co-ordinates in simultaneous estimation of normal means under misspecification of normal priors

Dipak K. Dey 《统计学通讯:理论与方法》2013,42(6):661-673

The problem of choice of coordinates in Stein-type estimators,when simultaneously estimating normal means, is considered. The question of deciding whether to use all coordinates in one combined shrinkage estimators or to separate into groups and use separate shrinkage estimators on each group is considered in the situation in which part of the prior information may be " misspecified". It is observed that the amount of misspecification determines whether to use the combined shrinkage estimator the separate shrinkage estimator. 相似文献

18.

Addressing the effect of parameter estimation on phase II monitoring of multivariate multiple linear profiles via a new cluster-based approach

Ahmad Ahmadi Yazdi Ali Zeinal Hamadani 《统计学通讯:理论与方法》2020,49(17):4117-4132

Abstract

Profile monitoring is applied when the quality of a product or a process can be determined by the relationship between a response variable and one or more independent variables. In most Phase II monitoring approaches, it is assumed that the process parameters are known. However, it is obvious that this assumption is not valid in many real-world applications. In fact, the process parameters should be estimated based on the in-control Phase I samples. In this study, the effect of parameter estimation on the performance of four Phase II control charts for monitoring multivariate multiple linear profiles is evaluated. In addition, since the accuracy of the parameter estimation has a significant impact on the performance of Phase II control charts, a new cluster-based approach is developed to address this effect. Moreover, we evaluate and compare the performance of the proposed approach with a previous approach in terms of two metrics, average of average run length and its standard deviation, which are used for considering practitioner-to-practitioner variability. In this approach, it is not necessary to know the distribution of the chart statistic. Therefore, in addition to ease of use, the proposed approach can be applied to other type of profiles. The superior performance of the proposed method compared to the competing one is shown in terms of all metrics. Based on the results obtained, our method yields less bias with small-variance Phase I estimates compared to the competing approach. 相似文献

19.

A two-step robust estimation of the process mean using M-estimator

Hamid Shahriari Orod Ahmadi Amir H. Shokouhi 《Journal of applied statistics》2011,38(6):1289-1301

Parameter estimation is the first step in constructing control charts. One of these parameters is the process mean. The classical estimators of the process mean are sensitive to the presence of outlying data and subgroups which contaminate the whole data. In existing robust estimators for the process mean, the effects of the presence of the individual outliers are being considered, while, in this paper, a robust estimator is being proposed to reduce the effect of outlying subgroups as well as the individual outliers within a subgroup. The proposed estimator was compared with some classical and robust estimators of the process mean. Although, its relative efficiency is fourth among the estimators tested, its robustness and efficiency are large when the outlying subgroups are present. Evaluation of the results indicated that the proposed estimator is less sensitive to the presence of outliers and the process mean performs well when there are no individual outliers or outlying subgroups. 相似文献

20.

Robust wavelet shrinkage using robust selection of thresholds

Hee-Seok Oh Donghoh Kim Yongdai Kim 《Statistics and Computing》2009,19(1):27-34

This paper considers the problem of selecting a robust threshold of wavelet shrinkage. Previous approaches reported in literature to handle the presence of outliers mainly focus on developing a robust procedure for a given threshold; this is related to solving a nontrivial optimization problem. The drawback of this approach is that the selection of a robust threshold, which is crucial for the resulting fit is ignored. This paper points out that the best fit can be achieved by a robust wavelet shrinkage with a robust threshold. We propose data-driven selection methods for a robust threshold. These approaches are based on a coupling of classical wavelet thresholding rules with pseudo data. The concept of pseudo data has influenced the implementation of the proposed methods, and provides a fast and efficient algorithm. Results from a simulation study and a real example demonstrate the promising empirical properties of the proposed approaches. 相似文献