20 similar articles found (search time: 31 ms)
Nonparametric and parametric estimators are combined to minimize the mean squared error among their linear combinations. The combined estimator is consistent and for large sample sizes has a smaller mean squared error than the nonparametric estimator when the parametric assumption is violated. If the parametric assumption holds, the combined estimator has a smaller MSE than the parametric estimator. Our simulation examples focus on mean estimation when data may follow a lognormal distribution, or can be a mixture with an exponential or a uniform distribution. Motivating examples illustrate possible application areas.

This paper discusses the sensitivity of the sequential normal-based triple sampling procedure for estimating the population mean to departures from normality. We assume that the underlying population has finite absolute sixth moment and find that asymptotically the behavior of the estimator and of the sample size depends on the skewness and kurtosis of the underlying distribution when using a squared error loss function with linear sampling cost. These results enable the effects of non-normality to be assessed easily, both qualitatively and quantitatively. We supplement our asymptotic results with a simulation experiment to study the performance of the estimator and the sample size in a range of conditions.

Searls in 1964 showed that when the coefficient of variation is known, the sample mean is dominated with respect to mean squared error by an improved estimator that makes use of that coefficient. In this article we illustrate that this is true for a general class of estimators. Expressions for the minimum mean squared error and the relative efficiency are given for general distributions. The improvement, as measured by relative efficiency, is seen to be independent of the form of the distribution.
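
As a hedged illustration of the abstract above, consider the simplest case: the scaled sample mean w * x_bar. The sketch below is not taken from the article. The function names are hypothetical, and it assumes i.i.d. observations with mean mu, standard deviation sigma, and a known coefficient of variation cv = sigma / mu. Under those assumptions the MSE-minimizing weight is w = 1 / (1 + cv^2 / n), and the resulting relative efficiency, 1 + cv^2 / n, depends only on cv and n, not on the shape of the distribution.

```python
def searls_weight(cv: float, n: int) -> float:
    """Weight w that minimizes the MSE of w * sample_mean when cv = sigma / mu is known."""
    return 1.0 / (1.0 + cv ** 2 / n)


def mse_scaled_mean(w: float, mu: float, sigma: float, n: int) -> float:
    """Exact MSE of w * sample_mean for i.i.d. data: variance plus squared bias."""
    variance = w ** 2 * sigma ** 2 / n
    squared_bias = (w - 1.0) ** 2 * mu ** 2
    return variance + squared_bias


def relative_efficiency(cv: float, n: int) -> float:
    """MSE(sample mean) / MSE(scaled mean at the optimal weight)."""
    return 1.0 + cv ** 2 / n


# Illustrative values only (assumed, not from the article).
mu, sigma, n = 10.0, 5.0, 4
cv = sigma / mu
w_opt = searls_weight(cv, n)
mse_plain = mse_scaled_mean(1.0, mu, sigma, n)      # ordinary sample mean
mse_searls = mse_scaled_mean(w_opt, mu, sigma, n)   # shrunken sample mean
```

With these inputs, `mse_plain / mse_searls` agrees with `relative_efficiency(cv, n)`, so the gain shrinks toward 1 as n grows and is largest for small samples with a large coefficient of variation.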

Based on right-censored data from a lifetime distribution $F$, a smooth nonparametric estimator of the quantile function $Q(p)$ is given by $Q_n(p) = h^{-1}\int_0^1 \hat{Q}_n(t)\,K((t-p)/h)\,dt$, where $\hat{Q}_n(p)$ denotes the product-limit quantile function. Extensive Monte Carlo simulations indicate that at fixed $p$ this kernel-type quantile estimator has smaller mean squared error than $\hat{Q}_n(p)$ for a range of values of the bandwidth $h$. A method of selecting an "optimal" bandwidth (in the sense of small estimated mean squared error) based on the bootstrap is investigated, yielding results consistent with the simulation study. The bootstrap is also used to obtain interval estimates for $Q(p)$ after determining the optimal value of $h$.

The author compares two estimators of a continuous, concave distribution function having support on the positive half line. In terms of samples from uniform distributions, he gives stochastic bounds for the pointwise and sup‐norm differences between the least concave majorant of the empirical distribution function and the underlying distribution function. He also offers evidence demonstrating the almost paradoxical result that the empirical distribution function is not as good an estimator as its least concave majorant in terms of sup‐norm error but a better pointwise estimator of the true distribution function in terms of mean squared error.

This paper proposes an adaptive estimator that is more precise than the ordinary least squares estimator if the distribution of random errors is skewed or has long tails. The adaptive estimates are computed using a weighted least squares approach with weights based on the lengths of the tails of the distribution of residuals. Smaller weights are assigned to those observations that have residuals in the tails of long-tailed distributions and larger weights are assigned to observations having residuals in the tails of short-tailed distributions. Monte Carlo methods are used to compare the performance of the proposed estimator and the performance of the ordinary least squares estimator. The estimates that were studied in this simulation include the difference between the means of two populations, the mean of a symmetric distribution, and the slope of a regression line. The adaptive estimators are shown to have lower mean squared errors than those for the ordinary least squares estimators for short-tailed, long-tailed, and skewed distributions, provided the sample size is at least 20. The ordinary least squares estimator has slightly lower mean squared error for normally distributed errors. The adaptive estimator is recommended for general use for studies having sample sizes of at least 20 observations unless the random errors are known to be normally distributed.

By assuming that the underlying distribution belongs to the domain of attraction of an extreme value distribution, one can extrapolate the data to a far tail region so that a rare event can be predicted. However, when the distribution is in the domain of attraction of a Gumbel distribution, the extrapolation is quite limited generally in comparison with a heavy tailed distribution. In view of this drawback, a Weibull tailed distribution has been studied recently. Some methods for choosing the sample fraction in estimating the Weibull tail coefficient and some bias reduction estimators have been proposed in the literature. In this paper, we show that the theoretical optimal sample fraction does not exist and a bias reduction estimator does not always produce a smaller mean squared error than a biased estimator. These are different from using a heavy tailed distribution. Further we propose a refined class of Weibull tailed distributions which are more useful in estimating high quantiles and extreme tail probabilities.

Necessary and sufficient conditions for a linear estimator to dominate another linear estimator of a location parameter under Pitman's criterion of comparison are discussed. Consequently it is demonstrated that a linear biased estimator cannot dominate a linear unbiased estimator under Pitman's criterion and that the sample mean is the Closest Linear Unbiased Estimator (CLUE). It is also shown that the ridge regression estimator with a known biasing constant cannot dominate the ordinary least squares estimator. If an estimator δ₁ dominates an estimator δ₂ in the average loss sense, then sufficient conditions are obtained under which δ₁ is also preferred over δ₂ under Pitman's criterion. Further we obtain sufficient conditions under which preference under Pitman's criterion will lead to preference in the mean squared error sense.

We derive analytic expressions for the biases of the maximum likelihood estimators of the scale parameter in the half-logistic distribution with known location, and of the location parameter when the latter is unknown. Using these expressions to bias-correct the estimators is highly effective, without adverse consequences for estimation mean squared error. The overall performance of the first of these bias-corrected estimators is slightly better than that of a bootstrap bias-corrected estimator. The bias-corrected estimator of the location parameter significantly outperforms its bootstrap-based counterpart. Taking computational costs into account, the analytic bias corrections clearly dominate the use of the bootstrap.


In this paper, we discuss the problem of estimating the mean of the inverse Gaussian distribution with a known coefficient of variation. Two types of linear estimators for the mean, the linear minimum variance unbiased estimator and the linear minimum mean squared error estimator, are constructed by using the squared error loss function and their properties are examined. It is observed that, for small samples, the proposed estimators perform better than the maximum likelihood estimator when the coefficient of variation is large.

We consider the problem of data-based choice of the bandwidth of a kernel density estimator, with an aim to estimate the density optimally at a given design point. The existing local bandwidth selectors seem to be quite sensitive to the underlying density and location of the design point. For instance, some bandwidth selectors perform poorly while estimating a density, with bounded support, at the median. Others struggle to estimate a density in the tail region or at the trough between the two modes of a multimodal density. We propose a scale invariant bandwidth selection method such that the resulting density estimator performs reliably irrespective of the density or the design point. We choose bandwidth by minimizing a bootstrap estimate of the mean squared error (MSE) of a density estimator. Our bootstrap MSE estimator is different in the sense that we estimate the variance and squared bias components separately. We provide insight into the asymptotic accuracy of the proposed density estimator.


This paper is concerned with properties (bias, standard deviation, mean squared error and efficiency) of twenty-six estimators of the intraclass correlation in the analysis of binary data. Our main interest is to study these properties when data are generated from different distributions. For data generation we considered three over-dispersed binomial distributions, namely, the beta-binomial distribution, the probit normal binomial distribution and a mixture of two binomial distributions. The findings regarding bias, standard deviation and mean squared error of all these estimators are that (a) in general, the distributions of biases of most of the estimators are negatively skewed. The biases are smallest when data are generated from the beta-binomial distribution and largest when data are generated from the mixture distribution; (b) the standard deviations are smallest when data are generated from the beta-binomial distribution; and (c) the mean squared errors are smallest when data are generated from the beta-binomial distribution and largest when data are generated from the mixture distribution. Of the 26, nine estimators, including the maximum likelihood estimator, an estimator based on the optimal quadratic estimating equations of Crowder (1987), and an analysis of variance type estimator, are found to have the least bias, standard deviation and mean squared error. Also, the distributions of the bias, standard deviation and mean squared error for each of these estimators are, in general, more symmetric than those of the other estimators. Our findings regarding efficiency are that the estimator based on the optimal quadratic estimating equations has consistently high efficiency and the least variability in the efficiency results. In the important range in which the intraclass correlation is small (≤0.5), on average, this estimator shows the best efficiency performance. The analysis of variance type estimator seems to do well for larger values of the intraclass correlation. In general, the estimator based on the optimal quadratic estimating equations seems to show the best efficiency performance for data from the beta-binomial distribution and the probit normal binomial distribution, and the analysis of variance type estimator seems to do well for data from the mixture distribution.

It is well known that adaptive sequential nonparametric estimation of differentiable functions with assigned mean integrated squared error and minimax expected stopping time is impossible. In other words, no sequential estimator can compete with an oracle estimator that knows how many derivatives an estimated curve has. Differentiable functions are typical in probability density and regression models but not in spectral density models, where considered functions are typically smoother. This paper shows that for a large class of spectral densities, which includes spectral densities of classical autoregressive moving average processes, an adaptive minimax sequential estimation with assigned mean integrated squared error is possible. Furthermore, a two‐stage sequential procedure is proposed, which is minimax and adaptive to smoothness of an underlying spectral density.

In this paper, we discuss the estimation of population characteristics using stratified random sampling in an infinite population framework, including ranked set sampling as a special case. The use of prior values is considered and the underlying distribution is assumed to be unknown. The estimator considered in each stratum is the weighted mean of the U-statistic and prior value. The optimum weight is obtained by minimizing the mean squared error of the estimator of the population characteristics, but it contains unknown parameters, which are replaced with their estimates. Simulation results show efficiency gains for the proposed estimator, by a factor of at least 1.2 relative to the usual unbiased estimator under certain conditions specified in the text. Guidelines for the usage of the proposed estimator are given and an application to a real data set is provided.

Much of the small‐area estimation literature focuses on population totals and means. However, users of survey data are often interested in the finite‐population distribution of a survey variable and in the measures (e.g. medians, quartiles, percentiles) that characterize the shape of this distribution at the small‐area level. In this paper we propose a model‐based direct estimator (MBDE, Chandra and Chambers) of the small‐area distribution function. The MBDE is defined as a weighted sum of sample data from the area of interest, with weights derived from the calibrated spline‐based estimate of the finite‐population distribution function introduced by Harms and Duchesne, under an appropriately specified regression model with random area effects. We also discuss the mean squared error estimation of the MBDE. Monte Carlo simulations based on both simulated and real data sets show that the proposed MBDE and its associated mean squared error estimator perform well when compared with alternative estimators of the area‐specific finite‐population distribution function.

We propose the total bootstrap median (TBM) as a robust and efficient estimator of location and scale for small samples. We demonstrate its performance by estimating the mean and variance of a variety of distributions. We also show that, if the underlying distribution is unknown and there is either no contamination or low to moderate contamination, the TBM provides a better estimate of the mean, in mean square terms, than the sample mean or the sample median. In addition, the TBM is a better estimator of the variance of the underlying distribution than the sample variance or the square of the bias-corrected median absolute deviation from the median estimator. We also show that the TBM is an explicit L-estimator, which allows a direct study of its properties.

Stein [Stein, C. (1956). Inadmissibility of the usual estimator for the mean of a multivariate normal distribution. In Proc. 3rd Berkeley symp. math. statist. and pro. (pp. 197–206). University of California Press], in his seminal paper, came up with the surprising discovery that the sample mean is an inadmissible estimator of the population mean in three or higher dimensions under squared error loss. The past five decades have witnessed multiple extensions and variations of Stein’s results. In this paper we develop Stein-type estimators in a semiparametric framework and prove their coordinatewise asymptotic dominance over the sample mean in terms of Bayes risks.
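
For readers unfamiliar with the shrinkage idea behind the abstract above, here is a minimal sketch of the classical James-Stein estimator. It is not the semiparametric estimator developed in the paper. It assumes a single observation x from a p-dimensional normal distribution with covariance sigma2 * I, where sigma2 is known and p >= 3. The function name `james_stein` and its parameters are hypothetical.

```python
import numpy as np


def james_stein(x, sigma2: float = 1.0) -> np.ndarray:
    """Classical James-Stein estimate of the mean, shrinking x toward the origin.

    Assumes x is one draw from N_p(theta, sigma2 * I) with known sigma2 and p >= 3.
    """
    x = np.asarray(x, dtype=float)
    p = x.size
    if p < 3:
        raise ValueError("James-Stein shrinkage requires dimension p >= 3")
    squared_norm = float(np.dot(x, x))
    if squared_norm == 0.0:
        raise ValueError("shrinkage factor is undefined for the zero vector")
    # Plain (not positive-part) shrinkage factor: 1 - (p - 2) * sigma2 / ||x||^2.
    shrink = 1.0 - (p - 2) * sigma2 / squared_norm
    return shrink * x


# Illustrative input only: p = 5, ||x||^2 = 25, so the factor is 1 - 3/25 = 0.88.
estimate = james_stein([3.0, 4.0, 0.0, 0.0, 0.0])
```

The plain form is used here for brevity. When the squared norm of x is small the factor can become negative, which is why a positive-part variant that truncates the factor at zero is often preferred in practice.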

In this paper, we consider the estimation of the probability density function and the cumulative distribution function of the inverse Rayleigh distribution. In this regard, the following estimators are considered: uniformly minimum variance unbiased estimator, maximum likelihood (ML) estimator, percentile estimator, least squares estimator and weighted least squares estimator. To do so, analytical expressions are derived for the mean integrated squared error. As the results of simulation studies and real data applications indicate, when the sample size is not very small the ML estimator performs better than the others.

The small-sample bias and root mean squared error of several distribution-free estimators of the variance of the sample median are examined. A new estimator is proposed that is easy to compute and tends to have the smallest bias and root mean squared error.

Three combined estimators for the bivariate normal correlation parameter are considered. The data consist of k independent sample correlation coefficients and it is assumed that the underlying correlation parameters are all equal to ρ. Based upon the joint density function of the sample correlations a combined estimator of ρ is obtained as an approximation to the maximum likelihood solution. Two linearly combined estimators are also considered. One of them is based on Fisher's z-transformation of the sample correlations and the other on an unbiased estimator of ρ. The comparison of these three estimators indicates that the combined (approximate) MLE has a slightly smaller estimated mean squared error relative to the other two combined methods of estimation, but it does so at the expense of a relatively larger bias.
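
The z-transformation approach mentioned in the abstract above can be sketched as follows. The abstract does not state which weights the authors use, so the weights n_i - 3 below (the usual approximate inverse variance of Fisher's z) are an assumption, and the function name `combine_correlations_fisher` is hypothetical.

```python
import math


def combine_correlations_fisher(rs, ns):
    """Pool k independent sample correlations through Fisher's z-transformation.

    rs: sample correlation coefficients, each strictly between -1 and 1.
    ns: corresponding sample sizes, each greater than 3.
    Weights n_i - 3 are an assumed, conventional choice.
    """
    if len(rs) != len(ns) or len(rs) == 0:
        raise ValueError("rs and ns must be non-empty and of equal length")
    weights = [n - 3 for n in ns]
    if any(w <= 0 for w in weights):
        raise ValueError("every sample size must exceed 3")
    # Transform each r to z = atanh(r), average on the z scale, then map back.
    z_bar = sum(w * math.atanh(r) for w, r in zip(weights, rs)) / sum(weights)
    return math.tanh(z_bar)


# Illustrative inputs only (assumed, not from the article).
pooled = combine_correlations_fisher([0.2, 0.8], [30, 30])
```

Averaging on the z scale rather than the r scale is the point of the transformation: the sampling distribution of z is closer to normal with a variance that depends little on ρ, so a weighted mean is better behaved there.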
