期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Asymptotic theory and inference of predictive mean matching imputation using a superpopulation model framework

Shu Yang Jae Kwang Kim 《Scandinavian Journal of Statistics》2020,47(3):839-861

Predictive mean matching imputation is popular for handling item nonresponse in survey sampling. In this article, we study the asymptotic properties of the predictive mean matching estimator for finite-population inference using a superpopulation model framework. We also clarify conditions for its robustness. For variance estimation, the conventional bootstrap inference is invalid for matching estimators with a fixed number of matches due to the nonsmoothness nature of the matching estimator. We propose a new replication variance estimator, which is asymptotically valid. The key strategy is to construct replicates directly based on the linear terms of the martingale representation for the matching estimator, instead of individual records of variables. Simulation studies confirm that the proposed method provides valid inference. 相似文献

2.

Estimation of the distribution function under hybrid ranked set sampling

Abdul Haq 《Journal of Statistical Computation and Simulation》2017,87(2):313-327

Recently, a hybrid ranked set sampling (HRSS) scheme has been proposed in the literature. The HRSS scheme encompasses several existing ranked set sampling (RSS) schemes, and it is a cost-effective alternative to the classical RSS and double RSS schemes. In this paper, we propose an improved estimator for estimating the cumulative distribution function (CDF) using HRSS. It is shown, both theoretically and numerically, that the CDF estimator under HRSS scheme is unbiased and its variance is always less than the variance of the CDF estimator with simple random sampling (SRS). An unbiased estimator of the variance of CDF estimator using HRSS is also derived. Using Monte Carlo simulations, we also study the performances of the proposed and existing CDF estimators under both perfect and imperfect rankings. It turns out that the proposed CDF estimator is by far a superior alternative to the existing CDF estimators with SRS, RSS and L-RSS schemes. For a practical application, a real data set is considered on the bilirubin level of babies in neonatal intensive care. 相似文献

3.

Estimation of Fibre Length Distributions from Fibre Endpoints

下载免费PDF全文

Malte Kuhlmann Claudia Redenbach 《Scandinavian Journal of Statistics》2015,42(4):1010-1022

Estimating the fibre length distribution in composite materials is of practical relevance in materials science. We propose an estimator for the fibre length distribution using the point process of fibre endpoints as input. Assuming that this point process is a realization of a Neyman–Scott process, we use results for the reduced second moment measure to derive a consistent and unbiased estimator for the fibre length distribution. We introduce various versions of the estimator taking anisotropy or errors in the observation into account. The estimator is evaluated using a heuristic for its mean squared error as well as a simulation study. Finally, the estimator is applied to the fibre endpoint process extracted from a tomographic image of a glass fibre composite. 相似文献

4.

Empirical Uncertain Bayes Methods in Area‐level Models

下载免费PDF全文

Shonosuke Sugasawa Tatsuya Kubokawa Kota Ogasawara 《Scandinavian Journal of Statistics》2017,44(3):684-706

Random effects model can account for the lack of fitting a regression model and increase precision of estimating area‐level means. However, in case that the synthetic mean provides accurate estimates, the prior distribution may inflate an estimation error. Thus, it is desirable to consider the uncertain prior distribution, which is expressed as the mixture of a one‐point distribution and a proper prior distribution. In this paper, we develop an empirical Bayes approach for estimating area‐level means, using the uncertain prior distribution in the context of a natural exponential family, which we call the empirical uncertain Bayes (EUB) method. The regression model considered in this paper includes the Poisson‐gamma and the binomial‐beta, and the normal‐normal (Fay–Herriot) model, which are typically used in small area estimation. We obtain the estimators of hyperparameters based on the marginal likelihood by using a well‐known expectation‐maximization algorithm and propose the EUB estimators of area means. For risk evaluation of the EUB estimator, we derive a second‐order unbiased estimator of a conditional mean squared error by using some techniques of numerical calculation. Through simulation studies and real data applications, we evaluate a performance of the EUB estimator and compare it with the usual empirical Bayes estimator. 相似文献

5.

Nonparametric estimation of bivariate distribution under right truncation with application to panic disorder

Xiaodong Luo Wei-Yann Tsai 《Journal of statistical planning and inference》2009

In several studies, investigators are interested in estimating the bivariate distribution of the onset ages of a generic disorder in successive generations. The empirical distribution is inappropriate for this purpose due to truncation: only parent–child pairs with onset ages prior to the ages at interview were included in the sample. In this paper, we propose a simple nonparametric estimator for the underlying bivariate distribution of the onset ages. Compared with the existing estimators, the proposed estimator has a closed form and smaller biases when estimating marginal distributions. A real example is used to illustrate this estimator. 相似文献

6.

Multistage plug—in bandwidth selection for kernel distribution function estimates

《Journal of Statistical Computation and Simulation》2012,82(1-4):63-80

The use of a kernel estimator as a smooth estimator for a distribution function has been suggested by many authors An expression for the bandwidth that minimizes the mean integrated square error asymptotically has been available for some time. However, few practical data based methods ior estimating this bandwidth have been investigated. In this paper we propose multisstage plug-in type estimater for this optimal bandwith and derive its asymptotic properties. In particular we show that two stages are required for good asymptotic properties. This behavior is verified for finite samples using a simulation study. 相似文献

7.

Use of several auxiliary variables in estimating the population mean in a two-occasion rotation pattern

Housila P. Singh 《统计学通讯:理论与方法》2013,42(23):6928-6942

ABSTRACT

This paper addresses the problem of estimation of the population mean on the current (second) occasion in two-occasion successive sampling. Utilizing the readily available information on several auxiliary variables on both occasions and the information on the study variable from the previous occasion, an estimation procedure of the population mean on the current occasion has been proposed. Theoretical properties of the proposed estimator have been investigated. Optimum replacement policy to the proposed estimator has been discussed. The proposed estimator has been compared empirically with the sample mean estimator, when there is no matching and the optimum estimator which is a linear combination of the means of the matched and unmatched portions of the sample at the current occasion. Appropriate recommendations have been made for practical applications. 相似文献

8.

Large sample properties of some survival estimators in heterogeneous samples

《Journal of statistical planning and inference》1997,60(1):123-138

相似文献

9.

A study of regression calibration in a partially observed stratified Cox model

Jean-François Dupuy Eve Leconte 《Journal of statistical planning and inference》2009

Regression calibration is a simple method for estimating regression models when covariate data are missing for some study subjects. It consists in replacing an unobserved covariate by an estimator of its conditional expectation given available covariates. Regression calibration has recently been investigated in various regression models such as the linear, generalized linear, and proportional hazards models. The aim of this paper is to investigate the appropriateness of this method for estimating the stratified Cox regression model with missing values of the covariate defining the strata. Despite its practical relevance, this problem has not yet been discussed in the literature. Asymptotic distribution theory is developed for the regression calibration estimator in this setting. A simulation study is also conducted to investigate the properties of this estimator. 相似文献

10.

Variance estimation of the Buckley–James estimator under discrete assumptions

《Journal of Statistical Computation and Simulation》2012,82(4):481-496

The Buckley–James estimator (BJE) is a widely recognized approach in dealing with right-censored linear regression models. There have been a lot of discussions in the literature on the estimation of the BJE as well as its asymptotic distribution. So far, no simulation has been done to directly estimate the asymptotic variance of the BJE. Kong and Yu [Asymptotic distributions of the Buckley–James estimator under nonstandard conditions, Statist. Sinica 17 (2007), pp. 341–360] studied the asymptotic distribution under discontinuous assumptions. Based on their methodology, we recalculate and correct some missing terms in the expression of the asymptotic variance in Theorem 2 of their work. We propose an estimator of the standard deviation of the BJE by using plug-in estimators. The estimator is shown to be consistent. The performance of the estimator is accessed through simulation studies under discrete underline distributions. We further extend our studies to several continuous underline distributions through simulation. The estimator is also applied to a real medical data set. The simulation results suggest that our estimation is a good approximation to the true standard deviation with reference to the empirical standard deviation. 相似文献

11.

Estimation with Interval Censored Data and Covariates

van der Laan Mark J. Hubbard Alan 《Lifetime data analysis》1997,3(1):77-91

In biostatistical applications interest often focuses on the estimation of the distribution of time T between two consecutive events. If the initial event time is observed and the subsequent event time is only known to be larger or smaller than an observed monitoring time C, then the data conforms to the well understood singly-censored current status model, also known as interval censored data, case I. Additional covariates can be used to allow for dependent censoring and to improve estimation of the marginal distribution of T. Assuming a wrong model for the conditional distribution of T, given the covariates, will lead to an inconsistent estimator of the marginal distribution. On the other hand, the nonparametric maximum likelihood estimator of FT requires splitting up the sample in several subsamples corresponding with a particular value of the covariates, computing the NPMLE for every subsample and then taking an average. With a few continuous covariates the performance of the resulting estimator is typically miserable. In van der Laan, Robins (1996) a locally efficient one-step estimator is proposed for smooth functionals of the distribution of T, assuming nothing about the conditional distribution of T, given the covariates, but assuming a model for censoring, given the covariates. The estimators are asymptotically linear if the censoring mechanism is estimated correctly. The estimator also uses an estimator of the conditional distribution of T, given the covariates. If this estimate is consistent, then the estimator is efficient and if it is inconsistent, then the estimator is still consistent and asymptotically normal. In this paper we show that the estimators can also be used to estimate the distribution function in a locally optimal way. Moreover, we show that the proposed estimator can be used to estimate the distribution based on interval censored data (T is now known to lie between two observed points) in the presence of covariates. The resulting estimator also has a known influence curve so that asymptotic confidence intervals are directly available. In particular, one can apply our proposal to the interval censored data without covariates. In Geskus (1992) the information bound for interval censored data with two uniformly distributed monitoring times at the uniform distribution (for T has been computed. We show that the relative efficiency of our proposal w.r.t. this optimal bound equals 0.994, which is also reflected in finite sample simulations. Finally, the good practical performance of the estimator is shown in a simulation study. This revised version was published online in July 2006 with corrections to the Cover Date. 相似文献

12.

On the one parameter unit-Lindley distribution and its associated regression model for proportion data

Josmar Mazucheli André Felipe Berdusco Menezes Subrata Chakraborty 《Journal of applied statistics》2019,46(4):700-714

In this paper considering an appropriate transformation on the Lindley distribution, we propose the unit-Lindley distribution and investigate some of its statistical properties. An important fact associated with this new distribution is that it is possible to obtain the analytical expression for bias correction of the maximum likelihood estimator. Moreover, it belongs to the exponential family. This distribution allows us to incorporate covariates directly in the mean and consequently to quantify their influences on the average of the response variable. Finally, a practical application is presented to show that our model fits much better than the Beta regression. 相似文献

13.

缺失数据下的逆概率多重加权分位回归估计及其应用

邰凌楠等《统计研究》2018,35(9):115-128

数据缺失问题普遍存在于应用研究中。在随机缺失机制假定下,本文从模型推断角度出发,针对线性缺失分位回归模型,提出一种新的有效估计方法——逆概率多重加权（IPMW）估计。该方法是在逆概率加权（IPW）估计的基础上,结合倾向得分匹配及模型平均思想,经过多次估计,加权确定最终参数估计结果。该方法适用于响应变量是独立同分布或独立非同分布的情形,并适用于绝大多数缺失场景。经过理论推导及模拟研究发现,IPMW估计量在继承IPW估计量的优势上具有更稳健的性质。最后,将该方法应用于含有缺失数据的微观调查数据中,研究了经济较发达的准一线城市中等收入群体消费水平的影响因素,对比两种估计方法的估计结果及置信带,发现逆概率多重加权估计量的标准偏差更小,估计结果更稳健。相似文献

14.

The exact distribution and density functions of the stein-type estimator for normal variance

Kazuhiro Ohtani 《统计学通讯:理论与方法》2013,42(10):2863-2876

In this paper, we derive the exact distribution and density functions of the Stein-type estimator for the normal variance. It is shown by numerical evaluation that the density function of the Stein-type estimator is unimodal and concentrates around the mode more than that of the usual estimator. 相似文献

15.

An improved estimation procedure of population mean in two-occasion successive sampling

G. N. Singh 《统计学通讯:理论与方法》2013,42(13):3930-3938

ABSTRACT

In this paper an attempt to estimate the current population mean in two-occasion successive sampling has been made. A modified regression-type estimator has been proposed. Optimum replacement strategy of the proposed estimator has been formulated. The proposed estimator is compared with sample mean estimator when there is no matching from the previous occasion and the optimum natural successive sampling estimator. Empirical studies are carried out and suitable recommendations have been made. 相似文献

16.

A novel weighted likelihood estimation with empirical Bayes flavor

Md. Mobarak Hossain Krzysztof Podgórski 《统计学通讯:模拟与计算》2018,47(2):392-412

We propose a novel approach to estimation, where a set of estimators of a parameter is combined into a weighted average to produce the final estimator. The weights are chosen to be proportional to the likelihood evaluated at the estimators. We investigate the method for a set of estimators obtained by using the maximum likelihood principle applied to each individual observation. The method can be viewed as a Bayesian approach with a data-driven prior distribution. We provide several examples illustrating the new method and argue for its consistency, asymptotic normality, and efficiency. We also conduct simulation studies to assess the performance of the estimators. This straightforward methodology produces consistent estimators comparable with those obtained by the maximum likelihood method. The method also approximates the distribution of the estimator through the “posterior” distribution. 相似文献

17.

A statistical test for the hypothesis of Gaussian random function

Shun Matsuura Haruka Yamashita Kimberly K. J. Kinateder 《Statistics》2018,52(4):801-817

A Gaussian random function is a functional version of the normal distribution. This paper proposes a statistical hypothesis test to test whether or not a random function is a Gaussian random function. A parameter that is equal to 0 under Gaussian random function is considered, and its unbiased estimator is given. The asymptotic distribution of the estimator is studied, which is used for constructing a test statistic and discussing its asymptotic power. The performance of the proposed test is investigated through several numerical simulations. An illustrative example is also presented. 相似文献

18.

Zero-inflated models and estimation in zero-inflated Poisson distribution

Yogita S. Wagh 《统计学通讯:模拟与计算》2018,47(8):2248-2265

In this paper, we briefly overview different zero-inflated probability distributions. We compare the performance of the estimates of Poisson, Generalized Poisson, ZIP, ZIGP and ZINB models through Mean square error (MSE), bias and Standard error (SE) when the samples are generated from ZIP distribution. We propose a new estimator referred to as probability estimator (PE) of inflation parameter of ZIP distribution based on moment estimator (ME) of the mean parameter and compare its performance with ME and maximum likelihood estimator (MLE) through a simulation study. We use the PE along with ME and MLE to fit ZIP distribution to various zero-inflated datasets and observe that the results do not differ significantly. We recommend using PE in place of MLE since it is easy to calculate and the simulation study in this paper demonstrates that the PE performs as good as MLE irrespective of the sample size. 相似文献

19.

零膨胀计数数据的联合建模及变量选择

胡亚南田茂再《统计研究》2019,36(1):104-114

零膨胀计数数据破坏了泊松分布的方差-均值关系，可由取值服从泊松分布的数据和取值为零(退化分布)的数据各占一定比例所构成的混合分布所解释。本文基于自适应弹性网技术，研究了零膨胀计数数据的联合建模及变量选择问题.对于零膨胀泊松分布，引入潜变量，构造出零膨胀泊松模型的完全似然, 其中由零膨胀部分和泊松部分两项组成.考虑到协变量可能存在共线性和稀疏性，通过对似然函数加自适应弹性网惩罚得到目标函数,然后利用EM算法得到回归系数的稀疏估计量，并用贝叶斯信息准则BIC来确定最优调节参数.本文也给出了估计量的大样本性质的理论证明和模拟研究，最后把所提出的方法应用到实际问题中。相似文献

20.

Two new confidence intervals for the coefficient of variation in a normal distribution

Rahim Mahmoudvand 《Journal of applied statistics》2009,36(4):429-442

In this article we introduce an approximately unbiased estimator for the population coefficient of variation, τ, in a normal distribution. The accuracy of this estimator is examined by several criteria. Using this estimator and its variance, two approximate confidence intervals for τ are introduced. The performance of the new confidence intervals is compared to those obtained by current methods. 相似文献