期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Some properties of a nonparametric estimator of the size of an open population

Richard Huggins 《Journal of statistical planning and inference》2008

It has been demonstrated in the literature that local polynomial models may be used to estimate the size of an open population using capture–recapture data. However, very little is known about their properties. Here we develop a setting in which the properties of nonparametric estimators of the size of an open population using capture–recapture data can be examined and establish conditions under which expressions for the bias and variance may be determined. 相似文献

2.

CAMCR: Computer-Assisted Mixture model analysis for Capture–Recapture count data

Ronny Kuhnert Dankmar Böhning 《AStA Advances in Statistical Analysis》2009,93(1):61-71

Population size estimation with discrete or nonparametric mixture models is considered, and reliable ways of construction of the nonparametric mixture model estimator are reviewed and set into perspective. Construction of the maximum likelihood estimator of the mixing distribution is done for any number of components up to the global nonparametric maximum likelihood bound using the EM algorithm. In addition, the estimators of Chao and Zelterman are considered with some generalisations of Zelterman’s estimator. All computations are done with CAMCR, a special software developed for population size estimation with mixture models. Several examples and data sets are discussed and the estimators illustrated. Problems using the mixture model-based estimators are highlighted. 相似文献

3.

Estimators in capture–recapture studies with two sources

Sarah Brittain Dankmar Böhning 《AStA Advances in Statistical Analysis》2009,93(1):23-47

This paper investigates the applications of capture–recapture methods to human populations. Capture–recapture methods are commonly used in estimating the size of wildlife populations but can also be used in epidemiology and social sciences, for estimating prevalence of a particular disease or the size of the homeless population in a certain area. Here we focus on estimating the prevalence of infectious diseases. Several estimators of population size are considered: the Lincoln–Petersen estimator and its modified version, the Chapman estimator, Chao’s lower bound estimator, the Zelterman’s estimator, McKendrick’s moment estimator and the maximum likelihood estimator. In order to evaluate these estimators, they are applied to real, three-source, capture-recapture data. By conditioning on each of the sources of three source data, we have been able to compare the estimators with the true value that they are estimating. The Chapman and Chao estimators were compared in terms of their relative bias. A variance formula derived through conditioning is suggested for Chao’s estimator, and normal 95% confidence intervals are calculated for this and the Chapman estimator. We then compare the coverage of the respective confidence intervals. Furthermore, a simulation study is included to compare Chao’s and Chapman’s estimator. Results indicate that Chao’s estimator is less biased than Chapman’s estimator unless both sources are independent. Chao’s estimator has also the smaller mean squared error. Finally, the implications and limitations of the above methods are discussed, with suggestions for further development. We are grateful to the Medical Research Council for supporting this work. 相似文献

4.

Some General Comparative Points on Chao's and Zelterman's Estimators of the Population Size

DANKMAR BÖHNING 《Scandinavian Journal of Statistics》2010,37(2):221-236

Abstract. Two simple and frequently used capture–recapture estimates of the population size are compared: Chao's lower‐bound estimate and Zelterman's estimate allowing for contaminated distributions. In the Poisson case it is shown that if there are only counts of ones and twos, the estimator of Zelterman is always bounded above by Chao's estimator. If counts larger than two exist, the estimator of Zelterman is becoming larger than that of Chao's, if only the ratio of the frequencies of counts of twos and ones is small enough. A similar analysis is provided for the binomial case. For a two‐component mixture of Poisson distributions the asymptotic bias of both estimators is derived and it is shown that the Zelterman estimator can experience large overestimation bias. A modified Zelterman estimator is suggested and also the bias‐corrected version of Chao's estimator is considered. All four estimators are compared in a simulation study. 相似文献

5.

On the Chao and Zelterman estimators in a binomial mixture model

《Statistical Methodology》2015

Data from a surveillance system can be used to estimate the size of a disease population. For certain surveillance systems, a binomial mixture model arises as a natural choice. The Chao estimator estimates a lower bound of the population size. The Zelterman estimator estimates a parameter that is neither a lower bound nor an upper bound. By comparing the Chao estimator and the Zelterman estimator both theoretically and numerically, we conclude that the Chao estimator is better. 相似文献

6.

Merging data from multiple sources: pretest and shrinkage perspectives

Muhammad Kashif Ali Shah Supranee Lisawadi S. Ejaz Ahmed 《Journal of Statistical Computation and Simulation》2017,87(8):1577-1592

In this article, we have developed asymptotic theory for the simultaneous estimation of the k means of arbitrary populations under the common mean hypothesis and further assuming that corresponding population variances are unknown and unequal. The unrestricted estimator, the Graybill-Deal-type restricted estimator, the preliminary test, and the Stein-type shrinkage estimators are suggested. A large sample test statistic is also proposed as a pretest for testing the common mean hypothesis. Under the sequence of local alternatives and squared error loss, we have compared the asymptotic properties of the estimators by means of asymptotic distributional quadratic bias and risk. Comprehensive Monte-Carlo simulation experiments were conducted to study the relative risk performance of the estimators with reference to the unrestricted estimator in finite samples. Two real-data examples are also furnished to illustrate the application of the suggested estimation strategies. 相似文献

7.

Parametric and semiparametric models for recapture and removal studies: a likelihood approach

Kani Chen 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2001,63(3):607-619

Capture–recapture processes are biased samplings of recurrent event processes, which can be modelled by the Andersen–Gill intensity model. The intensity function is assumed to be a function of time, covariates and a parameter. We derive the maximum likelihood estimators of both the parameter and the population size and show the consistency and asymptotic normality of the estimators for both recapture and removal studies. The estimators are asymptotically efficient and their theoretical asymptotic relative efficiencies with respect to the existing estimators of Yip and co-workers can be as large as ∞. The variance estimation and a numerical example are also presented. 相似文献

8.

Population size estimation and heterogeneity in capture–recapture data: a linear regression estimator based on the Conway–Maxwell–Poisson distribution

Orasa Anan Dankmar Böhning Antonello Maruotti 《Statistical Methods and Applications》2017,26(1):49-79

The purpose of the study is to estimate the population size under a truncated count model that accounts for heterogeneity. The proposed estimator is based on the Conway–Maxwell–Poisson distribution. The benefit of using the Conway–Maxwell–Poisson distribution is that it includes the Bernoulli, the Geometric and the Poisson distributions as special cases and, furthermore, allows for heterogeneity. Parameter estimates can be obtained by exploiting the ratios of successive frequency counts in a weighted linear regression framework. The results of the comparisons with Turing’s, the maximum likelihood Poisson, Zelterman’s and Chao’s estimators reveal that our proposal can be beneficially used. Furthermore, our proposal outperforms its competitors under all heterogeneous settings. The empirical examples consider the homogeneous case and several heterogeneous cases, each with its own features, and provide interesting insights on the behavior of the estimators. 相似文献

9.

Link-tracing sampling with an initial sequential sample of sites: Estimating the size of a hidden human population

Martín H. Flix-Medina Pedro E. Monjardin 《Statistical Methodology》2009,6(5):490-502

We proposed a modification to the variant of link-tracing sampling suggested by Félix-Medina and Thompson [M.H. Félix-Medina, S.K. Thompson, Combining cluster sampling and link-tracing sampling to estimate the size of hidden populations, Journal of Official Statistics 20 (2004) 19–38] that allows the researcher to have certain control of the final sample size, precision of the estimates or other characteristics of the sample that the researcher is interested in controlling. We achieve this goal by selecting an initial sequential sample of sites instead of an initial simple random sample of sites as those authors suggested. We estimate the population size by means of the maximum likelihood estimators suggested by the above-mentioned authors or by the Bayesian estimators proposed by Félix-Medina and Monjardin [M.H. Félix-Medina, P.E. Monjardin, Combining link-tracing sampling and cluster sampling to estimate the size of hidden populations: A Bayesian-assisted approach, Survey Methodology 32 (2006) 187–195]. Variances are estimated by means of jackknife and bootstrap estimators as well as by the delta estimators proposed in the two above-mentioned papers. Interval estimates of the population size are obtained by means of Wald and bootstrap confidence intervals. The results of an exploratory simulation study indicate good performance of the proposed sampling strategy. 相似文献

10.

Finite-population variance estimation under systematic sampling schemes with multiple random starts

《Journal of Statistical Computation and Simulation》2012,82(8):1207-1221

Motivated by Sampath [Finite population variance estimation under LSS with multiple random starts, Commun. Statist. – Theory Methods 38 (2009), pp. 3596–3607], in this paper unbiased estimators for population variance have been developed under linear systematic sampling, balanced systematic sampling and modified systematic sampling with multiple random starts. Expressions for variances of the estimators are also developed. Detailed numerical comparative studies have been carried out to study the performances of the estimators under various systematic sampling schemes with multiple random starts and some interesting conclusions have been drawn out of the study. 相似文献

11.

Measurement errors in continuous-time capture–recapture models

Wen-Han Hwang Yih-Huei Huang 《Journal of statistical planning and inference》2007

The extent of bias due to measurement errors is an important problem in the context of regression and survival analysis. While research in these areas has been extensive and fruitful, investigations into the effect of measurement errors in capture–recapture models have been very limited. The contributions of this paper are to understand the effects of measurement errors in continuous-time capture–recapture models and to propose new methods to circumvent their impact. We derive asymptotic variance formulas for each method and assess their finite sample properties via simulation studies. 相似文献

12.

Properties of design-based functional principal components analysis

Herv Cardot Mohamed Chaouch Camelia Goga Catherine Labrure 《Journal of statistical planning and inference》2010,140(1):75-91

This work aims at performing functional principal components analysis (FPCA) with Horvitz–Thompson estimators when the observations are curves collected with survey sampling techniques. One important motivation for this study is that FPCA is a dimension reduction tool which is the first step to develop model-assisted approaches that can take auxiliary information into account. FPCA relies on the estimation of the eigenelements of the covariance operator which can be seen as nonlinear functionals. Adapting to our functional context the linearization technique based on the influence function developed by Deville [1999. Variance estimation for complex statistics and estimators: linearization and residual techniques. Survey Methodology 25, 193–203], we prove that these estimators are asymptotically design unbiased and consistent. Under mild assumptions, asymptotic variances are derived for the FPCA’ estimators and consistent estimators of them are proposed. Our approach is illustrated with a simulation study and we check the good properties of the proposed estimators of the eigenelements as well as their variance estimators obtained with the linearization approach. 相似文献

13.

Bias-corrected realized variance

Jin-Huei Yeh Jying-Nan Wang 《Econometric Reviews》2019,38(2):170-192

We propose a novel “bias-corrected realized variance” (BCRV) estimator based upon the appropriate re-weighting of two realized variances calculated at different sampling frequencies. Our bias-correction methodology is found to be extremely accurate, with the finite sample variance being significantly minimized. In our Monte Carlo experiments and a finite sample MSE comparison of alternative estimators, the performance of our straightforward BCRV estimator is shown to be comparable to other widely-used integrated variance estimators. Given its simplicity, our BCRV estimator is likely to appeal to researchers and practitioners alike for the estimation of integrated variance. 相似文献

14.

Two-step centre sampling for estimating the size, total and mean of elusive population

Monica Pratesi Emilia Rocco 《Statistical Methods and Applications》2005,14(3):357-374

相似文献

15.

Estimation of a rare sensitive attribute in a stratified sample using Poisson distribution

Gi-Sung Lee Jong-Min Kim 《Statistics》2013,47(3):575-589

This study proposes the estimators for the mean and its variance of the number of respondents who possessed a rare sensitive attribute based on stratified sampling schemes (stratified sampling and stratified double sampling). This study deals with the extension of the estimation reported in Land et al. [Estimation of a rare sensitive attribute using Poisson distribution, Statistics (2011), in press. DOI: 10.1080/02331888.2010.524300] using a Poisson distribution and an unrelated question randomized response model reported in Greenberg et al. [The unrelated question randomized response model: Theoretical framework, J. Amer. Statist. Assoc. 64 (1969), 520–539]. In the stratified sampling, the estimators are proposed when the parameter of the rare unrelated attribute is known and unknown. The variances of estimators using a proportional and optimum allocation are also suggested. The proposed estimators are evaluated using a relative efficiency comparing variances of the estimators reported in Land et al. depending on the parameters and the probability of selecting a question. We showed that our proposed methods have better efficiencies than Land et al.’s randomized response model in some conditions. When the sizes of stratified populations are not given, other estimators are suggested using a stratified double sampling. For the proportional allocation, the difference between two variances in the stratified sampling and the stratified double sampling is given with the known rare unrelated attribute. 相似文献

16.

Two separate effects of variance heterogeneity on the validity and power of significance tests of location

Donald W. Zimmerman 《Statistical Methodology》2006,3(4):351-374

Heterogeneity of variances of treatment groups influences the validity and power of significance tests of location in two distinct ways. First, if sample sizes are unequal, the Type I error rate and power are depressed if a larger variance is associated with a larger sample size, and elevated if a larger variance is associated with a smaller sample size. This well-established effect, which occurs in t and F tests, and to a lesser degree in nonparametric rank tests, results from unequal contributions of pooled estimates of error variance in the computation of test statistics. It is observed in samples from normal distributions, as well as non-normal distributions of various shapes. Second, transformation of scores from skewed distributions with unequal variances to ranks produces differences in the means of the ranks assigned to the respective groups, even if the means of the initial groups are equal, and a subsequent inflation of Type I error rates and power. This effect occurs for all sample sizes, equal and unequal. For the t test, the discrepancy diminishes, and for the Wilcoxon–Mann–Whitney test, it becomes larger, as sample size increases. The Welch separate-variance t test overcomes the first effect but not the second. Because of interaction of these separate effects, the validity and power of both parametric and nonparametric tests performed on samples of any size from unknown distributions with possibly unequal variances can be distorted in unpredictable ways. 相似文献

17.

RANKED SET SAMPLING FROM LOCATION-SCALE FAMILIES OF SYMMETRIC DISTRIBUTIONS

《统计学通讯:理论与方法》2013,42(8-9):1641-1659

Statistical inference based on ranked set sampling has primarily been motivated by nonparametric problems. However, the sampling procedure can provide an improved estimator of the population mean when the population is partially known. In this article, we consider estimation of the population mean and variance for the location-scale families of distributions. We derive and compare different unbiased estimators of these parameters based on rindependent replications of a ranked set sample of size n.Large sample properties, along with asymptotic relative efficiencies, help identify which estimators are best suited for different location-scale distributions. 相似文献

18.

A family of estimators of population variance in two-occasion rotation patterns

Housila P. Singh Jong M. Kim 《统计学通讯:理论与方法》2013,42(14):4106-4116

Abstract

In this article, we have considered the problem of estimation of population variance on current (second) occasion in two occasion successive (rotation) sampling. A class of estimators of population variance has been proposed and its asymptotic properties have been discussed. The proposed class of estimators is compared with the sample variance estimator when there is no matching from the previous occasion and the Singh et al. (2013) estimator. Optimum replacement policy is discussed. It has been shown that the suggested estimator is more efficient than the Singh et al. (2013) estimator and a usual unbiased estimator when there is no matching. An empirical study is carried out in support of the present study. 相似文献

19.

Blinded sample size re‐estimation in superiority and noninferiority trials: bias versus variance in variance estimation

Tim Friede Meinhard Kieser 《Pharmaceutical statistics》2013,12(3):141-146

The internal pilot study design allows for modifying the sample size during an ongoing study based on a blinded estimate of the variance thus maintaining the trial integrity. Various blinded sample size re‐estimation procedures have been proposed in the literature. We compare the blinded sample size re‐estimation procedures based on the one‐sample variance of the pooled data with a blinded procedure using the randomization block information with respect to bias and variance of the variance estimators, and the distribution of the resulting sample sizes, power, and actual type I error rate. For reference, sample size re‐estimation based on the unblinded variance is also included in the comparison. It is shown that using an unbiased variance estimator (such as the one using the randomization block information) for sample size re‐estimation does not guarantee that the desired power is achieved. Moreover, in situations that are common in clinical trials, the variance estimator that employs the randomization block length shows a higher variability than the simple one‐sample estimator and in turn the sample size resulting from the related re‐estimation procedure. This higher variability can lead to a lower power as was demonstrated in the setting of noninferiority trials. In summary, the one‐sample estimator obtained from the pooled data is extremely simple to apply, shows good performance, and is therefore recommended for application. Copyright © 2013 John Wiley & Sons, Ltd. 相似文献

20.

A note on the greenwood variance estimator in cohort life tables: Isolating the effect of one source of bias

Joseph Breuer 《统计学通讯:理论与方法》2013,42(25):2947-2951

A modification of the Greenwood variance estimator is defined and shown to be free of bias whenever its constituent interval estimators are conditionally unbiased, given the sample size at the start of the interval. Using the modified estimator as a standard of comparison, the original Greenwood estimator is seen to have an intrinsic positive bias.Underestimation of variances through the use of Greenwood's formula must be due to bias in the constituent interval estimators and/or, with fixed interval bounds, due to disregarding the random character of the total number of life table intervals to exhaustion of ttje sample. Some easy to prove properties of the modified and the original Greenwood estimators are stated that apply in the absence of censoring. A suggestion is made for reducing the bias of the interval variance estimators. 相似文献