Similar Articles
20 similar articles found.
1.
We consider semiparametric models for which solution of Horvitz–Thompson or inverse probability weighted (IPW) likelihood equations with two-phase stratified samples leads to consistent and asymptotically Gaussian estimators of both Euclidean and non-parametric parameters. For Bernoulli (independent and identically distributed) sampling, standard theory shows that the Euclidean parameter estimator is asymptotically linear in the IPW influence function. By proving weak convergence of the IPW empirical process, and borrowing results on weighted bootstrap empirical processes, we derive a parallel asymptotic expansion for finite population stratified sampling. Several of our key results have been derived already for Cox regression with stratified case–cohort and more general survey designs. This paper is intended to help interpret this previous work and to pave the way towards a general Horvitz–Thompson approach to semiparametric inference with data from complex probability samples.
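A minimal sketch of the weighting idea described above, in assumed notation (not necessarily the authors'): with phase-two selection indicator $\xi_i$ and known inclusion probability $\pi_i$ for unit $i$, the IPW likelihood equation for the Euclidean parameter and the asymptotic linearity referred to in the abstract take the form

$$\sum_{i=1}^{N}\frac{\xi_i}{\pi_i}\,\dot{\ell}_{\theta}(X_i)=0,\qquad \sqrt{N}\,\bigl(\hat{\theta}_N-\theta_0\bigr)=\frac{1}{\sqrt{N}}\sum_{i=1}^{N}\frac{\xi_i}{\pi_i}\,\tilde{\ell}_{0}(X_i)+o_p(1),$$

where $\dot{\ell}_{\theta}$ is the score and $\tilde{\ell}_{0}$ the IPW influence function; under finite-population stratified sampling the variance of the right-hand sum differs from the Bernoulli case.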

2.
We consider variance estimation when a stratified single-stage cluster sample is selected in the first phase and a stratified simple random element sample is selected in the second phase. We propose explicit formulas for (asymptotically) unbiased variance estimators for the double expansion estimator and the regression estimator. We perform a small simulation study to investigate the performance of the proposed variance estimators. In our simulation study, the proposed variance estimator showed better or comparable performance relative to the jackknife variance estimator. We also extend the results to a two-phase sampling design in which a stratified pps-with-replacement cluster sample is selected in the first phase.
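For orientation, the double expansion estimator of a total can be written in generic two-phase notation (assumed here, not necessarily the authors') as

$$\hat{Y}_{\mathrm{DE}}=\sum_{i\in s_2}\frac{y_i}{\pi_{1i}\,\pi_{2i\mid1}},$$

where $s_2$ is the second-phase sample, $\pi_{1i}$ is the first-phase inclusion probability of element $i$ and $\pi_{2i\mid1}$ its conditional second-phase inclusion probability; the regression estimator adjusts this with auxiliary information observed at the first phase.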

3.
This article develops three empirical likelihood (EL) approaches to estimate parameters in nonlinear regression models in the presence of nonignorable missing responses. These are based on the inverse probability weighted (IPW) method, the augmented IPW (AIPW) method and the imputation technique. A logistic regression model is adopted to specify the propensity score. Maximum likelihood estimation is used to estimate parameters in the propensity score by combining the idea of importance sampling and imputing estimating equations. Under some regularity conditions, we obtain the asymptotic properties of the maximum EL estimators of these unknown parameters. Simulation studies are conducted to investigate the finite sample performance of our proposed estimation procedures. Empirical results provide evidence that the AIPW procedure exhibits better performance than the other two procedures. Data from a survey conducted in 2002 are used to illustrate the proposed estimation procedure. The Canadian Journal of Statistics 48: 386–416; 2020 © 2020 Statistical Society of Canada
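A generic sketch of the AIPW estimating equation, in assumed notation (the paper's nonignorable propensity also depends on the response): with response indicator $\delta_i$ and fitted propensity $\pi_i$,

$$\sum_{i=1}^{n}\left[\frac{\delta_i}{\pi_i}\,g(x_i,y_i;\beta)+\Bigl(1-\frac{\delta_i}{\pi_i}\Bigr)\hat{E}\bigl\{g(x_i,Y;\beta)\bigr\}\right]=0,$$

where $g$ is the estimating function of the nonlinear regression model and $\hat{E}\{\cdot\}$ is an imputed conditional expectation for the missing responses.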

4.
Various bootstrap methods for variance estimation and confidence intervals in complex survey data, where sampling is done without replacement, have been proposed in the literature. The oldest, and perhaps the most intuitively appealing, is the without-replacement bootstrap (BWO) method proposed by Gross (1980). Unfortunately, the BWO method is only applicable to very simple sampling situations. We first introduce extensions of the BWO method to more complex sampling designs. The performance of the BWO and two other bootstrap methods, the rescaling bootstrap (Rao and Wu 1988) and the mirror-match bootstrap (Sitter 1992), are then compared through a simulation study. Together these three methods encompass the various bootstrap proposals.
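As a concrete illustration of one of the compared methods, here is a minimal sketch of the Rao–Wu rescaling bootstrap for the mean of a stratified simple random sample; the function name and data layout are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def rao_wu_bootstrap_var(strata, B=1000, rng=None):
    """Rescaling-bootstrap variance of a stratified-mean estimator.

    `strata` is a list of dicts with keys:
      'y' -- array of sampled values in the stratum,
      'W' -- stratum weight N_h / N,
      'f' -- sampling fraction n_h / N_h.
    Sketch of the Rao-Wu (1988) rescaling idea with m_h = n_h - 1.
    """
    rng = np.random.default_rng(rng)
    reps = np.empty(B)
    for b in range(B):
        total = 0.0
        for s in strata:
            y = np.asarray(s['y'], dtype=float)
            n_h = len(y)
            m_h = n_h - 1                      # usual Rao-Wu choice
            ybar = y.mean()
            resample = rng.choice(y, size=m_h, replace=True)
            scale = np.sqrt(m_h * (1.0 - s['f']) / (n_h - 1))
            y_tilde = ybar + scale * (resample - ybar)   # rescaled values
            total += s['W'] * y_tilde.mean()
        reps[b] = total
    return reps.var(ddof=1)
```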

5.
We propose a weighted empirical likelihood approach to inference with multiple samples, including stratified sampling, the estimation of a common mean using several independent and non-homogeneous samples and inference on a particular population using other related samples. The weighting scheme and the basic result are motivated and established under stratified sampling. We show that the proposed method can ideally be applied to the common mean problem and problems with related samples. The proposed weighted approach not only provides a unified framework for inference with multiple samples, including two-sample problems, but also facilitates asymptotic derivations and computational methods. A bootstrap procedure is also proposed in conjunction with the weighted approach to provide better coverage probabilities for the weighted empirical likelihood ratio confidence intervals. Simulation studies show that the weighted empirical likelihood confidence intervals perform better than existing ones.

6.
Inverse sampling is an appropriate design for the second phase of capture-recapture experiments and provides an exactly unbiased estimator of the population size. However, the sampling distribution of the resulting estimator tends to be highly right-skewed for small recapture samples, so the traditional Wald-type confidence intervals appear to be inappropriate. The objective of this paper is to study the performance of interval estimators for the population size under inverse recapture sampling without replacement. To this end, we consider the Wald-type, the logarithmic transformation-based, the Wilson score, the likelihood ratio and the exact methods. We also propose some bootstrap confidence intervals for the population size, including the with-replacement bootstrap (BWR), the without-replacement bootstrap (BWO), and the Rao–Wu rescaling method. A Monte Carlo simulation is employed to evaluate the performance of the suggested methods in terms of coverage probability, error rates and standardized average length. Our results show that the likelihood ratio and exact confidence intervals are preferred to the other competitors, having coverage probabilities close to the desired nominal level for any sample size, with more balanced error rates for the exact method and shorter length for the likelihood ratio method. The BWO and Rao–Wu rescaling methods may also provide good intervals in some situations; however, their coverage probabilities are not invariant with respect to the population parameters, so they must be used with care.

7.
Variance estimation under systematic sampling with probability proportional to size is known to be a difficult problem. We attempt to tackle this problem by the bootstrap resampling method. It is shown that the usual way to bootstrap fails to give satisfactory variance estimates. As a remedy, we propose a double bootstrap method which is based on certain working models and involves two levels of resampling. Unlike existing methods which deal exclusively with the Horvitz–Thompson estimator, the double bootstrap method can be used to estimate the variance of any statistic. We illustrate this within the context of both mean and median estimation. Empirical results based on five natural populations are encouraging.

8.
In this paper we present methods for inference on data selected by a complex sampling design for a class of statistical models for the analysis of ordinal variables. Specifically, assuming that the sampling scheme is not ignorable, we derive for the class of cub models (Combination of discrete Uniform and shifted Binomial distributions) variance estimates for a complex two stage stratified sample. Both Taylor linearization and repeated replication variance estimators are presented. We also provide design‐based test diagnostics and goodness‐of‐fit measures. We illustrate by means of real data analysis the differences between survey‐weighted and unweighted point estimates and inferences for cub model parameters.

9.
Under stratified random sampling, we develop a kth-order bootstrap bias-corrected estimator of the number of classes θ which exist in a study region. This research extends Smith and van Belle’s (1984) first-order bootstrap bias-corrected estimator under simple random sampling. Our estimator has applicability for many settings including: estimating the number of animals when there are stratified capture periods, estimating the number of species based on stratified random sampling of subunits (say, quadrats) from the region, and estimating the number of errors/defects in a product based on observations from two or more types of inspectors. When the differences between the strata are large, utilizing stratified random sampling and our estimator often results in superior performance versus the use of simple random sampling and its bootstrap or jackknife [Burnham and Overton (1978)] estimator. The superior performance is often associated with more observed classes, and we provide insights into optimal designation of the strata and optimal allocation of sample sectors to strata.
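A minimal sketch of the first-order version of this idea under stratified random sampling of units; the function name, data layout and resampling scheme are illustrative assumptions (the paper develops a kth-order correction).

```python
import numpy as np

def boot_bias_corrected_richness(strata_units, B=1000, rng=None):
    """First-order bootstrap bias correction of the observed number of classes.

    `strata_units` is a list of strata; each stratum is a list of sets,
    one set of observed class labels per sampled unit (e.g. quadrat).
    """
    rng = np.random.default_rng(rng)
    observed = len(set().union(*[u for s in strata_units for u in s]))
    boot_counts = np.empty(B)
    for b in range(B):
        classes = set()
        for stratum in strata_units:
            n_h = len(stratum)
            idx = rng.integers(0, n_h, size=n_h)   # resample units within stratum
            for i in idx:
                classes |= stratum[i]
        boot_counts[b] = len(classes)
    bias_hat = boot_counts.mean() - observed        # E*[theta*] - theta_hat
    return observed - bias_hat                      # = 2*theta_hat - mean(theta*)
```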

10.
Under simple random (multinomial) sampling the problem of estimating cell proportions for a contingency table subject to marginal constraints has been well explored. We briefly review methods that have been considered; then we develop a general method, for more complicated sampling, which reflects the variance structure of the estimated cell proportions. For stratified and cluster sampling we compare our method against earlier methods for the 2×2 table and find it potentially advantageous.

11.
A weighted linear estimator (WLE) of the parameters of multivariate ARCH models is proposed. The accuracy of the WLE in estimating the parameters of multivariate ARCH models is compared with that of the widely used quasi-maximum likelihood estimator (QMLE) through simulations. Applications to real data sets are also presented, and forecasts of the variance-covariance matrix and value-at-risk (VaR) are obtained. Weighted resampling methods are used to approximate the sampling distribution of the proposed estimator. Our study indicates that the forecasting performance of the WLE is not inferior to that of the QMLE, and its one-day-ahead risk estimates are also found to be better.

12.
In this paper, we consider the problem of robust estimation of the fractional parameter, d, in long memory autoregressive fractionally integrated moving average processes, when two types of outliers, i.e. additive and innovation outliers, are taken into account without knowing their number, position or intensity. The proposed method is a weighted likelihood estimation (WLE) approach, for which the necessary definitions and algorithm are given. Through an extensive Monte Carlo simulation study, we compare the performance of the WLE method with that of both the approximate maximum likelihood estimator (MLE) and the robust M-estimator proposed by Beran (Statistics for Long-Memory Processes, Chapman & Hall, London, 1994). We find that robustness against the two types of outliers considered can be achieved without loss of efficiency. Moreover, as a byproduct of the procedure, we can classify the suspicious observations into different kinds of outliers. Finally, we apply the proposed methodology to the Nile River annual minima time series.

13.
The inverse hypergeometric distribution is of interest in applications of inverse sampling without replacement from a finite population where a binary observation is made on each sampling unit. Thus, sampling is performed by randomly choosing units sequentially one at a time until a specified number of one of the two types is selected for the sample. Assuming the total number of units in the population is known but the number of each type is not, we consider the problem of estimating this parameter. We use the Delta method to develop approximations for the variance of three parameter estimators. We then propose three large sample confidence intervals for the parameter. Based on these results, we selected a sampling of parameter values for the inverse hypergeometric distribution to empirically investigate performance of these estimators. We evaluate their performance in terms of expected probability of parameter coverage and confidence interval length calculated as means of possible outcomes weighted by the appropriate outcome probabilities for each parameter value considered. The unbiased estimator of the parameter is the preferred estimator relative to the maximum likelihood estimator and an estimator based on a negative binomial approximation, as evidenced by empirical estimates of closeness to the true parameter value. Confidence intervals based on the unbiased estimator tend to be shorter than the two competitors because of its relatively small variance but at a slight cost in terms of coverage probability.

14.
The aim of the present study is to determine how the estimation of individual abilities obtained with item response theory (IRT) depends on the degree of test difficulty, and to evaluate how the estimation error may be affected by the estimation method employed. It is shown that abilities in the scale region with little test information are more efficiently estimated using the maximum weighted likelihood estimation (WLE) method, particularly abilities belonging to the upper part of the scale. The study also demonstrates the importance of longer tests for ability estimation.
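For concreteness, a small sketch of weighted likelihood ability estimation for the two-parameter logistic model, where the WLE maximizes the likelihood times the square root of the test information; the function name and optimizer are illustrative assumptions, and item parameters are treated as known.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def wle_ability(responses, b, a=None):
    """Warm-style weighted likelihood estimate of ability under the 2PL model.

    responses : 0/1 item scores; b, a : item difficulties / discriminations.
    """
    x = np.asarray(responses, dtype=float)
    b = np.asarray(b, dtype=float)
    a = np.ones_like(b) if a is None else np.asarray(a, dtype=float)

    def neg_weighted_loglik(theta):
        p = 1.0 / (1.0 + np.exp(-a * (theta - b)))        # item response probabilities
        loglik = np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))
        info = np.sum(a ** 2 * p * (1 - p))               # test information I(theta)
        return -(loglik + 0.5 * np.log(info))             # penalized log-likelihood

    return minimize_scalar(neg_weighted_loglik, bounds=(-6, 6), method='bounded').x
```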

15.
In stratified sampling, the way the sample is drawn within each stratum affects both the precision of the estimator and the cost of the survey, and most existing approaches cannot improve precision and reduce survey cost at the same time. To address this, we combine ranked-set sampling with stratified sampling and propose a stratified sampling scheme aided by ranked-set samples, obtaining an estimator of the population mean and establishing its desirable properties. These results show that, compared with plain stratified random sampling, the estimator under this design has higher precision while also reducing the cost of the sample survey within each stratum.
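A sketch of the resulting estimator in generic notation (assumed here, not necessarily the paper's): with stratum weights $W_h=N_h/N$, set size $k_h$ and $m_h$ cycles in stratum $h$, the stratified ranked-set-sample estimator of the population mean is

$$\hat{\mu}=\sum_{h=1}^{L}W_h\,\bar{y}_{h,\mathrm{RSS}},\qquad \bar{y}_{h,\mathrm{RSS}}=\frac{1}{m_h k_h}\sum_{j=1}^{m_h}\sum_{r=1}^{k_h}y_{h[r]j},$$

where $y_{h[r]j}$ is the measured value of the unit judged to have rank $r$ in cycle $j$ of stratum $h$.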

16.
The estimation of the variance for the GREG (general regression) estimator by weighted residuals is widely accepted as a method which yields estimators with good conditional properties. Since the optimal (regression) estimator shares the properties of GREG estimators which are used in the construction of weighted variance estimators, we introduce the weighting procedure also for estimating the variance of the optimal estimator. This method of variance estimation was originally presented in a seemingly ad hoc manner, and we shall discuss it from a conditional point of view and also look at an alternative way of utilizing the weights. Examples that stress conditional behaviour of estimators are then given for elementary sampling designs such as simple random sampling, stratified simple random sampling and Poisson sampling, where for the latter design we have conducted a small simulation study.
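For reference, the weighted-residual (g-weighted) variance estimator for the GREG estimator of a total is commonly written, in generic textbook notation, as

$$\hat{V}\bigl(\hat{t}_{\mathrm{GREG}}\bigr)=\sum_{k\in s}\sum_{l\in s}\frac{\pi_{kl}-\pi_k\pi_l}{\pi_{kl}}\,\frac{g_k e_k}{\pi_k}\,\frac{g_l e_l}{\pi_l},\qquad e_k=y_k-\mathbf{x}_k^{\top}\hat{\mathbf{B}},$$

where the $g_k$ are the calibration adjustments of the design weights; the paper applies the same weighting device to the optimal estimator.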

17.
Outcome-dependent sampling increases the efficiency of studies of rare outcomes, examples being case-control studies in epidemiology and choice-based sampling in econometrics. Two-phase or double sampling is a standard technique for drawing efficient stratified samples. We develop maximum likelihood estimation of logistic regression coefficients for a hybrid two-phase, outcome-dependent sampling design. An algorithm is given for determining the estimates by repeated fitting of ordinary logistic regression models. Simulation results demonstrate the efficiency loss associated with alternative pseudolikelihood and weighted likelihood methods for certain data configurations. These results provide an efficient solution to the measurement error problem with validation sampling based on a discrete surrogate.
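As a point of reference for the weighted likelihood comparator mentioned above (not the paper's maximum likelihood algorithm), a minimal inverse-probability-weighted logistic fit might look like the following; the function name and data layout are assumptions.

```python
import numpy as np
from scipy.optimize import minimize

def ipw_logistic(X, y, pi):
    """Inverse-probability-weighted logistic regression.

    X includes an intercept column; pi are the known second-phase
    inclusion probabilities of the sampled units.
    """
    X, y, w = np.asarray(X, float), np.asarray(y, float), 1.0 / np.asarray(pi, float)

    def neg_weighted_loglik(beta):
        eta = X @ beta
        # Bernoulli-logit log-likelihood, weighted by 1/pi
        return -np.sum(w * (y * eta - np.log1p(np.exp(eta))))

    beta0 = np.zeros(X.shape[1])
    return minimize(neg_weighted_loglik, beta0, method='BFGS').x
```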

18.
Multivariate Logit models are convenient to describe multivariate correlated binary choices as they provide closed-form likelihood functions. However, the computation time required for calculating choice probabilities increases exponentially with the number of choices, which makes maximum likelihood-based estimation infeasible when many choices are considered. To solve this, we propose three novel estimation methods: (i) stratified importance sampling, (ii) composite conditional likelihood (CCL), and (iii) generalized method of moments, which yield consistent estimates and still have similar small-sample bias to maximum likelihood. Our simulation study shows that computation times for CCL are much smaller and that its efficiency loss is small.

19.
There can be gains in estimation efficiency over equal-probability sampling methods when one makes use of auxiliary information through probability proportional to size with replacement (ppswr) sampling. The usual method is simple to execute, but might lead to more than one appearance in the sample for any particular unit. When a suitable size variable x is not available, one may still know how to rank units reasonably well relative to the unknown y values before sample selection. When such ranking is possible, we introduce a simple and efficient sampling plan using the ranks as the unknown x measures of size. The proposed sampling plan is similar to with-replacement sampling, has its simplicity, and has no greater sampling variance, but is without replacement.

20.
Empirical Bayes (EB) estimates in general linear mixed models are useful for small area estimation in the sense of increasing the precision of estimation of small area means. However, one potential difficulty of EB is that the overall estimate for a larger geographical area based on a (weighted) sum of EB estimates is not necessarily identical to the corresponding direct estimate such as the overall sample mean. Another difficulty is that EB estimates yield over-shrinking, which results in a sampling variance smaller than the posterior variance. One way to fix these problems is the benchmarking approach based on constrained empirical Bayes (CEB) estimators, which satisfy the constraints that the aggregated mean and variance are identical to the requested values of the mean and variance. In this paper, we treat general mixed models, derive asymptotic approximations of the mean squared error (MSE) of CEB and provide second-order unbiased estimators of the MSE based on the parametric bootstrap method. These results are applied to natural exponential families with quadratic variance functions. As a specific example, the Poisson-gamma model is dealt with, and it is illustrated that the CEB estimates and their MSE estimates work well through real mortality data.
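One simple way to impose the two benchmarking constraints is a location-scale adjustment of the EB estimates; this is a generic sketch under assumed inputs, not the paper's derivation.

```python
import numpy as np

def constrained_eb(theta_eb, w, target_mean, target_var):
    """Adjust EB estimates so their weighted mean and weighted variance
    match requested benchmark values.

    theta_eb : EB estimates of the small area means.
    w        : aggregation weights, assumed to sum to one.
    """
    theta_eb = np.asarray(theta_eb, dtype=float)
    w = np.asarray(w, dtype=float)
    mean_eb = np.sum(w * theta_eb)
    var_eb = np.sum(w * (theta_eb - mean_eb) ** 2)
    scale = np.sqrt(target_var / var_eb)     # stretch to counter over-shrinking
    return target_mean + scale * (theta_eb - mean_eb)
```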
