首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Publication of indexes measuring changes in prices of retail, wholesale, export, and import items is an important part of many governmental statistics programs. One form of price index that is often used is the fixed-base Laspeyres, in which a fixed market basket of goods is priced over time. This article introduces a new class of multiplicative estimators of Laspeyres indexes. The optimum within the class is derived for long-term price change and compared with two other members of the class when used for estimating both long-term and short-term change. Theoretical properties are derived under a model in which long-term relative price changes for individual items have common within-stratum means and are correlated over time. Theory for long-term and short-term change estimators is tested in a simulation study in which a large number of stratified probability samples is selected from a population extracted from items priced for the U.S. consumer price index.  相似文献   

2.
Estimators of chain and fixed-base Laspeyres price indexes are studied using the prediction approach to finite population sampling. The estimators include some that are based on those used in several U.S. government index programs and others derived from prediction models. Biases and variances of the estimators are studied for a case in which the reference period index weights are unknown for nonsample items. Under a model for a one-period price change in which items have common within-stratum means, unbiased estimators can be constructed, but under a more general regression model, special sample balance conditions are needed for unbiasedness of those estimators. The theory for the estimators of fixed-base indexes is illustrated in an empirical study using a population of items priced for the U.S. Consumer Price Index.  相似文献   

3.
When multilevel models are estimated from survey data derived using multistage sampling, unequal selection probabilities at any stage of sampling may induce bias in standard estimators, unless the sources of the unequal probabilities are fully controlled for in the covariates. This paper proposes alternative ways of weighting the estimation of a two-level model by using the reciprocals of the selection probabilities at each stage of sampling. Consistent estimators are obtained when both the sample number of level 2 units and the sample number of level 1 units within sampled level 2 units increase. Scaling of the weights is proposed to improve the properties of the estimators and to simplify computation. Variance estimators are also proposed. In a limited simulation study the scaled weighted estimators are found to perform well, although non-negligible bias starts to arise for informative designs when the sample number of level 1 units becomes small. The variance estimators perform extremely well. The procedures are illustrated using data from the survey of psychiatric morbidity.  相似文献   

4.
In a longitudinal study, an individual is followed up over a period of time. Repeated measurements on the response and some time-dependent covariates are taken at a series of sampling times. The sampling times are often irregular and depend on covariates. In this paper, we propose a sampling adjusted procedure for the estimation of the proportional mean model without having to specify a sampling model. Unlike existing procedures, the proposed method is robust to model misspecification of the sampling times. Large sample properties are investigated for the estimators of both regression coefficients and the baseline function. We show that the proposed estimation procedure is more efficient than the existing procedures. Large sample confidence intervals for the baseline function are also constructed by perturbing the estimation equations. A simulation study is conducted to examine the finite sample properties of the proposed estimators and to compare with some of the existing procedures. The method is illustrated with a data set from a recurrent bladder cancer study.  相似文献   

5.
This research focuses on the estimation of tumor incidence rates from long-term animal studies which incorporate interim sacrifices. A nonparametric stochastic model is described with transition rates between states corresponding to the tumor incidence rate, the overall death rate, and the death rate for tumor-free animals. Exact analytic solutions for the maximum likelihood estimators of the hazard rates are presented, and their application to data from a long-term animal study is illustrated by an example. Unlike many common methods for estimation and comparison of tumor incidence rates among treatment groups, the estimators derived in this paper require no assumptions regarding tumor lethality or treatment lethality. The small sample operating characteristics of these estimators are evaluated using Monte Carlo simulation studies.  相似文献   

6.
Neoteric ranked set sampling (NRSS) is a recently developed sampling plan, derived from the well-known ranked set sampling (RSS) scheme. It has already been proved that NRSS provides more efficient estimators for population mean and variance compared to RSS and other sampling designs based on ranked sets. In this work, we propose and evaluate the performance of some two-stage sampling designs based on NRSS. Five different sampling schemes are proposed. Through an extensive Monte Carlo simulation study, we verified that all proposed sampling designs outperform RSS, NRSS, and the original double RSS design, producing estimators for the population mean with a lower mean square error. Furthermore, as with NRSS, two-stage NRSS estimators present some bias for asymmetric distributions. We complement the study with a discussion on the relative performance of the proposed estimators. Moreover, an additional simulation based on data of the diameter and height of pine trees is presented.  相似文献   

7.
We consider the problem of the estimation of the population mean of a study variable by assuming that the population means of an auxiliary variable are known at both stages of sample selection. The design weights at the first and second stages of sample selection are calibrated by optimizing the chi-squared type distance between the design weights and the new weights at both the first and second stages of sample selection. The regression type estimator in two-stage sampling is shown to be a special case. An application of the proposed estimators using a real data set is discussed.  相似文献   

8.
Summary.  In sample surveys of finite populations, subpopulations for which the sample size is too small for estimation of adequate precision are referred to as small domains. Demand for small domain estimates has been growing in recent years among users of survey data. We explore the possibility of enhancing the precision of domain estimators by combining comparable information collected in multiple surveys of the same population. For this, we propose a regression method of estimation that is essentially an extended calibration procedure whereby comparable domain estimates from the various surveys are calibrated to each other. We show through analytic results and an empirical study that this method may greatly improve the precision of domain estimators for the variables that are common to these surveys, as these estimators make effective use of increased sample size for the common survey items. The design-based direct estimators proposed involve only domain-specific data on the variables of interest. This is in contrast with small domain (mostly small area) indirect estimators, based on a single survey, which incorporate through modelling data that are external to the targeted small domains. The approach proposed is also highly effective in handling the closely related problem of estimation for rare population characteristics.  相似文献   

9.
A model involving autocorrelated random effects and sampling errors is proposed for small-area estimation, using both time-series and cross-sectional data. The sampling errors are assumed to have a known block-diagonal covariance matrix. This model is an extension of a well-known model, due to Fay and Herriot (1979), for cross-sectional data. A two-stage estimator of a small-area mean for the current period is obtained under the proposed model with known autocorrelation, by first deriving the best linear unbiased prediction estimator assuming known variance components, and then replacing them with their consistent estimators. Extending the approach of Prasad and Rao (1986, 1990) for the Fay-Herriot model, an estimator of mean squared error (MSE) of the two-stage estimator, correct to a second-order approximation for a small or moderate number of time points, T, and a large number of small areas, m, is obtained. The case of unknown autocorrelation is also considered. Limited simulation results on the efficiency of two-stage estimators and the accuracy of the proposed estimator of MSE are présentés.  相似文献   

10.
Abstract

In a quantitative linear model with errors following a stationary Gaussian, first-order autoregressive or AR(1) process, Generalized Least Squares (GLS) on raw data and Ordinary Least Squares (OLS) on prewhitened data are efficient methods of estimation of the slope parameters when the autocorrelation parameter of the error AR(1) process, ρ, is known. In practice, ρ is generally unknown. In the so-called two-stage estimation procedures, ρ is then estimated first before using the estimate of ρ to transform the data and estimate the slope parameters by OLS on the transformed data. Different estimators of ρ have been considered in previous studies. In this article, we study nine two-stage estimation procedures for their efficiency in estimating the slope parameters. Six of them (i.e., three noniterative, three iterative) are based on three estimators of ρ that have been considered previously. Two more (i.e., one noniterative, one iterative) are based on a new estimator of ρ that we propose: it is provided by the sample autocorrelation coefficient of the OLS residuals at lag 1, denoted r(1). Lastly, REstricted Maximum Likelihood (REML) represents a different type of two-stage estimation procedure whose efficiency has not been compared to the others yet. We also study the validity of the testing procedures derived from GLS and the nine two-stage estimation procedures. Efficiency and validity are analyzed in a Monte Carlo study. Three types of explanatory variable x in a simple quantitative linear model with AR(1) errors are considered in the time domain: Case 1, x is fixed; Case 2, x is purely random; and Case 3, x follows an AR(1) process with the same autocorrelation parameter value as the error AR(1) process. In a preliminary step, the number of inadmissible estimates and the efficiency of the different estimators of ρ are compared empirically, whereas their approximate expected value in finite samples and their asymptotic variance are derived theoretically. Thereafter, the efficiency of the estimation procedures and the validity of the derived testing procedures are discussed in terms of the sample size and the magnitude and sign of ρ. The noniterative two-stage estimation procedure based on the new estimator of ρ is shown to be more efficient for moderate values of ρ at small sample sizes. With the exception of small sample sizes, REML and its derived F-test perform the best overall. The asymptotic equivalence of two-stage estimation procedures, besides REML, is observed empirically. Differences related to the nature, fixed or random (uncorrelated or autocorrelated), of the explanatory variable are also discussed.  相似文献   

11.
Synthetic and composite estimation under a superpopulation model   总被引:1,自引:1,他引:0  
Under a simple superpopulation model for an arbitrary sampling design we derive optimal linear unbiased estimators/predictors of a mean in a domain. They can be viewed as synthetic and composite estimators of small area estimation theory when no auxiliary variable is available. Moreover, we show that the only requirement for optimality of a sampling strategy is to use any sampling plan of fixed sample size together with traditional estimators (as designed for simple random sampling without replacement). Finally, for symmetric sampling plans, simplified formulas (based on the first two moments of sample sizes) for optimal synthetic and composite estimators and their MSE’s are derived. Throughout the paper we consistently use the model-design setup.  相似文献   

12.
We suggest five types of two-stage James-Stein type estimators of the mean vector μ based on prior knowledge about μ and two-stage sampling scheme proposed by Waikar and Katti(1971) Their risks are evaluated and calculated to compare with two-stage estimator suggested by Waikar and Katti(1971) when the prior form of an initial estimate of μ is 0. We find that the five estimators suggested here all have high efficiencies in large dimensions and/or in large value of ratio of two sample sizes at each stage.  相似文献   

13.
This paper investigates estimation of parameters in a combination of the multivariate linear model and growth curve model, called a generalized GMANOVA model. Making analogy between the outer product of data vectors and covariance yields an approach to directly do least squares to covariance. An outer product least squares estimator of covariance (COPLS estimator) is obtained and its distribution is presented if a normal assumption is imposed on the error matrix. Based on the COPLS estimator, two-stage generalized least squares estimators of the regression coefficients are derived. In addition, asymptotic normalities of these estimators are investigated. Simulation studies have shown that the COPLS estimator and two-stage GLS estimators are alternative competitors with more efficiency in the sense of sample mean, standard deviations and mean of the variance estimates to the existing ML estimator in finite samples. An example of application is also illustrated.  相似文献   

14.
In the health and social sciences, researchers often encounter categorical data for which complexities come from a nested hierarchy and/or cross-classification for the sampling structure. A common feature of these studies is a non-standard data structure with repeated measurements which may have some degree of clustering. In this paper, methodology is presented for the joint estimation of quantities of interest in the context of a stratified two-stage sample with bivariate dichotomous data. These quantities are the mean value π of an observed dichotomous response for a certain condition or time-point and a set of correlation coefficients for intra-cluster association for each condition or time period and for inter-condition correlation within and among clusters. The methodology uses the cluster means and pairwise joint probability parameters from each cluster. They together provide appropriate information across clusters for the estimation of the correlation coefficients.  相似文献   

15.
Lee and Wolfe (Biometrics vol. 54 pp. 1176–1178, 1998) proposed the two-stage sampling design for testing the assumption of independent censoring, which involves further follow-up of a subset of lost-to-follow-up censored subjects. They also proposed an adjusted estimator for the survivor function for a proportional hazards model under the dependent censoring model. In this paper, a new estimator for the survivor function is proposed for the semi-Markov model under the dependent censorship on the basis of the two-stage sampling data. The consistency and the asymptotic distribution of the proposed estimator are derived. The estimation procedure is illustrated with an example of lung cancer clinical trial and simulation results are reported of the mean squared errors of estimators under a proportional hazards and two different nonproportional hazards models.  相似文献   

16.
As an alternative to an estimation based on a simple random sample (BLUE-SRS) for the simple linear regression model, Moussa-Hamouda and Leone [E. Moussa-Hamouda and F.C. Leone, The o-blue estimators for complete and censored samples in linear regression, Technometrics, 16 (3) (1974), pp. 441–446.] discussed the best linear unbiased estimators based on order statistics (BLUE-OS), and showed that BLUE-OS is more efficient than BLUE-SRS for normal data. Using the ranked set sampling, Barreto and Barnett [M.C.M. Barreto and V. Barnett, Best linear unbiased estimators for the simple linear regression model using ranked set sampling. Environ. Ecoll. Stat. 6 (1999), pp. 119–133.] derived the best linear unbiased estimators (BLUE-RSS) for simple linear regression model and showed that BLUE-RSS is more efficient for the estimation of the regression parameters (intercept and slope) than BLUE-SRS for normal data, but not so for the estimation of the residual standard deviation in the case of small sample size. As an alternative to RSS, this paper considers the best linear unbiased estimators based on order statistics from a ranked set sample (BLUE-ORSS) and shows that BLUE-ORSS is uniformly more efficient than BLUE-RSS and BLUE-OS for normal data.  相似文献   

17.
Adaptive cluster sampling is an efficient method of estimating the parameters of rare and clustered populations. The method mimics how biologists would like to collect data in the field by targeting survey effort to localised areas where the rare population occurs. Another popular sampling design is inverse sampling. Inverse sampling was developed so as to be able to obtain a sample of rare events having a predetermined size. Ideally, in inverse sampling, the resultant sample set will be sufficiently large to ensure reliable estimation of population parameters. In an effort to combine the good properties of these two designs, adaptive cluster sampling and inverse sampling, we introduce inverse adaptive cluster sampling with unequal selection probabilities. We develop an unbiased estimator of the population total that is applicable to data obtained from such designs. We also develop numerical approximations to this estimator. The efficiency of the estimators that we introduce is investigated through simulation studies based on two real populations: crabs in Al Khor, Qatar and arsenic pollution in Kurdistan, Iran. The simulation results show that our estimators are efficient.  相似文献   

18.
The estimation of the finite population mean in successive occasions is investigated with calibration estimators in this article. We propose several estimators based on calibration techniques with arbitrary sampling design in each of the occasions. Asymptotic variance formulaes are derived for the proposed estimators. The properties of these estimators are studied via a simulation study and using natural populations.  相似文献   

19.
Summary.  The jackknife method is often used for variance estimation in sample surveys but has only been developed for a limited class of sampling designs. We propose a jackknife variance estimator which is defined for any without-replacement unequal probability sampling design. We demonstrate design consistency of this estimator for a broad class of point estimators. A Monte Carlo study shows how the proposed estimator may improve on existing estimators.  相似文献   

20.
In surveys of natural resources in agriculture, ecology, fisheries, forestry, environmental management, etc., cost-effective sampling methods are of major concern. In this paper, we propose a two-stage cluster sampling (TSCS) in integration with the hybrid ranked set sampling (HRSS)—named TSCS-HRSS—in the second stage of sampling for estimating the population mean. The TSCS-HRSS scheme encompasses several existing ranked set sampling (RSS) schemes and may help in selecting a smaller number of units to rank. It is shown both theoretically and numerically that the TSCS-HRSS provides an unbiased estimator of the population mean and it is more precise than the mean estimators based on TSCS with SRS and RSS schemes. An unbiased estimator of the variance of the proposed mean estimator is also derived. A similar trend is observed when studying the impact of imperfect rankings on the performance of the TSCS-HRSS based mean estimator.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号