首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Numerous optimization problems arise in survey designs. The problem of obtaining an optimal (or near optimal) sampling design can be formulated and solved as a mathematical programming problem. In multivariate stratified sample surveys usually it is not possible to use the individual optimum allocations for sample sizes to various strata for one reason or another. In such situations some criterion is needed to work out an allocation which is optimum for all characteristics in some sense. Such an allocation may be called an optimum compromise allocation. This paper examines the problem of determining an optimum compromise allocation in multivariate stratified random sampling, when the population means of several characteristics are to be estimated. Formulating the problem of allocation as an all integer nonlinear programming problem, the paper develops a solution procedure using a dynamic programming technique. The compromise allocation discussed is optimal in the sense that it minimizes a weighted sum of the sampling variances of the estimates of the population means of various characteristics under study. A numerical example illustrates the solution procedure and shows how it compares with Cochran's average allocation and proportional allocation.  相似文献   

2.
Sampling has evolved into a universally accepted approach for gathering information and data mining as it is widely accepted that a reasonably modest-sized sample can sufficiently characterize a much larger population. In stratified sampling designs, the whole population is divided into homogeneous strata in order to achieve higher precision in the estimation. This paper proposes an efficient method of constructing optimum stratum boundaries (OSB) and determining optimum sample size (OSS) for the survey variable. The survey variable may not be available in practice since the variable of interest is unavailable prior to conducting the survey. Thus, the method is based on the auxiliary variable which is usually readily available from past surveys. To illustrate the application as an example using a real data, the auxiliary variable considered for this problem follows Weibull distribution. The stratification problem is formulated as a Mathematical Programming Problem (MPP) that seeks minimization of the variance of the estimated population parameter under Neyman allocation. The solution procedure employs the dynamic programming technique, which results in substantial gains in the precision of the estimates of the population characteristics.  相似文献   

3.
In many environmental sampling situations, the variable of interest is either not easily observable or is too expensive to observe. Under such circumstances, the need arises to observe another variable, related to the variable of interest, so as to estimate the population parameters of interest. We study the performance of two different sampling procedures, i.e. ranked set sampling and stratified simple random sampling, when both stratification and ranking are accomplished on the basis of such a concomitant variable. The relative precision of the two methods is obtained and expressed as a function of population variance, between-stratum and between-rank variation, and the correlation coefficient between the variable of interest and the concomitant variable. The relative precision is computed for several important families of distributions that occur frequently in environmental and ecological work. Under equal allocation of sampling units, stratified simple random sampling is found to perform better than ranked set sampling, when the costs incurred to obtain sample measurements are ignored. When optimum allocation is considered for both methods, ranked set sampling performs better than stratified simple random sampling, when the concomitant variable is not highly correlated with the variable of interest. Furthermore, when the costs of sampling and the costs of measurement are incorporated into the assessment of the relative precision, the ranked set sampling is seen to be more efficient than stratified simple random sampling, particularly when the cost of stratification is high compared with that of ranking. This is generally the case in practice.  相似文献   

4.
In many environmental sampling situations, the variable of interest is either not easily observable or is too expensive to observe. Under such circumstances, the need arises to observe another variable, related to the variable of interest, so as to estimate the population parameters of interest. We study the performance of two different sampling procedures, i.e. ranked set sampling and stratified simple random sampling, when both stratification and ranking are accomplished on the basis of such a concomitant variable. The relative precision of the two methods is obtained and expressed as a function of population variance, between-stratum and between-rank variation, and the correlation coefficient between the variable of interest and the concomitant variable. The relative precision is computed for several important families of distributions that occur frequently in environmental and ecological work. Under equal allocation of sampling units, stratified simple random sampling is found to perform better than ranked set sampling, when the costs incurred to obtain sample measurements are ignored. When optimum allocation is considered for both methods, ranked set sampling performs better than stratified simple random sampling, when the concomitant variable is not highly correlated with the variable of interest. Furthermore, when the costs of sampling and the costs of measurement are incorporated into the assessment of the relative precision, the ranked set sampling is seen to be more efficient than stratified simple random sampling, particularly when the cost of stratification is high compared with that of ranking. This is generally the case in practice.  相似文献   

5.
In this article, a choice of the optimum sampling design to study a finite population is studied. Three sampling schemes are compared, viz., Sunter's procedure of unequal probability sampling, stratified sampling under optimum stratification, and simple random sampling without replacement. The comparison is made against a background of various correlation between stratification and survey variables and various variability in the variables. Under weak correlation and large variability, stratification appeared to be more efficient than Sunter's procedure. Under strong correlation and/or low variability in the variables, the latter procedure was the most efficient. Simple random sampling was usually the least efficient.  相似文献   

6.
In stratified sample surveys, the problem of determining the optimum allocation is well known due to articles published in 1923 by Tschuprow and in 1934 by Neyman. The articles suggest the optimum sample sizes to be selected from each stratum for which sampling variance of the estimator is minimum for fixed total cost of the survey or the cost is minimum for a fixed precision of the estimator. If in a sample survey more than one characteristic is to be measured on each selected unit of the sample, that is, the survey is a multi-response survey, then the problem of determining the optimum sample sizes to various strata becomes more complex because of the non-availability of a single optimality criterion that suits all the characteristics. Many authors discussed compromise criterion that provides a compromise allocation, which is optimum for all characteristics, at least in some sense. Almost all of these authors worked out the compromise allocation by minimizing some function of the sampling variances of the estimators under a single cost constraint. A serious objection to this approach is that the variances are not unit free so that minimizing any function of variances may not be an appropriate objective to obtain a compromise allocation. This fact suggests the use of coefficient of variations instead of variances. In the present article, the problem of compromise allocation is formulated as a multi-objective non-linear programming problem. By linearizing the non-linear objective functions at their individual optima, the problem is approximated to an integer linear programming problem. Goal programming technique is then used to obtain a solution to the approximated problem.  相似文献   

7.
This article presents results concerning the significance of sample leads and illustrates their uses. They enable the analysis of one party over another in political polls, one product over another in market research surveys, market share in industrial studies, etc. They enable the calculation of confidence intervals and required sample size, testing of hypotheses, etc. Sampling from infinite and finite populations with overlapping classifications, stratified sampling, optimum allocation to strata, etc., are considered. Derivation of the results is given in a set of Appendices. Some of these are rather complicated, but their uses are straightforward.  相似文献   

8.
In stratified sampling when strata weights are unknown a double sampling technique may be used to estimate them. A large simple random sample from the unstratified population is drawn and units falling in each stratum are recorded. A stratified random sample is then selected and simple random subsamples are obtained out of the previously selected units of the strata. This procedure is called double sampling for stratification. If the problem of non-response is there, then subsamples are divided into classes of respondents and non-respondents. A second subsample is then obtained out of the non-respondents and an attempt is made to obtain the information by increasing efforts, persuasion and call backs. In this paper, the problem of obtaining a compromise allocation in multivariate stratified random sampling is discussed when strata weights are unknown and non-response is present. The problem turns out to be a multiobjective non-linear integer programming problem. An approximation of the problem to an integer linear programming problem by linearizing the non-linear objective functions at their individual optima is worked out. Chebyshev's goal programming technique is then used to solve the approximated problem. A numerical example is also presented to exhibit the practical application of the developed procedure.  相似文献   

9.
In multivariate cases, usually the minimization of sampling variances is considered as an objective under a cost constraint. Since the variances are not unit free, it is more logical to consider the minimization of the squared coefficients of variation as an objective. In this paper, the problem of optimum compromise allocation in multivariate stratified sampling in the case of non-response as a multi-objective all-integer nonlinear programming problem is described. A solution procedure using four different approaches is considered, namely the value function, goal programming,∈-constraint and distance based, to obtain the compromise allocation for non-response. A numerical example is also presented to illustrate the computational details.  相似文献   

10.
In previous papers the problem of estimating the Gini-Simpson index of diversity for large populations has been considered by using random samplings with and without replacement, Nevertheless, the populations to which this estimation is usually applied (e.g., anthropoiogicai, ecological, linguistic and sociological populations) often arise naturally stratified.

In this paper we first construct unbiased estimators of the Gini-Simpson index from a sample drawn according to a stratified sampling with proportional allocation and independently in different strata. Then, we determine the standard error of such estimators. The advantages of the stratification in estimating diversity are later confirmed by means of a practical example. We finally suggest complementary studies that could be additionally developed.  相似文献   

11.
Corrigendum     
The articles presents results concerningthe significances of the sample lead and illustrates their uses. They apply to the sample lead of one party over another in political polls, one productover another in market research surveys, share of the market of one company over another in industrial studies, one class over another in social investigations, one programme over another in TV viewing surveys, etc. They enable calculation of confidence intervals and required sample sizes, testing of hypotheses, etc. They deal with sampling from finite and infinite populations, with and without replacement, sampling from population with overlapping classifications, stratified sampling, optimum allocation to strata, etc. Derivation of the results is given in the appendices. Some of the proofs are rather complicated, but the final results are quite simple and easy to use.  相似文献   

12.
Statistical simulation in survey statistics is usually based on repeatedly drawing samples from population data. Furthermore, population data may be used in courses on survey statistics to explain issues regarding, e.g., sampling designs. Since the availability of real population data is in general very limited, it is necessary to generate synthetic data for such applications. The simulated data need to be as realistic as possible, while at the same time ensuring data confidentiality. This paper proposes a method for generating close-to-reality population data for complex household surveys. The procedure consists of four steps for setting up the household structure, simulating categorical variables, simulating continuous variables and splitting continuous variables into different components. It is not required to perform all four steps so that the framework is applicable to a broad class of surveys. In addition, the proposed method is evaluated in an application to the European Union Statistics on Income and Living Conditions (EU-SILC).  相似文献   

13.
In stratified random sampling, it is generally recognised that nonproportional allocation is worthwhile only if the gain in precision is substantial. This note presents a sharp lower bound for the relative precision of proportional to optimum (Neyman) allocation, in terms of the ratio of the largest to the smallest stratum standard deviations. This provides a quick measure of the efficiency of proportional allocation, and may be used as a formal basis for deriving useful practical rules. In particular, it is formally confirmed that for estimating a proportion nonproportional allocation is rarely worthwhile.  相似文献   

14.
Two-phase stratified sampling is used to select subjects for the collection of additional data, e.g. validation data in measurement error problems. Stratification jointly by outcome and covariates, with sampling fractions chosen to achieve approximately equal numbers per stratum at the second phase of sampling, enhances efficiency compared with stratification based on the outcome or covariates alone. Nonparametric maximum likelihood may result in substantially more efficient estimates of logistic regression coefficients than weighted or pseudolikelihood procedures. Software to implement all three procedures is available. We demonstrate the practical importance of these design and analysis principles by an analysis of, and simulations based on, data from the US National Wilms Tumor Study.  相似文献   

15.
徐映梅  杨延飞 《统计研究》2019,36(5):100-119
本文基于超总体模型研究抽样调查中设计效应的计算问题。首先以随机效应模型为基础,明确了简单随机、二阶段、不等概率和分层抽样对应的超总体模型,进而通过所给模型推导出分层、类集、加权单因素设计效应的计算公式和多因素组合的设计效应计算公式并给出了对应估计量,公式表明:多因素同时存在的组合设计效应等于对应单因素设计效应的乘积。最后,对设计效应的理论值、估计值和真实值之间的关系进行了蒙特卡洛仿真,并利用相对偏倚、相对均方误进行了评价。本文的研究,对复杂抽样设计中正确计算、使用设计效应具有指导意义。  相似文献   

16.
In recent years, calibration estimation has become an important field of research in survey sampling. This paper proposes a new calibration estimator for the population mean in the presence of two auxiliary variables in stratified sampling. The theory of new calibration estimator is given and optimum calibration weights are derived. A simulation study is carried out to performance of the proposed calibration estimator over other existing calibration estimators. The results reveal that the proposed calibration estimators are more efficient than other existing calibration estimators in stratified sampling.  相似文献   

17.
Allocation of samples in stratified and/or multistage sampling is one of the central issues of sampling theory. In a survey of a population often the constraints for precision of estimators of subpopulations parameters have to be taken care of during the allocation of the sample. Such issues are often solved with mathematical programming procedures. In many situations it is desirable to allocate the sample, in a way which forces the precision of estimates at the subpopulations level to be both: optimal and identical, while the constraints of the total (expected) size of the sample (or samples, in two-stage sampling) are imposed. Here our main concern is related to two-stage sampling schemes. We show that such problem in a wide class of sampling plans has an elegant mathematical and computational solution. This is done due to a suitable definition of the optimization problem, which enables to solve it through a linear algebra setting involving eigenvalues and eigenvectors of matrices defined in terms of some population quantities. As a final result, we obtain a very simple and relatively universal method for calculating the subpopulation optimal and equal-precision allocation which is based on one of the most standard algorithms of linear algebra (available, e.g., in R software). Theoretical solutions are illustrated through a numerical example based on the Labour Force Survey. Finally, we would like to stress that the method we describe allows to accommodate quite automatically for different levels of precision priority for subpopulations.  相似文献   

18.
Bryant, Hartley & Jessen (1960) presented a two‐way stratification sampling design when the sample size n is less than the number of strata. Their design was extended to a three‐way stratification case by Chaudhary & Kumar (1988) , but this design does not take into account serial correlation, which might be present as a result of the presence of a time variable. In this paper, a new sampling procedure is presented for three‐way stratification when one of the stratifying variables is time. The purpose of such a design is to take into account serial correlation. The variance of the unweighted estimator of the population mean with respect to a super population model is used as the basis for comparison. Simulation results show that the suggested design is more efficient than the Chaudhary & Kumar (1988) design.  相似文献   

19.
This study proposes a more efficient calibration estimator for estimating population mean in stratified double sampling using new calibration weights. The variance of the proposed calibration estimator has been derived under large sample approximation. Calibration asymptotic optimum estimator and its approximate variance estimator are derived for the proposed calibration estimator and existing calibration estimators in stratified double sampling. Analytical results showed that the proposed calibration estimator is more efficient than existing members of its class in stratified double sampling. Analysis and evaluation are presented.  相似文献   

20.
Various methods exist in the literature for achieving marginal balance for baseline stratification variables in sequential clinical trials. One major limitation with balancing on the margins of the stratification variables is that there is an efficiency loss when the primary analysis is stratified. To preserve the efficiency of a stratified analysis one recently proposed approach balances on the crossing of the stratification variables included in the analysis, which achieves conditional balance for the variables. A hybrid approach to achieving both marginal and conditional balances in sequential clinical trials is proposed, which is applicable to both continuous and categorical stratification variables. Numerical results based on extensive simulation studies and a real dataset show that the proposed approach outperforms the existing ones and is particularly useful when both additive and stratified models are planned for a trial. Copyright © 2013 John Wiley & Sons, Ltd.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号