首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到19条相似文献,搜索用时 156 毫秒
1.
规下工业抽样调查是社会经济统计调查的重要组成部分,为国民经济核算提供基础数据,而样本代表性直接决定统计推断结果。对企业目录库抽取平衡样本,能够使得样本结构与总体结构相似。平衡样本是指满足如下条件的样本:辅助变量的汉森赫维茨估计等于总体总量真值。平衡抽样设计需要包含丰富辅助信息的完善抽样框,政府统计数据能够为此提供足够的支撑。基于2009年工业企业数据库的实证分析表明,平衡抽样设计对总体总量的估计相对误差很小,特别是估计的均值与总体真值非常接近,近似无偏;与简单随机抽样比较,平衡抽样设计更加有效。  相似文献   

2.
住户调查是我国社会经济统计调查体系的重要组成部分,样本代表性直接决定统计数据质量。多阶段抽样中初级单元的方差对估计的影响是主要的,因此本文结合2010年全国第六次人口普查分县数据,采用平衡抽样设计获取初级单元的代表性样本-平衡样本。对代表性样本的事后评估结果表明,样本结构与总体结构吻合,目标估计的误差很小,说明了本文平衡设计的有效性。  相似文献   

3.
模型辅助方法的思想是基于抽样设计借助于超总体模型获得对总体参数的有效推断.满足辅助变量的HT估计等于总体总量真值的样本被称为平衡样本.对于平衡样本,如果超总体模型的异方差性可以通过辅助变量解释,由此得出最优抽样策略:平衡抽样设计与HT估计结合是最优策略,包含概率正比于模型残差的标准差.  相似文献   

4.
卢山 《统计研究》2005,22(3):53-5
一、问题的提出在抽样调查实践中经常会遇到多目标抽样问题 ,包括多目标变量问题和多目标总体问题。多目标变量问题 ,是指用一套样本估计多个目标变量的总量、均值、比率和比例等估计量时 ,由于各个变量的总体分布不一致 ,导致不同变量的估计量精度 (一般用相对误差表示 )不同 ,而且可能相差很大。在实践中 ,多目标变量问题是一个普遍问题 ,因为任何一项调查都不可能仅仅调查一个指标 (即变量 )。解决多目标变量问题的关键是在抽样设计中选择合适的辅助变量。一般来说 ,在抽样设计中作为分层辅助信息的变量的估计量精度会比其他变量的估计量…  相似文献   

5.
本文在传统统计回归方法的基础上,构建了一种新的特征样本重复抽样回归(FSR)建模方法.该方法是依据变量特征采用机器抽样方法重复抽样,形成多个特征样本,然后对多个样本进行参数估计,形成参数的抽样分布;最后依据抽样分布,在多个优化目标要求下建立最优化模型.FSR方法能够作为社会科学研究中一种通用的建模方法.  相似文献   

6.
随着大数据和网络的不断发展,网络调查越来越广泛,大部分网络调查样本属于非概率样本,难以采用传统的抽样推断理论进行推断,如何解决网络调查样本的推断问题是大数据背景下网络调查发展的迫切需求。本文首次从建模的角度提出了解决该问题的基本思路:一是入样概率的建模推断,可以考虑构建基于机器学习与变量选择的倾向得分模型来估计入样概率推断总体;二是目标变量的建模推断,可以考虑直接对目标变量建立参数、非参数或半参数超总体模型进行估计;三是入样概率与目标变量的双重建模推断,可以考虑进行倾向得分模型与超总体模型的加权估计与混合推断。最后,以基于广义Boosted模型的入样概率建模推断为例演示了具体解决方法。  相似文献   

7.
万舒晨 《统计研究》2021,38(6):116-127
为推动规模以下工业抽样调查工作以及解决当前调查面临的有关问题,本文对抽样设计进行了改进研究。首先,本文对规模以下工业抽样设计演变过程进行系统梳理,总结了现行抽样设计充分利用双重抽样框设计和综合运用三种抽样方法的特点。其次,针对园区层企业密度高的特点,探索结合园区因素改进地域抽样设计,对园区层和非园区层分别抽样,解决调查中面临的非抽样误差问题,并调整辅助变量使其与核心指标相关性均较高,确保抽样推断精度,有效提高抽样调查效率。并以我国东 部某省为例进行实证模拟得到结合园区因素抽样设计对调查工作改进的结论。再次,针对我国各级政府管理需要以及局队业务分工优化调整情况,介绍了规模以下工业样本追加理论和实证应用的主要研究成果。最后,在大数据时代数据来源广泛的背景下,本文在多重抽样框设计以及利用辅助变量提升样本轮换推断精度方面提出了进一步改进抽样设计的思路。  相似文献   

8.
在连续性抽样调查中,利用前期信息和辅助信息可以大大提高估计精度,但是以往的估计量大多假设辅助信息总体均值已知,文章介绍一种在连续性抽样调查中,辅助信息总体均值未知的情况下,通过两阶段抽样,利用轮换样本方法和辅助变量信息,对总体均值进行估计的新的估计方法,并将新提出的估计量与原有的估计量进行比较,发现其精度更高,而且有利于减少调查成本。  相似文献   

9.
采用三阶段抽样设计的月度调查需要实施三个层次的样本轮换,平衡三层次样本轮换模式使得任意一级单元的任意一次轮换各级抽样单元的样本拼配率不变.文章构造了一种新的平衡三层次样本轮换模式.  相似文献   

10.
现在越来越多的调查需要在估计总体目标变量的基础上估计各个层次的目标变量,即多层次调查问题。文章从抽样设计的角度介绍了两种样本追加的方法和利用永久随机数的样本追加方法,这两种方法可以更好地解决多层次调查问题。  相似文献   

11.
A number of multi-variate etiological surveys are analyzed for recurrent sources of bias in balanced and purposive sampling designs. Three nonsampling components emerge that may dominate the total error of a sample survey estimate. Outstanding among these appear to be administrative consideration of cost and convenience which may actually determine a sampling procedure, especially by reliance on voluntary participation, proxy responses and case-finding methods that restrict the sample. Next is lack of comparability of population series that differ on some initial state (as to smoking). Finally, errors are caused by strong beliefs in what results should be. Extensive experience has now shown that it may not be possible to conduct a satisfactory etiological inquiry by use of surveys using nonrandom population samples.  相似文献   

12.
The sampling designs dependent on sample moments of auxiliary variables are well known. Lahiri (Bull Int Stat Inst 33:133–140, 1951) considered a sampling design proportionate to a sample mean of an auxiliary variable. Sing and Srivastava (Biometrika 67(1):205–209, 1980) proposed the sampling design proportionate to a sample variance while Wywiał (J Indian Stat Assoc 37:73–87, 1999) a sampling design proportionate to a sample generalized variance of auxiliary variables. Some other sampling designs dependent on moments of an auxiliary variable were considered e.g. in Wywiał (Some contributions to multivariate methods in, survey sampling. Katowice University of Economics, Katowice, 2003a); Stat Transit 4(5):779–798, 2000) where accuracy of some sampling strategies were compared, too.These sampling designs cannot be useful in the case when there are some censored observations of the auxiliary variable. Moreover, they can be much too sensitive to outliers observations. In these cases the sampling design proportionate to the order statistic of an auxiliary variable can be more useful. That is why such an unequal probability sampling design is proposed here. Its particular cases as well as its conditional version are considered, too. The sampling scheme implementing this sampling design is proposed. The inclusion probabilities of the first and second orders were evaluated. The well known Horvitz–Thompson estimator is taken into account. A ratio estimator dependent on an order statistic is constructed. It is similar to the well known ratio estimator based on the population and sample means. Moreover, it is an unbiased estimator of the population mean when the sample is drawn according to the proposed sampling design dependent on the appropriate order statistic.  相似文献   

13.
成本条件下多目标复合抽样设计   总被引:2,自引:0,他引:2       下载免费PDF全文
王国明  石庆焱 《统计研究》2002,19(11):17-20
一、成本条件下多目标抽样设计的背景及基本思路  众所周知 ,在人口、社会、经济领域的抽样调查实践工作中 ,所涉及的调查指标往往不止一个 ,有的甚至多达数百个。随着我国社会主义市场经济体制的建立和发展 ,人们对这些调查资料可靠程度的要求也越来越高。另外 ,从人力、物力和经费的角度 ,对目前已经开展的有些有关的调查是否能合并进行的问题也提到了议事日程。所以 ,从我国统计调查工作的实际背景来看 ,本文研究的多目标抽样问题有两层含义 :其一是各项调查指标估计精度的控制问题 :如何在总调查费用不变的前提下 ,尽可能使多项调查指…  相似文献   

14.
When auxiliary information is available at the design stage, samples may be selected by means of balanced sampling. The variance of the Horvitz-Thompson estimator is then reduced, since it is approximately given by that of the residuals of the variable of interest on the balancing variables. In this paper, a method for computing optimal inclusion probabilities for balanced sampling on given auxiliary variables is studied. We show that the method formerly suggested by Tillé and Favre (2005) enables the computation of inclusion probabilities that lead to a decrease in variance under some conditions on the set of balancing variables. A disadvantage is that the target optimal inclusion probabilities depend on the variable of interest. If the needed quantities are unknown at the design stage, we propose to use estimates instead (e.g., arising from a previous wave of the survey). A limited simulation study suggests that, under some conditions, our method performs better than the method of Tillé and Favre (2005).  相似文献   

15.
The properties of the estimators of population mean arising from the ratio and product methods of estimation in the context of sample surveys have been analyzed in this paper when the observations on both the study and auxiliary variables are contaminated with measurement errors. The measurement errors in both the variables are also correlated. The properties of the ratio and product estimators along with the sample mean under the influence of measurement errors are derived and studied. The properties of the estimators in finite samples are studied through Monte-Carlo simulation and its findings are reported.  相似文献   

16.
Calibration on the available auxiliary variables is widely used to increase the precision of the estimates of parameters. Singh and Sedory [Two-step calibration of design weights in survey sampling. Commun Stat Theory Methods. 2016;45(12):3510–3523.] considered the problem of calibration of design weights under two-step for single auxiliary variable. For a given sample, design weights and calibrated weights are set proportional to each other, in the first step. While, in the second step, the value of proportionality constant is determined on the basis of objectives of individual investigator/user for, for example, to get minimum mean squared error or reduction of bias. In this paper, we have suggested to use two auxiliary variables for two-step calibration of the design weights and compared the results with single auxiliary variable for different sample sizes based on simulated and real-life data set. The simulated and real-life application results show that two-auxiliary variables based two-step calibration estimator outperforms the estimator under single auxiliary variable in terms of minimum mean squared error.  相似文献   

17.
Jun Shao 《Statistics》2013,47(3-4):203-237
This article reviews the applications of three resampling methods, the jackknife, the balanced repeated replication, and the bootstrap, in sample surveys. The sampling design under consideration is a stratified multistage sampling design. We discuss the implementation of the resampling methods; for example, the construction of balanced repeated replications and approximated balanced repeated replication estimators; four modified bootstrap algorithms to generate bootstrap samples; and three different ways of applying the resampling methods in the presence of imputed missing values. Asymptotic properties of the resampling estimators are discussed for two types of important survey estimators, functions of weighted averages and sample quantiles.  相似文献   

18.
In this article, we present a straightforward Bonferroni approach for determining sample size for estimating the mean vector of a multivariate population under two scenarios: (1) a pre-specified overall confidence level is desired; and (2) a pre-specified confidence level needs to be guaranteed for each individual variable. It is demonstrated that correlation between variables helps reduce the sample size. The formula to calculate the reduced sample size is derived. A binormal example is presented to illustrate the effect of correlation on sample size reduction for various values of the correlation coefficient.  相似文献   

19.
In this paper, for the general non Gaussian spiked population model, where a few fixed eigenvalues of the population covariance matrix are separated from others, we investigate the convergence properties of the eigenvectors of sample covariance matrices corresponding to the spiked population eigenvalues and angle between the population eigenvectors and sample eigenvectors as both the sample size and population size are large.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号