Similar literature
1.
The author considers the problem of finding exactly optimal sampling designs for estimating a second‐order, centered random process on the basis of finitely many observations. The value of the process at an unsampled point is estimated by the best linear unbiased estimator. A weighted integrated mean squared error or the maximum mean squared error is used to measure the performance of the estimator. The author presents a set of necessary and sufficient conditions for a design to be exactly optimal for processes with a product covariance structure. Expansions of these conditions lead to conditions for asymptotic optimality.

2.
In this article, we consider the problem of classifying m independent repeated (multiple) observations coming from the same population under a separate sampling scheme. We derive the asymptotic risk of the proposed NN type classification rule and obtain the upper and lower bounds for it in specific cases in terms of Bayes risk. Using a Monte Carlo simulation study we show that, as m increases, the classification risk decreases.

3.
Hypothesis Testing in Two-Stage Cluster Sampling   (cited by 1 in total; self-citations: 0; citations by others: 1)
Correlated observations often arise in complex sampling schemes such as two-stage cluster sampling. The resulting observations from this sampling scheme usually exhibit certain positive intracluster correlation, as a result of which the standard statistical procedures for testing hypotheses concerning linear combinations of the parameters may lack some of the optimal properties that these possess when the data are uncorrelated. The aim of this paper is to present exact methods for testing these hypotheses by combining within and between cluster information much as in Zhou & Mathew (1993).

4.
The efficiency of schemes for sampling an alternating Poisson process (0,1 observations) is evaluated by the inverse ratio of the variance of the proportion estimate, p, to the binomial variance. The variance ratio presented by D.R. Cox (in Renewal Theory) for fixed interval sampling is generalized to accommodate random sampling and random sampling after a time delay equal to a fixed proportion, γ, of the mean time between observations, δ. The result is a sampling design tool that quantifies the effect of various spacings between observations and of fixed vs. random sampling. Direct application is made to the field of work sampling.

5.
The main focus of agricultural, ecological and environmental studies is to develop well designed, cost-effective and efficient sampling designs. Ranked set sampling (RSS) is one method that helps accomplish such objectives by incorporating expert knowledge to its advantage. In this paper, we propose an efficient sampling scheme, named mixed RSS (MxRSS), for estimation of the population mean and median. The MxRSS scheme is a suitable mixture of both simple random sampling (SRS) and RSS schemes. The MxRSS scheme provides an unbiased estimator of the population mean, and its variance is always less than the variance of the sample mean based on SRS. For both symmetric and asymmetric populations, the mean and median estimators based on SRS, partial RSS (PRSS) and MxRSS schemes are compared. It turns out that the mean and median estimates under the MxRSS scheme are more precise than those based on the SRS scheme. Moreover, when estimating the mean of symmetric and some asymmetric populations, the mean estimates under the MxRSS scheme are found to be more efficient than the mean estimates with the PRSS scheme. An application to real data is also provided to illustrate the implementation of the proposed sampling scheme.
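
The efficiency claim for ranked-set designs can be illustrated with a small simulation. The sketch below is not the MxRSS scheme of the abstract; it implements plain balanced RSS with perfect ranking and compares it with SRS at the same number of measured units. The function names, the normal population, the set size and the number of cycles are all assumptions chosen for illustration.

```python
import random
import statistics


def srs_mean(population, n, rng):
    """Mean of a simple random sample of size n drawn without replacement."""
    return statistics.fmean(rng.sample(population, n))


def rss_mean(population, set_size, cycles, rng):
    """Mean of a balanced ranked set sample, assuming perfect ranking.

    In each cycle, draw `set_size` sets of `set_size` units each, sort every
    set, and measure only the r-th smallest unit of the r-th set.
    """
    measured = []
    for _ in range(cycles):
        for r in range(set_size):
            ranked = sorted(rng.sample(population, set_size))
            measured.append(ranked[r])
    return statistics.fmean(measured)


def compare(reps=2000, set_size=3, cycles=4, seed=0):
    """Return (variance of SRS mean, variance of RSS mean) over `reps` replicates."""
    rng = random.Random(seed)
    # Hypothetical population: 5000 draws from a normal distribution.
    population = [rng.gauss(50.0, 10.0) for _ in range(5000)]
    n = set_size * cycles  # both designs measure the same number of units
    srs = [srs_mean(population, n, rng) for _ in range(reps)]
    rss = [rss_mean(population, set_size, cycles, rng) for _ in range(reps)]
    return statistics.variance(srs), statistics.variance(rss)


if __name__ == "__main__":
    var_srs, var_rss = compare()
    print(f"variance of SRS mean: {var_srs:.3f}")
    print(f"variance of RSS mean: {var_rss:.3f}")
```

Both estimators use twelve measured units per replicate; the RSS design additionally ranks units that are never measured, which is where the expert knowledge mentioned in the abstract would enter.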

6.
Sampling designs dependent on sample moments of auxiliary variables are well known. Lahiri (Bull Int Stat Inst 33:133–140, 1951) considered a sampling design proportionate to a sample mean of an auxiliary variable. Singh and Srivastava (Biometrika 67(1):205–209, 1980) proposed a sampling design proportionate to a sample variance, while Wywiał (J Indian Stat Assoc 37:73–87, 1999) proposed a sampling design proportionate to a sample generalized variance of auxiliary variables. Some other sampling designs dependent on moments of an auxiliary variable were considered, e.g., in Wywiał (Some contributions to multivariate methods in survey sampling. Katowice University of Economics, Katowice, 2003a; Stat Transit 4(5):779–798, 2000), where the accuracy of some sampling strategies was compared, too. These sampling designs are not useful when there are some censored observations of the auxiliary variable. Moreover, they can be much too sensitive to outlying observations. In these cases a sampling design proportionate to an order statistic of an auxiliary variable can be more useful. That is why such an unequal probability sampling design is proposed here. Its particular cases as well as its conditional version are considered, too. A sampling scheme implementing this sampling design is proposed. The inclusion probabilities of the first and second orders are evaluated. The well known Horvitz–Thompson estimator is taken into account. A ratio estimator dependent on an order statistic is constructed. It is similar to the well known ratio estimator based on the population and sample means. Moreover, it is an unbiased estimator of the population mean when the sample is drawn according to the proposed sampling design dependent on the appropriate order statistic.

7.
Five sampling schemes (SS) for price index construction – one cut-off sampling technique and four probability-proportional-to-size (pps) methods – are evaluated by comparing their performance on a homescan market research data set across 21 months for each of the 13 classification of individual consumption by purpose (COICOP) food groups. Classifications are derived for each of the food groups and the population index value is used as a reference to derive performance error measures, such as root mean squared error, bias and standard deviation for each food type. Repeated samples are taken for each of the pps schemes and the resulting performance error measures analysed using regression of three of the pps schemes to assess the overall effect of SS and COICOP group whilst controlling for sample size, month and population index value. Cut-off sampling appears to perform less well than pps methods and multistage pps seems to have no advantage over its single-stage counterpart. The jackknife resampling technique is also explored as a means of estimating the standard error of the index and compared with the actual results from repeated sampling.

8.
Unbiased estimators for restricted adaptive cluster sampling   (cited by 2 in total; self-citations: 0; citations by others: 2)
In adaptive cluster sampling the size of the final sample is random, thus creating design problems. To get round this, Brown (1994) and Brown & Manly (1998) proposed a modification of the method, placing a restriction on the size of the sample, and using standard but biased estimators for estimating the population mean. But in this paper a new unbiased estimator and an unbiased variance estimator are proposed, based on estimators proposed by Murthy (1957) and extended to sequential and adaptive sampling designs by Salehi & Seber (2001). The paper also considers a restricted version of the adaptive scheme of Salehi & Seber (1997a) in which the networks are selected without replacement, and obtains unbiased estimators. The method is demonstrated by a simple example. Using simulation from this example, the new estimators are shown to compare very favourably with the standard biased estimators.

9.
In this study, we consider different sampling designs of ranked set sampling (RSS) and give empirical distribution function (EDF) estimators for each sampling design. We provide comparative graphs for the EDFs. Using these EDFs, the powers of five goodness-of-fit tests are obtained by Monte Carlo simulations for Tukey's gh distributions under RSS and simple random sampling (SRS). The performances of these tests are compared with those of the tests based on SRS. Also, critical values for these tests are obtained for different set and cycle sizes.

10.
Neoteric ranked set sampling (NRSS) is a recently developed sampling plan, derived from the well-known ranked set sampling (RSS) scheme. It has already been proved that NRSS provides more efficient estimators for population mean and variance compared to RSS and other sampling designs based on ranked sets. In this work, we propose and evaluate the performance of some two-stage sampling designs based on NRSS. Five different sampling schemes are proposed. Through an extensive Monte Carlo simulation study, we verified that all proposed sampling designs outperform RSS, NRSS, and the original double RSS design, producing estimators for the population mean with a lower mean square error. Furthermore, as with NRSS, two-stage NRSS estimators present some bias for asymmetric distributions. We complement the study with a discussion on the relative performance of the proposed estimators. Moreover, an additional simulation based on data of the diameter and height of pine trees is presented.

11.
In this paper, order statistics from independent and non-identically distributed random variables are used to obtain ordered ranked set sampling (ORSS). Bayesian inference for the unknown parameters of the Pareto distribution is carried out under a squared error loss function. We compute the minimum posterior expected loss (the posterior risk) of the derived estimates and compare them with those based on the corresponding simple random sample (SRS) to assess the efficiency of the obtained estimates. Two-sample Bayesian prediction for future observations is introduced by using SRS and ORSS for one and m cycles. A simulation study and real data are used to illustrate the proposed results.

12.
Abstract. Two new unequal probability sampling methods are introduced: conditional and restricted Pareto sampling. The advantage of conditional Pareto sampling compared with standard Pareto sampling, introduced by Rosén (J. Statist. Plann. Inference, 62, 1997, 135, 159), is that the factual inclusion probabilities better agree with the desired ones. Restricted Pareto sampling, preferably conditioned or adjusted, is able to handle cases where there are several restrictions on the sample and is an alternative to the recent cube method for balanced sampling introduced by Deville and Tillé (Biometrika, 91, 2004, 893). The new sampling designs have high entropy and the involved random numbers can be seen as permanent random numbers.

13.
In some applications it is cost efficient to sample data in two or more stages. In the first stage a simple random sample is drawn and then stratified according to some easily measured attribute. In each subsequent stage a random subset of previously selected units is sampled for more detailed and costly observation, with a unit's sampling probability determined by its attributes as observed in the previous stages. This paper describes multistage sampling designs and estimating equations based on the resulting data. Maximum likelihood estimates (MLEs) and their asymptotic variances are given for designs using parametric models. Horvitz–Thompson estimates are introduced as alternatives to MLEs, their asymptotic distributions are derived and their strengths and weaknesses are evaluated. The designs and the estimates are illustrated with data on corn production.
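
To make the Horvitz–Thompson idea concrete, here is a minimal sketch under a simplifying assumption that does not come from the paper: each unit is included independently with its own known probability (Poisson sampling). The estimator weights every sampled value by the inverse of its inclusion probability, and the helper below checks unbiasedness exactly by enumerating all possible samples of a tiny, made-up population. All names and numbers are hypothetical.

```python
from itertools import product


def horvitz_thompson_total(values, inclusion_probs):
    """Horvitz-Thompson estimate of a population total.

    `values` are the sampled y-values and `inclusion_probs` their first-order
    inclusion probabilities, in the same order.
    """
    return sum(y / p for y, p in zip(values, inclusion_probs))


def expected_ht_under_poisson(y, pi):
    """Exact expectation of the HT estimator under Poisson sampling.

    Enumerates all 2**N inclusion patterns, so it is only meant for tiny N.
    """
    expectation = 0.0
    for indicators in product([0, 1], repeat=len(y)):
        # Probability of this particular sample under independent inclusion.
        prob = 1.0
        for included, p in zip(indicators, pi):
            prob *= p if included else (1.0 - p)
        sample_y = [yi for yi, inc in zip(y, indicators) if inc]
        sample_pi = [pi_i for pi_i, inc in zip(pi, indicators) if inc]
        expectation += prob * horvitz_thompson_total(sample_y, sample_pi)
    return expectation


if __name__ == "__main__":
    y = [3.0, 7.0, 1.0, 9.0]    # hypothetical population values
    pi = [0.2, 0.5, 0.9, 0.4]   # hypothetical inclusion probabilities
    print("true total:", sum(y))
    print("E[HT estimate]:", expected_ht_under_poisson(y, pi))
```

The expectation matches the true total because each unit contributes its inclusion probability times y divided by that same probability. In the multistage designs of the abstract the inclusion probability of a unit would instead be built up from the selection probabilities at each stage.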

14.
A general family of estimators, which use the information of two auxiliary variables in the stratified random sampling, is proposed to estimate the population mean of the variable under study. Under stratified random sampling without replacement scheme, the expressions of bias and mean square error (MSE) up to the first- and second-order approximations are derived. The family of estimators in its optimum case is discussed. Also, an empirical study is carried out to show the properties of the proposed estimators.

15.
Composite samples are formed by physically mixing samples. Usually, composite samples are used to reduce the overall cost associated with analytical procedures that must be performed on each sample, but they can also be used to protect the privacy of individuals.

Composite sampling can reduce the cost of identifying individual cases that have a certain trait, such as those with a rare disease or those exceeding pollution-level standards. Not much is lost by applying this method as long as the trait is relatively rare.

Composite sampling can reduce the cost of estimating the mean of some process. When samples are composited, the ability to estimate the variance is lost. In spite of this, the potential savings are so great that composite samples have been used.

Much of this paper deals with the variance of estimators based on composite sampling when the proportions of the original samples comprising the composite sample are actually random. Taking repeated samples and measurements on several composite samples complicates the procedure, but allows the estimation of between and within variation as well as measurement error.
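
The remark that little is lost when the trait is rare can be illustrated with a standard two-stage pooling calculation. This is not taken from the paper; it assumes the classic scheme in which a composite of k samples is analysed once and, if it is positive, each of the k originals is analysed individually, with independent units and an error-free analysis. The function names are hypothetical.

```python
def expected_tests_per_unit(prevalence, pool_size):
    """Expected number of analyses per original sample under two-stage pooling.

    One analysis is spent on each composite of `pool_size` samples; a positive
    composite triggers `pool_size` individual follow-up analyses.
    """
    if pool_size < 2:
        return 1.0  # no compositing: every sample is analysed once
    p_composite_positive = 1.0 - (1.0 - prevalence) ** pool_size
    return 1.0 / pool_size + p_composite_positive


def best_pool_size(prevalence, max_pool_size=50):
    """Pool size in 1..max_pool_size with the smallest expected cost per unit."""
    return min(
        range(1, max_pool_size + 1),
        key=lambda k: expected_tests_per_unit(prevalence, k),
    )


if __name__ == "__main__":
    for prevalence in (0.01, 0.10, 0.50):
        k = best_pool_size(prevalence)
        cost = expected_tests_per_unit(prevalence, k)
        print(f"prevalence={prevalence:.2f}: pool size {k}, "
              f"{cost:.3f} analyses per sample")
```

Under these assumptions compositing pays off only when the expected cost per sample drops below one, which happens for rare traits and stops happening as the trait becomes common.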

16.
The variance of the sampling distribution of the sample mean is derived for two sampling designs in which a single cluster is randomly drawn from an autocorrelated population. The derivations are motivated by potential applications to statistical quality control, where a "one-cluster" sampling design may often be used because of ease of implementation, and where it is likely that process output is autocorrelated. Scenarios in statistical process control for which either non-overlapping or overlapping clusters are appropriate are described. The sampling design variance under non-overlapping clusters is related to the sampling design variance under overlapping clusters through the use of a circular population.

17.
In preposterior analysis, Bayesians use an Expected-Net-Gain chart to identify the optimal sample size. This kind of chart, it turns out, is also an excellent educational vehicle for illustrating many of the reasons given for preferring sampling from a population over taking a census, preferring one type of sampling over another (e.g., stratified sampling rather than simple random sampling), or allocating part of a fixed budget to reduce systematic error rather than using it all to reduce sampling errors. The use of such a chart in a basic statistics course is described.

18.
Summary.  Previous research has proposed a design-based analysis procedure for experiments that are embedded in complex sampling designs in which the ultimate sampling units of an on-going sample survey are randomized over different treatments according to completely randomized designs or randomized block designs. Design-based Wald and t-statistics are applied to test whether sample means that are observed under various survey implementations are significantly different. This approach is generalized to experimental designs in which clusters of sampling units are randomized over the different treatments. Furthermore, test statistics are derived to test differences between ratios of two sample estimates that are observed under alternative survey implementations. The methods are illustrated with a simulation study and real life applications of experiments that are embedded in the Dutch Labour Force Survey. The functionality of a software package that was developed to conduct these analyses is described.

19.
ABSTRACT

Recently, distance sampling emerged as an advantageous technique to estimate the abundance of many animal populations, including ungulates. Its basic design involves the random selection of several samplers (transects or points) within the population range, and a Horvitz–Thompson-like estimator is then applied to estimate the population abundance while correcting for animal detectability. Ensuring even coverage probability is essential for subsequent inference on the population size, but it may not be achievable because of limited access to parts of the population range. Moreover, in several environmental conditions, a random selection of samplers may induce very high survey costs because it does not minimize the displacement time of the observer(s) between successive samplers. We thus tested whether two-stage designs – based on the random selection of points and then of nearby samplers – could be more cost-effective, for a given population size and when even area coverage cannot be guaranteed. Here, we further extend our analyses to assess the performance of two-stage designs under varying animal densities.

20.
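In sample surveys, the use of an imperfect sampling frame and the presence of nonresponding units are the two main sources of non-sampling error. Using probabilistic and statistical methods, this paper analyses how estimation error is affected when any of three types of imperfect sampling frame is used and nonresponding units are present. It attempts to construct a non-sampling error function for the case in which an imperfect sampling frame and nonresponse coexist, providing a basis for the quantitative analysis and control of non-sampling error in sample surveys.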
