首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 81 毫秒
1.
The present article deals with the estimation of mean number of respondents who possess a rare sensitive character in presence of known and unknown proportion of a rare unrelated non-sensitive attribute by using the Poisson probability distribution in stratified random sampling as well as in stratified random double sampling. The variance of rare sensitive character is also derived under proportional and optimal allocation methods in stratified random sampling when stratum sizes are known and unknown. The properties of the suggested estimation procedures have been deeply examined. The proposed model is found to be dominant over Lee et al. [Estimation of a rare sensitive attribute in a stratified sample using Poisson distribution. Statistics. 2013;47:575–589] model. Numerical illustrations are presented to support the theoretical results. Results are analysed and suitable recommendations are put forward to the survey practitioners.  相似文献   

2.
A maximin criterion is used to find optimal designs for the logistic random intercept model with dichotomous independent variables. The dichotomous independent variables can be subdivided into variables for which the distribution is specified prior to data sampling, called variates, and into variables for which the distribution is not specified prior to data sampling, but is obtained from data sampling, called covariates. The proposed maximin criterion maximizes the smallest possible relative efficiency not only with respect to all possible values of the model parameters, but also with respect to the joint distribution of the covariates. We have shown that, under certain conditions, the maximin design is balanced with respect to the joint distribution of the variates. The proposed method will be used to plan a (stratified) clinical trial where variates and covariates are involved.  相似文献   

3.
Consider the exchangeable Bayesian hierarchical model where observations yi are independently distributed from sampling densities with unknown means, the means µi, are a random sample from a distribution g, and the parameters of g are assigned a known distribution h. A simple algorithm is presented for summarizing the posterior distribution based on Gibbs sampling and the Metropolis algorithm. The software program Matlab is used to implement the algorithm and provide a graphical output analysis. An binomial example is used to illustrate the flexibility of modeling possible using this algorithm. Methods of model checking and extensions to hierarchical regression modeling are discussed.  相似文献   

4.
Ranked-set sampling is an alternative to random sampling for settings in which measurements are difficult or costly. Ranked-set sampling utilizes information gained without measurement to structure the eventual measured sample. This additional information yields improved properties for ranked-set sample procedures relative to their simple random sample counterparts. We review the available nonparametric procedures for data from ranked-set samples. Estimation of the distribution function was the first nonparametric setting to which ranked-set sampling methodology was applied. Since the first paper on the ranked-set sample empirical distribution function, the two-sample location setting, the sign test, and the signed-rank test have all been examined for ranked-set samples. In addition, estimation of the distribution function has been considered in a more general setting. We discuss the similarities and differences in the properties of the ranked-set sample procedures for the various settings  相似文献   

5.
The innovation random variable for a non-negative self-decomposable random variable can have a compound Poisson distribution. In this case, we provide the density function for the compounded variable. When it does not have a compound Poisson representation, there is a straightforward and easily available compound Poisson approximation for which the density function of the compounded variable is also available. These results can be used in the simulation of Ornstein–Uhlenbeck type processes with given marginal distributions. Previously, simulation of such processes used the inverse of the corresponding tail Lévy measure. We show this approach corresponds to the use of an inverse cdf method of a certain distribution. With knowledge of this distribution and hence density function, the sampling procedure is open to direct sampling methods.  相似文献   

6.
SUMMARY Ranked-set sampling is a widely used sampling procedure when sample observations are expensive or difficult to obtain. It departs from simple random sampling by seeking to spread the observations in the sample widely over the distribution or population. This is achieved by ranking methods which may need to employ concomitant information. The ranked-set sample mean is known to be more efficient than the corresponding simple random sample mean. Instead of the ranked-set sample mean, this paper considers the corresponding optimal estimator: the ranked-set best linear unbiased estimator. This is shown to be more efficient, even for normal data, but particularly for skew data, such as from an exponential distribution. The corresponding forms of the estimators are quite distinct from the ranked-set sample mean. Improvement holds where the ordering is perfect or imperfect, with this prospect of improper ordering being explored through the use of concomitants. In addition, the corresponding optimal linear estimator of a scale parameter is also discussed. The results are applied to a biological problem that involves the estimation of root weights for experimental plants, where the expense of measurement implies the need to minimize the number of observations taken.  相似文献   

7.
A significant problem in the collection of responses to potentially sensitive questions, such as relating to illegal, immoral or embarrassing activities, is non-sampling error due to refusal to respond or false responses. Eichhorn & Hayre (1983) suggested the use of scrambled responses to reduce this form of bias. This paper considers a linear regression model in which the dependent variable is unobserved but for which the sum or product with a scrambling random variable of known distribution, is known. The performance of two likelihood-based estimators is investigated, namely of a Bayesian estimator achieved through a Markov chain Monte Carlo (MCMC) sampling scheme, and a classical maximum-likelihood estimator. These two estimators and an estimator suggested by Singh, Joarder & King (1996) are compared. Monte Carlo results show that the Bayesian estimator out-performs the classical estimators in all cases, and the relative performance of the Bayesian estimator improves as the responses become more scrambled.  相似文献   

8.
This paper presents a new random weighting-based adaptive importance resampling method to estimate the sampling distribution of a statistic. A random weighting-based cross-entropy procedure is developed to iteratively calculate the optimal resampling probability weights by minimizing the Kullback-Leibler distance between the optimal importance resampling distribution and a family of parameterized distributions. Subsequently, the random weighting estimation of the sampling distribution is constructed from the obtained optimal importance resampling distribution. The convergence of the proposed method is rigorously proved. Simulation and experimental results demonstrate that the proposed method can effectively estimate the sampling distribution of a statistic.  相似文献   

9.
Bayesian random effects models may be fitted using Gibbs sampling, but the Gibbs sampler can be slow mixing due to what might be regarded as lack of model identifiability. This slow mixing substantially increases the number of iterations required during Gibbs sampling. We present an analysis of data on immunity after Rubella vaccinations which results in a slow-mixing Gibbs sampler. We show that this problem of slow mixing can be resolved by transforming the random effects and then, if desired, expressing their joint prior distribution as a sequence of univariate conditional distributions. The resulting analysis shows that the decline in antibodies after Rubella vaccination is relatively shallow compared to the decline in antibodies which has been shown after Hepatitis B vaccination.  相似文献   

10.
Multistage ranked-set sampling (MRSS) is a generalization of ranked-set sampling in which multiple stages of ranking are used. It is known that for a fixed distribution under perfect rankings, each additional stage provides a gain in efficiency when estimating the population mean. However, the maximum possible efficiency for the MRSS sample mean relative to the simple random sampling sample mean has not previously been determined. In this paper, we provide a method for computing this maximum possible efficiency under perfect rankings for any choice of the set size and the number of stages. The maximum efficiency tends to infinity as the number of stages increases, and, for large numbers of stages, the efficiency-maximizing distributions are symmetric multi-modal distributions where the number of modes matches the set size. The results in this paper correct earlier assertions in the literature that the maximum efficiency is bounded and that it is achieved when the distribution is uniform.  相似文献   

11.
POT模型在巨灾损失预测中的应用——基于MCMC方法的估计   总被引:1,自引:0,他引:1  
极值统计学主要研究随机事件极端情况的统计规律性。运用POT模型拟合中国暴雨损失数据,确定损失超出量的分布形式。实证分析表明,借助POT模型对巨灾风险损失分布进行估计是较为合理的,但当数据量较小时,使用基于Gibbs抽样的MCMC方法估计POT模型的参数,可以解决样本数据不足导致的极大似然估计中误差增大的问题。  相似文献   

12.
We consider a likelihood ratio test of independence for large two-way contingency tables having both structural (non-random) and sampling (random) zeros in many cells. The solution of this problem is not available using standard likelihood ratio tests. One way to bypass this problem is to remove the structural zeroes from the table and implement a test on the remaining cells which incorporate the randomness in the sampling zeros; the resulting test is a test of quasi-independence of the two categorical variables. This test is based only on the positive counts in the contingency table and is valid when there is at least one sampling (random) zero. The proposed (likelihood ratio) test is an alternative to the commonly used ad hoc procedures of converting the zero cells to positive ones by adding a small constant. One practical advantage of our procedure is that there is no need to know if a zero cell is structural zero or a sampling zero. We model the positive counts using a truncated multinomial distribution. In fact, we have two truncated multinomial distributions; one for the null hypothesis of independence and the other for the unrestricted parameter space. We use Monte Carlo methods to obtain the maximum likelihood estimators of the parameters and also the p-value of our proposed test. To obtain the sampling distribution of the likelihood ratio test statistic, we use bootstrap methods. We discuss many examples, and also empirically compare the power function of the likelihood ratio test relative to those of some well-known test statistics.  相似文献   

13.
方丽婷 《统计研究》2014,31(5):102-106
本文采用Bayes方法对空间滞后模型进行全面分析。在构建模型的贝叶斯框架时,对模型系数与误差方差分别选取正态先验分布和逆伽玛先验分布,这样以便获得参数的联合后验分布和条件后验分布。在抽样估计时,文章主要使用MCMC方法,同时还设计了一个简单随机游动Metropolis抽样器,以方便从空间权重因子系数的条件后验分布中进行抽样。最后应用所建议的方法进行数值模拟。  相似文献   

14.
Recognizing that the efficiency in relative risk estimation for the Cox proportional hazards model is largely constrained by the total number of cases, Prentice (1986) proposed the case-cohort design in which covariates are measured on all cases and on a random sample of the cohort. Subsequent to Prentice, other methods of estimation and sampling have been proposed for these designs. We formalize an approach to variance estimation suggested by Barlow (1994), and derive a robust variance estimator based on the influence function. We consider the applicability of the variance estimator to all the proposed case-cohort estimators, and derive the influence function when known sampling probabilities in the estimators are replaced by observed sampling fractions. We discuss the modifications required when cases are missing covariate information. The missingness may occur by chance, and be completely at random; or may occur as part of the sampling design, and depend upon other observed covariates. We provide an adaptation of S-plus code that allows estimating influence function variances in the presence of such missing covariates. Using examples from our current case-cohort studies on esophageal and gastric cancer, we illustrate how our results our useful in solving design and analytic issues that arise in practice.  相似文献   

15.
A model involving autocorrelated random effects and sampling errors is proposed for small-area estimation, using both time-series and cross-sectional data. The sampling errors are assumed to have a known block-diagonal covariance matrix. This model is an extension of a well-known model, due to Fay and Herriot (1979), for cross-sectional data. A two-stage estimator of a small-area mean for the current period is obtained under the proposed model with known autocorrelation, by first deriving the best linear unbiased prediction estimator assuming known variance components, and then replacing them with their consistent estimators. Extending the approach of Prasad and Rao (1986, 1990) for the Fay-Herriot model, an estimator of mean squared error (MSE) of the two-stage estimator, correct to a second-order approximation for a small or moderate number of time points, T, and a large number of small areas, m, is obtained. The case of unknown autocorrelation is also considered. Limited simulation results on the efficiency of two-stage estimators and the accuracy of the proposed estimator of MSE are présentés.  相似文献   

16.
Abstract.  One of the main research areas in Bayesian Nonparametrics is the proposal and study of priors which generalize the Dirichlet process. In this paper, we provide a comprehensive Bayesian non-parametric analysis of random probabilities which are obtained by normalizing random measures with independent increments (NRMI). Special cases of these priors have already shown to be useful for statistical applications such as mixture models and species sampling problems. However, in order to fully exploit these priors, the derivation of the posterior distribution of NRMIs is crucial: here we achieve this goal and, indeed, provide explicit and tractable expressions suitable for practical implementation. The posterior distribution of an NRMI turns out to be a mixture with respect to the distribution of a specific latent variable. The analysis is completed by the derivation of the corresponding predictive distributions and by a thorough investigation of the marginal structure. These results allow to derive a generalized Blackwell–MacQueen sampling scheme, which is then adapted to cover also mixture models driven by general NRMIs.  相似文献   

17.
ABSTRACT

A general Bayesian random effects model for analyzing longitudinal mixed correlated continuous and negative binomial responses with and without missing data is presented. This Bayesian model, given some random effects, uses a normal distribution for the continuous response and a negative binomial distribution for the count response. A Markov Chain Monte Carlo sampling algorithm is described for estimating the posterior distribution of the parameters. This Bayesian model is illustrated by a simulation study. For sensitivity analysis to investigate the change of parameter estimates with respect to the perturbation from missing at random to not missing at random assumption, the use of posterior curvature is proposed. The model is applied to a medical data, obtained from an observational study on women, where the correlated responses are the negative binomial response of joint damage and continuous response of body mass index. The simultaneous effects of some covariates on both responses are also investigated.  相似文献   

18.
In this paper, we discuss a fully Bayesian quantile inference using Markov Chain Monte Carlo (MCMC) method for longitudinal data models with random effects. Under the assumption of error term subject to asymmetric Laplace distribution, we establish a hierarchical Bayesian model and obtain the posterior distribution of unknown parameters at τ-th level. We overcome the current computational limitations using two approaches. One is the general MCMC technique with Metropolis–Hastings algorithm and another is the Gibbs sampling from the full conditional distribution. These two methods outperform the traditional frequentist methods under a wide array of simulated data models and are flexible enough to easily accommodate changes in the number of random effects and in their assumed distribution. We apply the Gibbs sampling method to analyse a mouse growth data and some different conclusions from those in the literatures are obtained.  相似文献   

19.
We consider a Bayesian approach to the study of independence in a two-way contingency table which has been obtained from a two-stage cluster sampling design. If a procedure based on single-stage simple random sampling (rather than the appropriate cluster sampling) is used to test for independence, the p-value may be too small, resulting in a conclusion that the null hypothesis is false when it is, in fact, true. For many large complex surveys the Rao–Scott corrections to the standard chi-squared (or likelihood ratio) statistic provide appropriate inference. For smaller surveys, though, the Rao–Scott corrections may not be accurate, partly because the chi-squared test is inaccurate. In this paper, we use a hierarchical Bayesian model to convert the observed cluster samples to simple random samples. This provides surrogate samples which can be used to derive the distribution of the Bayes factor. We demonstrate the utility of our procedure using an example and also provide a simulation study which establishes our methodology as a viable alternative to the Rao–Scott approximations for relatively small two-stage cluster samples. We also show the additional insight gained by displaying the distribution of the Bayes factor rather than simply relying on a summary of the distribution.  相似文献   

20.
This paper presents a procedure for developing life-test sampling plans for exponential distributions based upon accelerated life testing(ALT). Type II censoring is assumed at each overstress level. The derived test statistic is shown to be a quotient of two independent random variables, each of which is a rational power of a Chi-square random variable. The distribution of the test statistic is characterized by the H-function, which can be numerically evaluated to obtain desired sampling plans.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号