首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
马佳羽等 《统计研究》2020,37(11):30-43
在居民生活满意度的相关研究中,除考虑人口学特征外,越来越多的实证同时考虑了微观个体所处的宏观环境,对这类呈嵌套结构的分层数据需构建分层统计模型,但传统的分层统计模型未考虑真实的空间依赖。本文将分层统计模型和空间自回归模型相结合,创新性地构建了四种序数分层空间自回归Probit模型,该类模型能够合理地对因变量为序数且存在空间依赖情况并呈分层结构的数据进行建模,模型可避免忽略真实的空间依赖对模型估计的不利影响,且能够对高层组间的空间效应和低层个体间的空间效应区别对待,更有利于模型的解释。最后,空气质量对居民生活满意度的效应实证研究表明:空气质量确实能够对生活满意度产生影响,居民对空气质量的认识和要求并非孤立地局限于本地,而是对一个区域空气质量的空间综合结果。对比2018年和2016年模型结果可知:空气质量的福利效应无法被其他民生福祉因素所取代,并且随着空气质量相关统计信息的高度开放和广泛传播,居民更加重视空气质量,也形成了更加全局的了解。  相似文献   

2.
Hierarchical binary outcome data with three levels, such as disease remission for patients nested within physicians, nested within clinics are frequently encountered in practice. One important aspect in such data is the correlation that occurs at each level of the data. In parametric modeling, accounting for these correlations increases the complexity. These models may also yield results that lead to the same conclusions as simpler models. We developed a measure of intraclass correlation at each stage of a three-level nested structure and identified guidelines for determining when the dependencies in hierarchical models need to be taken into account. These guidelines are supported by simulations of hierarchical data sets, as well as the analysis of AIDS knowledge in Bangladesh from the 2011 Demographic Health Survey. We also provide a simple rule of thumb to assist researchers faced with the challenge of choosing an appropriately complex model when analyzing hierarchical binary data.  相似文献   

3.
Summary.  Genetic polymorphisms in deoxyribonucleic acid coding regions may have a phenotypic effect on the carrier, e.g. by influencing susceptibility to disease. Detection of deleterious mutations via association studies is hampered by the large number of candidate sites; therefore methods are needed to narrow down the search to the most promising sites. For this, a possible approach is to use structural and sequence-based information of the encoded protein to predict whether a mutation at a particular site is likely to disrupt the functionality of the protein itself. We propose a hierarchical Bayesian multivariate adaptive regression spline (BMARS) model for supervised learning in this context and assess its predictive performance by using data from mutagenesis experiments on lac repressor and lysozyme proteins. In these experiments, about 12 amino-acid substitutions were performed at each native amino-acid position and the effect on protein functionality was assessed. The training data thus consist of repeated observations at each position, which the hierarchical framework is needed to account for. The model is trained on the lac repressor data and tested on the lysozyme mutations and vice versa. In particular, we show that the hierarchical BMARS model, by allowing for the clustered nature of the data, yields lower out-of-sample misclassification rates compared with both a BMARS and a frequen-tist MARS model, a support vector machine classifier and an optimally pruned classification tree.  相似文献   

4.
The authors develop a methodology for predicting unobserved values in a conditionally lognormal random spatial field like those commonly encountered in environmental risk analysis. These unobserved values are of two types. The first come from spatial locations where the field has never been monitored, the second, from currently monitored sites which have been only recently installed. Thus the monitoring data exhibit a monotone pattern, resembling a staircase whose highest step comes from the oldest monitoring sites. The authors propose a hierarchical Bayesian approach using the lognormal sampling distribution, in conjunction with a conjugate generalized Wishart distribution. This prior distribution allows different degrees of freedom to be fitted for individual steps, taking into account the differential amounts of information available from sites at the different steps in the staircase. The resulting hierarchical model is a predictive distribution for the unobserved values of the field. The method is demonstrated by application to the ambient ozone field for the southwestern region of British Columbia.  相似文献   

5.
Due to the escalating growth of big data sets in recent years, new Bayesian Markov chain Monte Carlo (MCMC) parallel computing methods have been developed. These methods partition large data sets by observations into subsets. However, for Bayesian nested hierarchical models, typically only a few parameters are common for the full data set, with most parameters being group specific. Thus, parallel Bayesian MCMC methods that take into account the structure of the model and split the full data set by groups rather than by observations are a more natural approach for analysis. Here, we adapt and extend a recently introduced two-stage Bayesian hierarchical modeling approach, and we partition complete data sets by groups. In stage 1, the group-specific parameters are estimated independently in parallel. The stage 1 posteriors are used as proposal distributions in stage 2, where the target distribution is the full model. Using three-level and four-level models, we show in both simulation and real data studies that results of our method agree closely with the full data analysis, with greatly increased MCMC efficiency and greatly reduced computation times. The advantages of our method versus existing parallel MCMC computing methods are also described.  相似文献   

6.
We present how the repeatability and reproducibility of a measurement device can be estimated from a suitably defined hierarchical linear model. The methodology is illustrated using a collection of eight data sets which consist of the distortion product otoacoustic emission recordings collected from both ears of ten young Sprague-Dawley rats at different frequencies under eight different recording conditions. We formulate a model which extends the commonly used one-way random effects model (5) to account for an experimental setup that is more elaborated than the ones traditionally used in interlaboratory experiments. The fitted model is easily interpretable and furnishes as a by-product the frequencies at which the highest response level is achieved under the eight recording conditions. These values together with the repeatability and reproducibility limits of the protocols are crucial in contributing to the enhancement of the research capabilities on the possible causes of hearing impairment.  相似文献   

7.
With rapid improvements in medical treatment and health care, many datasets dealing with time to relapse or death now reveal a substantial portion of patients who are cured (i.e., who never experience the event). Extended survival models called cure rate models account for the probability of a subject being cured and can be broadly classified into the classical mixture models of Berkson and Gage (BG type) or the stochastic tumor models pioneered by Yakovlev and extended to a hierarchical framework by Chen, Ibrahim, and Sinha (YCIS type). Recent developments in Bayesian hierarchical cure models have evoked significant interest regarding relationships and preferences between these two classes of models. Our present work proposes a unifying class of cure rate models that facilitates flexible hierarchical model-building while including both existing cure model classes as special cases. This unifying class enables robust modeling by accounting for uncertainty in underlying mechanisms leading to cure. Issues such as regressing on the cure fraction and propriety of the associated posterior distributions under different modeling assumptions are also discussed. Finally, we offer a simulation study and also illustrate with two datasets (on melanoma and breast cancer) that reveal our framework's ability to distinguish among underlying mechanisms that lead to relapse and cure.  相似文献   

8.
The last decade has witnessed major developments in Geographical Information Systems (GIS) technology resulting in the need for statisticians to develop models that account for spatial clustering and variation. In public health settings, epidemiologists and health-care professionals are interested in discerning spatial patterns in survival data that might exist among the counties. This paper develops a Bayesian hierarchical model for capturing spatial heterogeneity within the framework of proportional odds. This is deemed more appropriate when a substantial percentage of subjects enjoy prolonged survival. We discuss the implementation issues of our models, perform comparisons among competing models and illustrate with data from the SEER (Surveillance Epidemiology and End Results) database of the National Cancer Institute, paying particular attention to the underlying spatial story.  相似文献   

9.
Individual-level models (ILMs) for infectious disease can be used to model disease spread between individuals while taking into account important covariates. One important covariate in determining the risk of infection transfer can be spatial location. At the same time, measurement error is a concern in many areas of statistical analysis, and infectious disease modelling is no exception. In this paper, we are concerned with the issue of measurement error in the recorded location of individuals when using a simple spatial ILM to model the spread of disease within a population. An ILM that incorporates spatial location random effects is introduced within a hierarchical Bayesian framework. This model is tested upon both simulated data and data from the UK 2001 foot-and-mouth disease epidemic. The ability of the model to successfully identify both the spatial infection kernel and the basic reproduction number (R 0) of the disease is tested.  相似文献   

10.
Imprisonment levels vary widely across the United States, with some state imprisonment rates six times higher than others. Imposition of prison sentences also varies between counties within states, with previous research suggesting that covariates such as crime rate, unemployment level, racial composition, political conservatism, geographic region, and sentencing policies account for some of this variation. Other studies, using court data on individual felons, demonstrate how type of offense, demographics, criminal history, and case characteristics affect sentence severity. This article considers the effects of both county-level and individual-level covariates on whether a convicted felon receives a prison sentence rather than a jail or non-custodial sentence. We analyze felony court case processing data from May 1998 for 39 of the nation's most populous urban counties using a Bayesian hierarchical logistic regression model. By adopting a Bayesian approach, we are able to overcome a number of challenges. The model allows individual-level effects to vary by county, but relates these effects across counties using county-level covariates. We account for missing data using imputation via additional Gibbs sampling steps when estimating the model. Finally, we use posterior samples to construct novel predictor effect plots to aid communication of results to criminal justice policy-makers.  相似文献   

11.
In this paper, a generalization of the two-parameter partial credit model (2PL-PCM) and of two special cases, the partial credit model (PCM) and the rating scale model (RSM), with a hierarchical data structure will be presented. Having shown how 2PL-PCM, as with other item response theory (IRT) models, may be read in terms of a generalized linear mixed model (GLMM) with two aggregation levels, a presentation will be given of an extension to the case of measuring the latent trait of individuals aggregated in groups. The use of this Multilevel IRT model will be illustrated via reference to the evaluation of university teaching by students following the courses. The aim is to generate a ranking of teaching on the basis of student satisfaction, so as to give teachers, and those responsible for organizing study courses, a background of information that takes the opinions of the direct target group for university teaching (that is, the students) into account, in the context of improving the teaching courses available.  相似文献   

12.
Due to significant progress in cancer treatments and management in survival studies involving time to relapse (or death), we often need survival models with cured fraction to account for the subjects enjoying prolonged survival. Our article presents a new proportional odds survival models with a cured fraction using a special hierarchical structure of the latent factors activating cure. This new model has same important differences with classical proportional odds survival models and existing cure-rate survival models. We demonstrate the implementation of Bayesian data analysis using our model with data from the SEER (Surveillance Epidemiology and End Results) database of the National Cancer Institute. Particularly aimed at survival data with cured fraction, we present a novel Bayes method for model comparisons and assessments, and demonstrate our new tool’s superior performance and advantages over competing tools.  相似文献   

13.
Recently, there has been great interest in estimating the decline in cognitive ability in patients with Alzheimer's disease. Measuring decline is not straightforward, since one must consider the choice of scale to measure cognitive ability, possible floor and ceiling effects, between-patient variability, and the unobserved age of onset. The authors demonstrate how to account for the above features by modeling decline in scores on the Mini-Mental State Exam in two different data sets. To this end, they use hierarchical Bayesian models with change points, for which posterior distributions are calculated using the Gibbs sampler. They make comparisons between several such models using both prior and posterior Bayes factors, and compare the results from the models suggested by these two model selection criteria.  相似文献   

14.
Recent advances in computing make it practical to use complex hierarchical models. However, the complexity makes it difficult to see how features of the data determine the fitted model. This paper describes an approach to diagnostics for hierarchical models, specifically linear hierarchical models with additive normal or t -errors. The key is to express hierarchical models in the form of ordinary linear models by adding artificial `cases' to the data set corresponding to the higher levels of the hierarchy. The error term of this linear model is not homoscedastic, but its covariance structure is much simpler than that usually used in variance component or random effects models. The re-expression has several advantages. First, it is extremely general, covering dynamic linear models, random effect and mixed effect models, and pairwise difference models, among others. Second, it makes more explicit the geometry of hierarchical models, by analogy with the geometry of linear models. Third, the analogy with linear models provides a rich source of ideas for diagnostics for all the parts of hierarchical models. This paper gives diagnostics to examine candidate added variables, transformations, collinearity, case influence and residuals.  相似文献   

15.
Meta-analytical approaches have been extensively used to analyze medical data. In most cases, the data come from different studies or independent trials with similar characteristics. However, these methods can be applied in a broader sense. In this paper, we show how existing meta-analytic techniques can also be used as well when dealing with parameters estimated from individual hierarchical data. Specifically, we propose to apply statistical methods that account for the variances (and possibly covariances) of such measures. The estimated parameters together with their estimated variances can be incorporated into a general linear mixed model framework. We illustrate the methodology by using data from a first-in-man study and a simulated data set. The analysis was implemented with the SAS procedure MIXED and example code is offered.  相似文献   

16.
In this paper, measurements from experiments and results of a finite element analysis (FEA) are combined in order to compute accurate empirical models for the temperature distribution before a thermomechanically coupled forming process. To accomplish this, Design and Analysis of Computer Experiments (DACE) is used to separately compute models for the measurements and the functional output of the FEA. Based on a hierarchical approach, a combined model of the process is computed. In this combined modelling approach, the model for the FEA is corrected by taking into account the systematic deviations from the experimental measurements. The large number of observations based on the functional output hinders the direct computation of the DACE models due to the internal inversion of the correlation matrix. Thus, different techniques for identifying a relevant subset of the observations are proposed. The application of the resulting procedure is presented, and a statistical validation of the empirical models is performed.  相似文献   

17.
We propose a Bayesian hierarchical model for multiple comparisons in mixed models where the repeated measures on subjects are described with the subject random effects. The model facilitates inferences in parameterizing the successive differences of the population means, and for them, we choose independent prior distributions that are mixtures of a normal distribution and a discrete distribution with its entire mass at zero. For the other parameters, we choose conjugate or vague priors. The performance of the proposed hierarchical model is investigated in the simulated and two real data sets, and the results illustrate that the proposed hierarchical model can effectively conduct a global test and pairwise comparisons using the posterior probability that any two means are equal. A simulation study is performed to analyze the type I error rate, the familywise error rate, and the test power. The Gibbs sampler procedure is used to estimate the parameters and to calculate the posterior probabilities.  相似文献   

18.
基于手机市场的调研数据,构建了多层贝叶斯模型,并将其与传统哑变量回归模型进行比较,得出前者比后者具有更好的模型拟合能力和预测能力的结论。利用有人口特征变量的多层贝叶斯随机效应模型来完成对所有未知参数的估计,从个体内行为和个体间行为两个层面对消费者的偏好行为进行全面分析,结果发现该模型可以很好解释消费者的总体偏好及其偏好差异性。  相似文献   

19.
This paper studies prediction of future failure (rates) by hierarchical empirical Bayes (EB) Poisson regression methodologies. Both a gamma distributed superpopulation as well as a more robust (long-tailed) log student-t superpopulation are considered. Simulation results are reported concerning predicted Poisson rates. The results tentatively suggest that a hierarchical model with gamma superpopulation can effectively adapt to data coming from a log-Student-t superpopulation particularly if the additional computation involved with estimation for the log-Student-t hierarchical model is burdensome.  相似文献   

20.
Significance tests on coefficients of lower-order terms in polynomial regression models are affected by linear transformations. For this reason, a polynomial regression model that excludes hierarchically inferior predictors (i.e., lower-order terms) is considered to be not well formulated. Existing variable-selection algorithms do not take into account the hierarchy of predictors and often select as “best” a model that is not hierarchically well formulated. This article proposes a theory of the hierarchical ordering of the predictors of an arbitrary polynomial regression model in m variables, where m is any arbitrary positive integer. Ways of modifying existing algorithms to restrict their search to well-formulated models are suggested. An algorithm that generates all possible well-formulated models is presented.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号