共查询到20条相似文献,搜索用时 15 毫秒
1.
Miao-Yu Tsai 《Journal of applied statistics》2010,37(6):1043-1058
Family studies are often conducted to examine the existence of familial aggregation. Particularly, twin studies can model separately the genetic and environmental contribution. Here we estimate the heritability of quantitative traits via variance components of random-effects in linear mixed models (LMMs). The motivating example was a myopia twin study containing complex nesting data structures: twins and siblings in the same family and observations on both eyes for each individual. Three models are considered for this nesting structure. Our proposal takes into account the model uncertainty in both covariates and model structures via an extended Bayesian model averaging (EBMA) procedure. We estimate the heritability using EBMA under three suggested model structures. When compared with the results under the model with the highest posterior model probability, the EBMA estimate has smaller variation and is slightly conservative. Simulation studies are conducted to evaluate the performance of variance-components estimates, as well as the selections of risk factors, under the correct or incorrect structure. The results indicate that EBMA, with consideration of uncertainties in both covariates and model structures, is robust in model misspecification than the usual Bayesian model averaging (BMA) that considers only uncertainty in covariates selection. 相似文献
2.
Various statistical models have been proposed for two‐dimensional dose finding in drug‐combination trials. However, it is often a dilemma to decide which model to use when conducting a particular drug‐combination trial. We make a comprehensive comparison of four dose‐finding methods, and for fairness, we apply the same dose‐finding algorithm under the four model structures. Through extensive simulation studies, we compare the operating characteristics of these methods in various practical scenarios. The results show that different models may lead to different design properties and that no single model performs uniformly better in all scenarios. As a result, we propose using Bayesian model averaging to overcome the arbitrariness of the model specification and enhance the robustness of the design. We assign a discrete probability mass to each model as the prior model probability and then estimate the toxicity probabilities of combined doses in the Bayesian model averaging framework. During the trial, we adaptively allocated each new cohort of patients to the most appropriate dose combination by comparing the posterior estimates of the toxicity probabilities with the prespecified toxicity target. The simulation results demonstrate that the Bayesian model averaging approach is robust under various scenarios. Copyright © 2015 John Wiley & Sons, Ltd. 相似文献
3.
We extend the Bayesian Model Averaging (BMA) framework to dynamic panel data models with endogenous regressors using a Limited Information Bayesian Model Averaging (LIBMA) methodology. Monte Carlo simulations confirm the asymptotic performance of our methodology both in BMA and selection, with high posterior inclusion probabilities for all relevant regressors, and parameter estimates very close to their true values. In addition, we illustrate the use of LIBMA by estimating a dynamic gravity model for bilateral trade. Once model uncertainty, dynamics, and endogeneity are accounted for, we find several factors that are robustly correlated with bilateral trade. We also find that applying methodologies that do not account for either dynamics or endogeneity (or both) results in different sets of robust determinants. 相似文献
4.
David R. Bickel 《统计学通讯:理论与方法》2018,47(13):3125-3137
Just as frequentist hypothesis tests have been developed to check model assumptions, prior predictive p-values and other Bayesian p-values check prior distributions as well as other model assumptions. These model checks not only suffer from the usual threshold dependence of p-values, but also from the suppression of model uncertainty in subsequent inference. One solution is to transform Bayesian and frequentist p-values for model assessment into a fiducial distribution across the models. Averaging the Bayesian or frequentist posterior distributions with respect to the fiducial distribution can reproduce results from Bayesian model averaging or classical fiducial inference. 相似文献
5.
6.
《Journal of Statistical Computation and Simulation》2012,82(8):1667-1678
We propose a new iterative algorithm, called model walking algorithm, to the Bayesian model averaging method on the longitudinal regression models with AR(1) random errors within subjects. The Markov chain Monte Carlo method together with the model walking algorithm are employed. The proposed method is successfully applied to predict the progression rates on a myopia intervention trial in children. 相似文献
7.
基于MCMC模拟的贝叶斯分层信用风险评估模型 总被引:1,自引:2,他引:1
缺少违约数据与债务人异质性是度量信用风险时面临的重要问题。贝叶斯模型中分层先验信息和马尔可夫链蒙特卡罗(MCMC)模拟方法的应用可以有效缓解数据缺失和测量误差问题,并能对债务人异质性进行评价和比较,从而避免低估风险。针对银行数据的模型拟合与模型诊断均展现了分层估计的适应性和灵活性,相关方法简洁清晰,利于国内风险分析人员采用。同时,涵盖宏观经济协变量的贝叶斯分层模型可以用于更加复杂的风险分析。 相似文献
8.
Identification of different gene expressions of chickpea (Cicer arietinum) plant tissue is needed in order to develop new varieties of chickpea plant which is resistant to disease through the insertion of genes. This plant is the third legume plant of the Leguminosae (Fabaceae) family and is much needed in the world due to its high-protein seeds and roots that contain symbiotic nitrogen-fixing bacteria. This paper has succeeded to demonstrate the work of Bayesian mixture model averaging (BMMA) approach to identify the different gene expressions of chickpea plant tissue in Indonesia. The results show that the best BMMA normal models contain from 727 (73%) up to 939 (94%) models from 1,000 generated mixture normal models. The fitted BMMA models to gene expression differences data on average is 0.2878511 for Kolmogorov–Smirnov (KS) and 0.1278080 for continuous rank probability score (CRPS). Based on these BMMA models, there are three groups of gene IDs: downregulated, regulated, and upregulated. The results of this grouping can be useful to find new varieties of chickpea plants that are more resistant to disease. The BMMA normal models coupled with Occam's window as a data-driven modeling have succeed to demonstrate the work of building the gene expression differences microarray experiments data. 相似文献
9.
Imad Bou-Hamad 《统计学通讯:理论与方法》2017,46(15):7289-7300
Cerciello and Giudici (2014) proposed a Bayesian approach to improve the ordinal variable selection in credit rating assessment. However, no comparison has been made with other methods and the predictive power was not tested. This study proposes an integrated framework of random forest (RF)-based methods and Bayesian model averaging (BMA) to validate and investigate the ordinal variable importance in evaluating credit risk and predicting default in greater depth. The proposed approach was superior to the Cerciello and Giudici method in terms of predictive accuracy and interpretability when applied to a European credit risk database. 相似文献
10.
P. J. Brown M. Vannucci T. Fearn 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2002,64(3):519-536
Summary. When a number of distinct models contend for use in prediction, the choice of a single model can offer rather unstable predictions. In regression, stochastic search variable selection with Bayesian model averaging offers a cure for this robustness issue but at the expense of requiring very many predictors. Here we look at Bayes model averaging incorporating variable selection for prediction. This offers similar mean-square errors of prediction but with a vastly reduced predictor space. This can greatly aid the interpretation of the model. It also reduces the cost if measured variables have costs. The development here uses decision theory in the context of the multivariate general linear model. In passing, this reduced predictor space Bayes model averaging is contrasted with single-model approximations. A fast algorithm for updating regressions in the Markov chain Monte Carlo searches for posterior inference is developed, allowing many more variables than observations to be contemplated. We discuss the merits of absolute rather than proportionate shrinkage in regression, especially when there are more variables than observations. The methodology is illustrated on a set of spectroscopic data used for measuring the amounts of different sugars in an aqueous solution. 相似文献
11.
Given a set of possible models for variables X and a set of possible parameters for each model, the Bayesian estimate of the probability distribution for X given observed data is obtained by averaging over the possible models and their parameters. An often-used approximation for this estimate is obtained by selecting a single model and averaging over its parameters. The approximation is useful because it is computationally efficient, and because it provides a model that facilitates understanding of the domain. A common criterion for model selection is the posterior probability of the model. Another criterion for model selection, proposed by San Martini and Spezzafari (1984), is the predictive performance of a model for the next observation to be seen. From the standpoint of domain understanding, both criteria are useful, because one identifies the model that is most likely, whereas the other identifies the model that is the best predictor of the next observation. To highlight the difference, we refer to the posterior-probability and alternative criteria as the scientific criterion (SC) and engineering criterion (EC), respectively. When we are interested in predicting the next observation, the model-averaged estimate is at least as good as that produced by EC, which itself is at least as good as the estimate produced by SC. We show experimentally that, for Bayesian-network models containing discrete variables only, the predictive performance of the model average can be significantly better than those of single models selected by either criterion, and that differences between models selected by the two criterion can be substantial. 相似文献
12.
《Journal of Statistical Computation and Simulation》2012,82(10):1963-1986
The value at risk (VaR) is a risk measure that is widely used by financial institutions to allocate risk. VaR forecast estimation involves the evaluation of conditional quantiles based on the currently available information. Recent advances in VaR evaluation incorporate conditional variance into the quantile estimation, which yields the conditional autoregressive VaR (CAViaR) models. However, uncertainty with regard to model selection in CAViaR model estimators raises the issue of identifying the better quantile predictor via averaging. In this study, we propose a quasi-Bayesian model averaging method that generates combinations of conditional VaR estimators based on single CAViaR models. This approach provides us a basis for comparing single CAViaR models against averaged ones for their ability to forecast VaR. We illustrate this method using simulated and financial daily return data series. The results demonstrate significant findings with regard to the use of averaged conditional VaR estimates when forecasting quantile risk. 相似文献
13.
A method is suggested to estimate posterior model probabilities and model averaged parameters via MCMC sampling under a Bayesian approach. The estimates use pooled output for J models (J>1) whereby all models are updated at each iteration. Posterior probabilities are based on averages of continuous weights obtained for each model at each iteration, while samples of averaged parameters are obtained from iteration specific averages that are based on these weights. Parallel sampling of models assists in deriving posterior densities for parameter contrasts between models and in assessing hypotheses regarding model averaged parameters. Four worked examples illustrate application of the approach, two involving fixed effect regression, and two involving random effects. 相似文献
14.
Bayesian model comparison for compartmental models with applications in positron emission tomography
We develop strategies for Bayesian modelling as well as model comparison, averaging and selection for compartmental models with particular emphasis on those that occur in the analysis of positron emission tomography (PET) data. Both modelling and computational issues are considered. Biophysically inspired informative priors are developed for the problem at hand, and by comparison with default vague priors it is shown that the proposed modelling is not overly sensitive to prior specification. It is also shown that an additive normal error structure does not describe measured PET data well, despite being very widely used, and that within a simple Bayesian framework simultaneous parameter estimation and model comparison can be performed with a more general noise model. The proposed approach is compared with standard techniques using both simulated and real data. In addition to good, robust estimation performance, the proposed technique provides, automatically, a characterisation of the uncertainty in the resulting estimates which can be considerable in applications such as PET. 相似文献
15.
David R. Bickel 《Statistics》2018,52(3):552-570
Learning from model diagnostics that a prior distribution must be replaced by one that conflicts less with the data raises the question of which prior should instead be used for inference and decision. The same problem arises when a decision maker learns that one or more reliable experts express unexpected beliefs. In both cases, coherence of the solution would be guaranteed by applying Bayes's theorem to a distribution of prior distributions that effectively assigns the initial prior distribution a probability arbitrarily close to 1. The new distribution for inference would then be the distribution of priors conditional on the insight that the prior distribution lies in a closed convex set that does not contain the initial prior. A readily available distribution of priors needed for such conditioning is the law of the empirical distribution of sufficiently large number of independent parameter values drawn from the initial prior. According to the Gibbs conditioning principle from the theory of large deviations, the resulting new prior distribution minimizes the entropy relative to the initial prior. While minimizing relative entropy accommodates the necessity of going beyond the initial prior without departing from it any more than the insight demands, the large-deviation derivation also ensures the advantages of Bayesian coherence. This approach is generalized to uncertain insights by allowing the closed convex set of priors to be random. 相似文献
16.
Georg Man 《Journal of applied statistics》2018,45(5):845-867
Jointness is a Bayesian approach to capturing dependence among regressors in multivariate data. It addresses the general issue of whether explanatory factors for a given empirical phenomenon are complements or substitutes. I ask a number of questions about existing jointness concepts: Are the patterns revealed stable across datasets? Are results robust to prior choice and do data characteristics affect results? And importantly: What do the answers imply from a practical vista? The present study takes an applied, interdisciplinary and comparative perspective, validating jointness concepts on datasets across scientific fields with focus on life sciences (Parkinson's disease) and sociology. Simulations complement the study of real-world data. My findings suggest that results depend on which jointness concept is used: Some concepts deliver jointness patterns remarkably uniform across datasets, while all concepts are fairly robust to the choice of prior structure. This can be interpreted as critique of jointness from a practical perspective, given that the patterns revealed are at times very different and no concept emerges as overall advantageous. The composite indicators approach to combining information across jointness concepts is also explored, suggesting an avenue to facilitate the application of the concepts in future research. 相似文献
17.
Edward L. Boone Susan J. Simmons Haikun Bao Ann E. Stapleton 《Journal of applied statistics》2008,35(7):799-808
Quantitative trait loci (QTL) mapping is a growing field in statistical genetics. In plants, QTL detection experiments often feature replicates or clones within a specific genetic line. In this work, a Bayesian hierarchical regression model is applied to simulated QTL data and to a dataset from the Arabidopsis thaliana plants for locating the QTL mapping associated with cotyledon opening. A conditional model search strategy based on Bayesian model averaging is utilized to reduce the computational burden. 相似文献
18.
We consider the complete clinic visit records and environmental monitoring data at 50 townships and city districts where ambient air monitoring stations of Taiwan Air Quality Monitoring Stations are located. A Bayesian analysis is carried out using regression spline model on principal components obtained from several pollutant covariables. The appropriate model is selected using Bayesian model averaging. A brief account of our results is provided for the elderly patients group. 相似文献
19.
Ziwen Gao Jiahui Zou Xinyu Zhang Yanyuan Ma 《Scandinavian Journal of Statistics》2023,50(3):1325-1364
The envelope method produces efficient estimation in multivariate linear regression, and is widely applied in biology, psychology, and economics. This paper estimates parameters through a model averaging methodology and promotes the predicting abilities of the envelope models. We propose a frequentist model averaging method by minimizing a cross-validation criterion. When all the candidate models are misspecified, the proposed model averaging estimator is proved to be asymptotically optimal. When correct candidate models exist, the coefficient estimator is proved to be consistent, and the sum of the weights assigned to the correct models, in probability, converges to one. Simulations and an empirical application demonstrate the effectiveness of the proposed method. 相似文献
20.
We revisit the complete clinic visit records and environmental monitoring data at 50 townships and city districts of Taiwan. Extending the earlier analyses, here we consider a Bayesian analysis using Daubechies wavelet. Appropriate model selection is also considered using Bayesian model averaging. Temperature, dew point, and NO2 and CO of the current day and the previous day are identified as the pollutants in different areas of the island following some spatial pattern. 相似文献