排序方式: 共有16条查询结果,搜索用时 31 毫秒
1.
The B-spline representation is a common tool to improve the fitting of smooth nonlinear functions, it offers a fitting as a piecewise polynomial. The regions that define the pieces are separated by a sequence of knots. The main difficulty in this type of modeling is the choice of the number and the locations of these knots. The Reversible Jump Markov Chain Monte Carlo (RJMCMC) algorithm provides a solution to simultaneously select these two parameters by considering the knots as free parameters. This algorithm belongs to the MCMC techniques that allow simulations from target distributions on spaces of varying dimension. The aim of the present investigation is to use this algorithm in the framework of the analysis of survival time, for the Cox model in particular. In fact, the relation between the hazard ratio function and the covariates being assumed to be log-linear, this assumption is too restrictive. Thus, we propose to use the RJMCMC algorithm to model the log hazard ratio function by a B-spline representation with an unknown number of knots at unknown locations. This method is illustrated with two real data sets: the Stanford heart transplant data and lung cancer survival data. Another application of the RJMCMC is selecting the significant covariates, and a simulation study is performed. 相似文献
2.
《Journal of Statistical Computation and Simulation》2012,82(2):115-140
Although the effect of missing data on regression estimates has received considerable attention, their effect on predictive performance has been neglected. We studied the performance of three missing data strategies—omission of records with missing values, replacement with a mean and imputation based on regression—on the predictive performance of logistic regression (LR), classification tree (CT) and neural network (NN) models in the presence of data missing completely at random (MCAR). Models were constructed using datasets of size 500 simulated from a joint distribution of binary and continuous predictors including nonlinearities, collinearity and interactions between variables. Though omission produced models that fit better on the data from which the models were developed, imputation was superior on average to omission for all models when evaluating the receiver operating characteristic (ROC) curve area, mean squared error (MSE), pooled variance across outcome categories and calibration X 2 on an independently generated test set. However, in about one-third of simulations, omission performed better. Performance was also more variable with omission including quite a few instances of extremely poor performance. Replacement and imputation generally produced similar results except with neural networks for which replacement, the strategy typically used in neural network algorithms, was inferior to imputation. Missing data affected simpler models much less than they did more complex models such as generalized additive models that focus on local structure For moderate sized datasets, logistic regressions that use simple nonlinear structures such as quadratic terms and piecewise linear splines appear to be at least as robust to randomly missing values as neural networks and classification trees. 相似文献
3.
Quantile regression, including median regression, as a more completed statistical model than mean regression, is now well
known with its wide spread applications. Bayesian inference on quantile regression or Bayesian quantile regression has attracted
much interest recently. Most of the existing researches in Bayesian quantile regression focus on parametric quantile regression,
though there are discussions on different ways of modeling the model error by a parametric distribution named asymmetric Laplace
distribution or by a nonparametric alternative named scale mixture asymmetric Laplace distribution. This paper discusses Bayesian
inference for nonparametric quantile regression. This general approach fits quantile regression curves using piecewise polynomial
functions with an unknown number of knots at unknown locations, all treated as parameters to be inferred through reversible
jump Markov chain Monte Carlo (RJMCMC) of Green (Biometrika 82:711–732, 1995). Instead of drawing samples from the posterior, we use regression quantiles to create Markov chains for the estimation of
the quantile curves. We also use approximate Bayesian factor in the inference. This method extends the work in automatic Bayesian
mean curve fitting to quantile regression. Numerical results show that this Bayesian quantile smoothing technique is competitive
with quantile regression/smoothing splines of He and Ng (Comput. Stat. 14:315–337, 1999) and P-splines (penalized splines) of Eilers and de Menezes (Bioinformatics 21(7):1146–1153, 2005). 相似文献
4.
5.
Jianhua Z. Huang Lijian Yang 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2004,66(2):463-477
Summary. We propose a lag selection method for non-linear additive autoregressive models that is based on spline estimation and the Bayes information criterion. The additive structure of the autoregression function is used to overcome the 'curse of dimensionality', whereas the spline estimators effectively take into account such a structure in estimation. A stepwise procedure is suggested to implement the method proposed. A comprehensive Monte Carlo study demonstrates good performance of the method proposed and a substantial computational advantage over existing local-polynomial-based methods. Consistency of the lag selection method based on the Bayes information criterion is established under the assumption that the observations are from a stochastic process that is strictly stationary and strongly mixing, which provides the first theoretical result of this kind for spline smoothing of weakly dependent data. 相似文献
6.
Patricia L. Smith 《The American statistician》2013,67(2):57-62
The framework for a unified statistical theory of spline regression assuming fixed knots using the truncated polynomial or “+” function representation is presented. In particular, a partial ordering of some spline models is introduced to clarify their relationship and to indicate the hypotheses that can be tested by using either standard multiple regression procedures or a little-used conditional test developed by Hotelling (1940). The construction of spline models with polynomial pieces of different degrees is illustrated. A numerical example from a chemical experiment is given by using the GLM procedure of the statistical software package SAS (Barr et al. 1976). 相似文献
7.
This paper points out the need for performance measures in the context of simulation optimization and suggests six such measures. Two of the measures are indications of absolute performance, whereas the other four are useful in assessing the relative performance of various candidate metamodels. The measures assess performance on three fronts: accuracy of placing optima in the correct location, fit to the response, and fit to the character of the surface (expressed in terms of the number of optima). Examples are given providing evidence of the measures' utility—one in a limited scenario deciding which of two competing metamodels to use as simulation optimization response surfaces vary, and the other in a scenario of a researcher developing a new, sequential optimization search procedure. 相似文献
8.
Matthew Schipper Jeremy M. G. Taylor Xihong Lin 《Journal of the Royal Statistical Society. Series C, Applied statistics》2008,57(2):149-163
Summary. Normal tissue complications are a common side effect of radiation therapy. They are the consequence of the dose of radiation that is received by the normal tissue surrounding the site of the tumour. Within a specified organ each voxel receives a certain dose of radiation, leading to a distribution of doses over the organ. It is often not known what aspect of the dose distribution drives the presence and severity of the complications. A summary measure of the dose distribution can be obtained by integrating a weighting function of dose ( w ( d )) over the density of dose. For biological reasons the weight function should be monotonic. We propose a generalized monotonic functional mixed model to study the dose effect on a clinical outcome by estimating this weight function non-parametrically by using splines and subject to the monotonicity constraint, while allowing for overdispersion and correlation of multiple obervations within the same subject. We illustrate our method with data from a head and neck cancer study in which the irradiation of the parotid gland results in loss of saliva flow. 相似文献
9.
Quantile Curves without Crossing 总被引:1,自引:0,他引:1
Xuming He 《The American statistician》2013,67(2):186-192
10.
Lichun Wang 《统计学通讯:理论与方法》2013,42(15):2731-2740
In this article, we consider Bayesian inferences for the heteroscedastic nonparametric regression models, when both the mean function and variance function are unknown. We demonstrated consistency of posterior distributions for this model using priors induced by B-splines expansion, treating both random and deterministic covariates in a uniform manner. 相似文献