期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Simulation-based sequential Bayesian design

Peter Müller Don A. Berry Andy P. Grieve Michael Smith Michael Krams 《Journal of statistical planning and inference》2007

We consider simulation-based methods for exploration and maximization of expected utility in sequential decision problems. We consider problems which require backward induction with analytically intractable expected utility integrals at each stage. We propose to use forward simulation to approximate the integral expressions, and a reduction of the allowable action space to avoid problems related to an increasing number of possible trajectories in the backward induction. The artificially reduced action space allows strategies to depend on the full history of earlier observations and decisions only indirectly through a low dimensional summary statistic. The proposed rule provides a finite-dimensional approximation to the unrestricted infinite-dimensional optimal decision rule. We illustrate the proposed approach with an application to an optimal stopping problem in a clinical trial. 相似文献

2.

Inference after variable selection using restricted permutation methods

Rui Wang Stephen W. Lagakos 《Revue canadienne de statistique》2009,37(4):625-644

When confronted with multiple covariates and a response variable, analysts sometimes apply a variable‐selection algorithm to the covariate‐response data to identify a subset of covariates potentially associated with the response, and then wish to make inferences about parameters in a model for the marginal association between the selected covariates and the response. If an independent data set were available, the parameters of interest could be estimated by using standard inference methods to fit the postulated marginal model to the independent data set. However, when applied to the same data set used by the variable selector, standard (“naive”) methods can lead to distorted inferences. The authors develop testing and interval estimation methods for parameters reflecting the marginal association between the selected covariates and response variable, based on the same data set used for variable selection. They provide theoretical justification for the proposed methods, present results to guide their implementation, and use simulations to assess and compare their performance to a sample‐splitting approach. The methods are illustrated with data from a recent AIDS study. The Canadian Journal of Statistics 37: 625–644; 2009 © 2009 Statistical Society of Canada 相似文献

3.

Bayesian variable selection approach to a Bernstein polynomial regression model with stochastic constraints

Taeryon Choi Seongil Jo 《Journal of applied statistics》2016,43(15):2751-2771

This paper provides a Bayesian estimation procedure for monotone regression models incorporating the monotone trend constraint subject to uncertainty. For monotone regression modeling with stochastic restrictions, we propose a Bayesian Bernstein polynomial regression model using two-stage hierarchical prior distributions based on a family of rectangle-screened multivariate Gaussian distributions extended from the work of Gurtis and Ghosh [7 S.M. Curtis and S.K. Ghosh, A variable selection approach to monotonic regression with Bernstein polynomials, J. Appl. Stat. 38 (2011), pp. 961–976. doi: 10.1080/02664761003692423[Taylor &; Francis Online], [Web of Science ®] , [Google Scholar]]. This approach reflects the uncertainty about the prior constraint, and thus proposes a regression model subject to monotone restriction with uncertainty. Based on the proposed model, we derive the posterior distributions for unknown parameters and present numerical schemes to generate posterior samples. We show the empirical performance of the proposed model based on synthetic data and real data applications and compare the performance to the Bernstein polynomial regression model of Curtis and Ghosh [7 S.M. Curtis and S.K. Ghosh, A variable selection approach to monotonic regression with Bernstein polynomials, J. Appl. Stat. 38 (2011), pp. 961–976. doi: 10.1080/02664761003692423[Taylor &; Francis Online], [Web of Science ®] , [Google Scholar]] for the shape restriction with certainty. We illustrate the effectiveness of our proposed method that incorporates the uncertainty of the monotone trend and automatically adapts the regression function to the monotonicity, through empirical analysis with synthetic data and real data applications. 相似文献

4.

Estimation and variable selection for partially functional linear models

Jiang Du Dengke Xu Ruiyuan Cao 《Journal of the Korean Statistical Society》2018,47(4):436-449

In this paper, a new estimation procedure based on composite quantile regression and functional principal component analysis (PCA) method is proposed for the partially functional linear regression models (PFLRMs). The proposed estimation method can simultaneously estimate both the parametric regression coefficients and functional coefficient components without specification of the error distributions. The proposed estimation method is shown to be more efficient empirically for non-normal random error, especially for Cauchy error, and almost as efficient for normal random errors. Furthermore, based on the proposed estimation procedure, we use the penalized composite quantile regression method to study variable selection for parametric part in the PFLRMs. Under certain regularity conditions, consistency, asymptotic normality, and Oracle property of the resulting estimators are derived. Simulation studies and a real data analysis are conducted to assess the finite sample performance of the proposed methods. 相似文献

5.

A sequential design for a clinical trial with a linear prognostic factor

José Antonio Moler Fernando Plo Miguel San Miguel 《Journal of statistical planning and inference》2007

We study a randomized adaptive design to assign one of the L

L

treatments to patients who arrive sequentially by means of an urn model. At each stage n

n

, a reward is distributed between treatments. The treatment applied is rewarded according to its response, 0?_Y_n?1

0 ? Y_{n} ? 1

, and 1-_Y_n

1 - Y_{n}

is distributed among the other treatments according to their performance until stage n-1

n - 1

. Patients can be classified in K+1

K + 1

levels and we assume that the effect of this level in the response to the treatments is linear. We study the asymptotic behavior of the design when the ordinary least square estimators are used as a measure of performance until stage n-1

n - 1

. 相似文献

6.

Sparse Bayesian variable selection in multinomial probit regression model with application to high-dimensional data classification

Yang Aijun Xiang Liming Lin Jinguan 《统计学通讯:理论与方法》2017,46(12):6137-6150

Here we consider a multinomial probit regression model where the number of variables substantially exceeds the sample size and only a subset of the available variables is associated with the response. Thus selecting a small number of relevant variables for classification has received a great deal of attention. Generally when the number of variables is substantial, sparsity-enforcing priors for the regression coefficients are called for on grounds of predictive generalization and computational ease. In this paper, we propose a sparse Bayesian variable selection method in multinomial probit regression model for multi-class classification. The performance of our proposed method is demonstrated with one simulated data and three well-known gene expression profiling data: breast cancer data, leukemia data, and small round blue-cell tumors. The results show that compared with other methods, our method is able to select the relevant variables and can obtain competitive classification accuracy with a small subset of relevant genes. 相似文献

7.

Bayesian robust transformation and variable selection: A unified approach

Raphael Gottardo Adrian Raftery 《Revue canadienne de statistique》2009,37(3):361-380

The authors consider the problem of simultaneous transformation and variable selection for linear regression. They propose a fully Bayesian solution to the problem, which allows averaging over all models considered including transformations of the response and predictors. The authors use the Box‐Cox family of transformations to transform the response and each predictor. To deal with the change of scale induced by the transformations, the authors propose to focus on new quantities rather than the estimated regression coefficients. These quantities, referred to as generalized regression coefficients, have a similar interpretation to the usual regression coefficients on the original scale of the data, but do not depend on the transformations. This allows probabilistic statements about the size of the effect associated with each variable, on the original scale of the data. In addition to variable and transformation selection, there is also uncertainty involved in the identification of outliers in regression. Thus, the authors also propose a more robust model to account for such outliers based on a t‐distribution with unknown degrees of freedom. Parameter estimation is carried out using an efficient Markov chain Monte Carlo algorithm, which permits moves around the space of all possible models. Using three real data sets and a simulated study, the authors show that there is considerable uncertainty about variable selection, choice of transformation, and outlier identification, and that there is advantage in dealing with all three simultaneously. The Canadian Journal of Statistics 37: 361–380; 2009 © 2009 Statistical Society of Canada 相似文献

8.

Bayesian variable selection in Poisson change-point regression analysis

S. Min 《统计学通讯:模拟与计算》2017,46(3):2267-2282

In this article, we develop a Bayesian variable selection method that concerns selection of covariates in the Poisson change-point regression model with both discrete and continuous candidate covariates. Ranging from a null model with no selected covariates to a full model including all covariates, the Bayesian variable selection method searches the entire model space, estimates posterior inclusion probabilities of covariates, and obtains model averaged estimates on coefficients to covariates, while simultaneously estimating a time-varying baseline rate due to change-points. For posterior computation, the Metropolis-Hastings within partially collapsed Gibbs sampler is developed to efficiently fit the Poisson change-point regression model with variable selection. We illustrate the proposed method using simulated and real datasets. 相似文献

9.

Bayesian selection of log-linear models

James H. Albert 《Revue canadienne de statistique》1996,24(3):327-347

A general methodology is presented for finding suitable Poisson log-linear models with applications to multiway contingency tables. Mixtures of multivariate normal distributions are used to model prior opinion when a subset of the regression vector is believed to be nonzero. This prior distribution is studied for two- and three-way contingency tables, in which the regression coefficients are interpretable in terms of odds ratios in the table. Efficient and accurate schemes are proposed for calculating the posterior model probabilities. The methods are illustrated for a large number of two-way simulated tables and for two three-way tables. These methods appear to be useful in selecting the best log-linear model and in estimating parameters of interest that reflect uncertainty in the true model. 相似文献

10.

Bayesian transformation family selection: Moving toward a transformed Gaussian universe

下载免费PDF全文

Efstratia Charitidou Dimitris Fouskakis Ioannis Ntzoufras 《Revue canadienne de statistique》2015,43(4):600-623

相似文献

11.

Using the EM algorithm for Bayesian variable selection in logistic regression models with related covariates

M. D. Koslovsky M. D. Swartz L. Leon-Novelo W. Chan A. V. Wilkinson 《Journal of Statistical Computation and Simulation》2018,88(3):575-596

We develop a Bayesian variable selection method for logistic regression models that can simultaneously accommodate qualitative covariates and interaction terms under various heredity constraints. We use expectation-maximization variable selection (EMVS) with a deterministic annealing variant as the platform for our method, due to its proven flexibility and efficiency. We propose a variance adjustment of the priors for the coefficients of qualitative covariates, which controls false-positive rates, and a flexible parameterization for interaction terms, which accommodates user-specified heredity constraints. This method can handle all pairwise interaction terms as well as a subset of specific interactions. Using simulation, we show that this method selects associated covariates better than the grouped LASSO and the LASSO with heredity constraints in various exploratory research scenarios encountered in epidemiological studies. We apply our method to identify genetic and non-genetic risk factors associated with smoking experimentation in a cohort of Mexican-heritage adolescents. 相似文献

12.

Optimum designs in complete two-way layouts

N. Gaffke O. Krafft 《Journal of statistical planning and inference》1979,3(2):119-126

In the usual two-way layout of ANOVA (interactions are admitted) let nij ? 1 be the number of observations for the factor-level combination(i, j). For testing the hypothesis that all main effects of the first factor vanish numbers

n^{1} ij

are given such that the power function of the F-test is uniformly maximized (U-optimality), if one considers only designs (nij) for which the row-sums ni are prescribed. Furthermore, in the (larger) set of all designs for which the total number of observations is given, all D-optimum designs are constructed. 相似文献

13.

Empirical Bayes regression analysis with many regressors but fewer observations

Muni S. Srivastava Tatsuya Kubokawa 《Journal of statistical planning and inference》2007

In this paper, we consider the prediction problem in multiple linear regression model in which the number of predictor variables, p, is extremely large compared to the number of available observations, n . The least-squares predictor based on a generalized inverse is not efficient. We propose six empirical Bayes estimators of the regression parameters. Three of them are shown to have uniformly lower prediction error than the least-squares predictors when the vector of regressor variables are assumed to be random with mean vector zero and the covariance matrix (1/n)X^tX

(1 / n) X^{t} X

where X^t=(_x₁,…,_x_n)

X^{t} = (x_{1}, \dots, x_{n})

is the p×n

p \times n

matrix of observations on the regressor vector centered from their sample means. For other estimators, we use simulation to show its superiority over the least-squares predictor. 相似文献

14.

Instrumental variable based variable selection for generalized linear models with endogenous covariates

Jiting Huang 《统计学通讯:模拟与计算》2019,48(6):1891-1900

We study the variable selection problem for a class of generalized linear models with endogenous covariates. Based on the instrumental variable adjustment technology and the smooth-threshold estimating equation (SEE) method, we propose an instrumental variable based variable selection procedure. The proposed variable selection method can attenuate the effect of endogeneity in covariates, and is easy for application in practice. Some theoretical results are also derived such as the consistency of the proposed variable selection procedure and the convergence rate of the resulting estimator. Further, some simulation studies and a real data analysis are conducted to evaluate the performance of the proposed method, and simulation results show that the proposed method is workable. 相似文献

15.

A novel bagging approach for variable ranking and selection via a mixed importance measure

Chun-Xia Zhang Jiang-She Zhang Guan-Wei Wang Nan-Nan Ji 《Journal of applied statistics》2018,45(10):1734-1755

At present, ensemble learning has exhibited its great power in stabilizing and enhancing the performance of some traditional variable selection methods such as lasso and genetic algorithm. In this paper, a novel bagging ensemble method called BSSW is developed to implement variable ranking and selection in linear regression models. Its main idea is to execute stepwise search algorithm on multiple bootstrap samples. In each trial, a mixed importance measure is assigned to each variable according to the order that it is selected into final model as well as the improvement of model fitting resulted from its inclusion. Based on the importance measure averaged across some bootstrapping trials, all candidate variables are ranked and then decided to be important or not. To extend the scope of application, BSSW is extended to the situation of generalized linear models. Experiments carried out with some simulated and real data indicate that BSSW achieves better performance in most studied cases when compared with several other existing methods. 相似文献

16.

Bayesian variable selection with spherically symmetric priors

Michiel B. De Kock Hans C. Eggers 《统计学通讯:理论与方法》2017,46(9):4250-4263

We propose that Bayesian variable selection for linear parametrizations with Gaussian iid likelihoods should be based on the spherical symmetry of the diagonalized parameter space. Our r-prior results in closed forms for the evidence for four examples, including the hyper-g prior and the Zellner–Siow prior, which are shown to be special cases. Scenarios of a single-variable dispersion parameter and of fixed dispersion are studied, and asymptotic forms comparable to the traditional information criteria are derived. A simulation exercise shows that model comparison based on our r-prior gives good results comparable to or better than current model comparison schemes. 相似文献

17.

Stochastic volatility modelling in continuous time with general marginal distributions: Inference,prediction and model selection

Matthew P.S. Gander David A. Stephens 《Journal of statistical planning and inference》2007

We compare results for stochastic volatility models where the underlying volatility process having generalized inverse Gaussian (GIG) and tempered stable marginal laws. We use a continuous time stochastic volatility model where the volatility follows an Ornstein–Uhlenbeck stochastic differential equation driven by a Lévy process. A model for long-range dependence is also considered, its merit and practical relevance discussed. We find that the full GIG and a special case, the inverse gamma, marginal distributions accurately fit real data. Inference is carried out in a Bayesian framework, with computation using Markov chain Monte Carlo (MCMC). We develop an MCMC algorithm that can be used for a general marginal model. 相似文献

18.

Analysis of middle-censored data with exponential lifetime distributions

Srikanth K. Iyer S. Rao Jammalamadaka Debasis Kundu 《Journal of statistical planning and inference》2008

Recently Jammalamadaka and Mangalam [2003. Non-parametric estimation for middle censored data. J. Nonparametric Statist. 15, 253–265] introduced a general censoring scheme called the “middle-censoring” scheme in non-parametric set up. In this paper we consider this middle-censoring scheme when the lifetime distribution of the items is exponentially distributed and the censoring mechanism is independent and non-informative. In this set up, we derive the maximum likelihood estimator and study its consistency and asymptotic normality properties. We also derive the Bayes estimate of the exponential parameter under a gamma prior. Since a theoretical construction of the credible interval becomes quite difficult, we propose and implement Gibbs sampling technique to construct the credible intervals. Monte Carlo simulations are performed to evaluate the small sample behavior of the techniques proposed. A real data set is analyzed to illustrate the practical application of the proposed methods. 相似文献

19.

Bayesian optimal design for changepoint problems

Juli Atherton Benoit Charbonneau David B. Wolfson Lawrence Joseph Xiaojie Zhou Alain C. Vandal 《Revue canadienne de statistique》2009,37(4):495-513

We investigate Bayesian optimal designs for changepoint problems. We find robust optimal designs which allow for arbitrary distributions before and after the change, arbitrary prior densities on the parameters before and after the change, and any log‐concave prior density on the changepoint. We define a new design measure for Bayesian optimal design problems as a means of finding the optimal design. Our results apply to any design criterion function concave in the design measure. We illustrate our results by finding the optimal design in a problem motivated by a previous clinical trial. The Canadian Journal of Statistics 37: 495–513; 2009 © 2009 Statistical Society of Canada 相似文献

20.

Quickly variable selection for varying coefficient models with missing response at random

Jian Wu Liugen Xue Peixin Zhao 《统计学通讯:理论与方法》2018,47(10):2327-2336

This paper focuses on the variable selections for a varying coefficient models with missing response at random. A procedure is presented by basis function approximations with smooth-threshold estimating equations. Furthermore, the proposed method selects significant variables and estimates coefficients simultaneously avoiding the problem of solving a convex optimization, which reduced the burden of computation. Compared to existing equation based approaches, our procedure is more efficient and quick. With proper choices the regularization parameter, the resulting estimates perform an oracle property. A cross-validation for tuning parameter selection is also proposed, a numerical study confirms the performance of the proposed method. 相似文献