期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Models for Dependent Extremes Using Stable Mixtures 总被引：1，自引：0，他引：1

ANNE-LAURE FOUGÈRES JOHN P. NOLAN HOLGER ROOTZÉN 《Scandinavian Journal of Statistics》2009,36(1):42-59

Abstract. This paper unifies and extends results on a class of multivariate extreme value (EV) models studied by Hougaard, Crowder and Tawn. In these models, both unconditional and conditional distributions are themselves EV distributions, and all lower-dimensional marginals and maxima belong to the class. One interpretation of the models is as size mixtures of EV distributions, where the mixing is by positive stable distributions. A second interpretation is as exponential-stable location mixtures (for Gumbel) or as power-stable scale mixtures (for non-Gumbel EV distributions). A third interpretation is through a peaks over thresholds model with a positive stable intensity. The mixing variables are used as a modelling tool and for better understanding and model checking. We study EV analogues of components of variance models, and new time series, spatial and continuous parameter models for extreme values. The results are applied to data from a pitting corrosion investigation. 相似文献

2.

Simultaneous inference in structured additive conditional copula regression models: a unifying Bayesian approach

Nadja Klein Thomas Kneib 《Statistics and Computing》2016,26(4):841-860

While most regression models focus on explaining distributional aspects of one single response variable alone, interest in modern statistical applications has recently shifted towards simultaneously studying multiple response variables as well as their dependence structure. A particularly useful tool for pursuing such an analysis are copula-based regression models since they enable the separation of the marginal response distributions and the dependence structure summarised in a specific copula model. However, so far copula-based regression models have mostly been relying on two-step approaches where the marginal distributions are determined first whereas the copula structure is studied in a second step after plugging in the estimated marginal distributions. Moreover, the parameters of the copula are mostly treated as a constant not related to covariates and most regression specifications for the marginals are restricted to purely linear predictors. We therefore propose simultaneous Bayesian inference for both the marginal distributions and the copula using computationally efficient Markov chain Monte Carlo simulation techniques. In addition, we replace the commonly used linear predictor by a generic structured additive predictor comprising for example nonlinear effects of continuous covariates, spatial effects or random effects and furthermore allow to make the copula parameters covariate-dependent. To facilitate Bayesian inference, we construct proposal densities for a Metropolis–Hastings algorithm relying on quadratic approximations to the full conditionals of regression coefficients avoiding manual tuning. The performance of the resulting Bayesian estimates is evaluated in simulations comparing our approach with penalised likelihood inference, studying the choice of a specific copula model based on the deviance information criterion, and comparing a simultaneous approach with a two-step procedure. Furthermore, the flexibility of Bayesian conditional copula regression models is illustrated in two applications on childhood undernutrition and macroecology. 相似文献

3.

State-space models for multivariate longitudinal data of mixed types

Bent Jrgensen Sren Lundbye-Christensen Peter Xue-Kun Song Li Sun 《Revue canadienne de statistique》1996,24(3):385-402

We propose a class of state-space models for multivariate longitudinal data where the components of the response vector may have different distributions. The approach is based on the class of Tweedie exponential dispersion models, which accommodates a wide variety of discrete, continuous and mixed data. The latent process is assumed to be a Markov process, and the observations are conditionally independent given the latent process, over time as well as over the components of the response vector. This provides a fully parametric alternative to the quasilikelihood approach of Liang and Zeger. We estimate the regression parameters for time-varying covariates entering either via the observation model or via the latent process, based on an estimating equation derived from the Kalman smoother. We also consider analysis of residuals from both the observation model and the latent process. 相似文献

4.

A model for extreme wind gusts

David Walshaw & Clive W. Anderson 《Journal of the Royal Statistical Society. Series C, Applied statistics》2000,49(4):499-508

Estimates of the largest wind gust that will occur at a given location over a specified period are required by civil engineers. Estimation is usually based on models which are derived from the limiting distributions of maxima of stationary time series and which are fitted to data on extreme gusts. In this paper we develop a model for maximum gusts which also incorporates data on hourly mean speeds through a distributional relationship between maxima and means. This joint model is closely linked to the physical processes which generate the most extreme values and thus provides a mechanism by which data on means can augment those on gusts. It is argued that this increases the credibility of extrapolation in estimates of long period return gusts. The model is shown to provide a good fit to data obtained at a location in northern England and is compared with a more traditional modelling approach, which also performs well for this site. 相似文献

5.

General location multivariate latent variable models for mixed correlated bounded continuous,ordinal, and nominal responses with non-ignorable missing data

Elham Tabrizi Ehsan Bahrami Samani Mojtaba Ganjali 《Journal of applied statistics》2021,48(5):765

Using a multivariate latent variable approach, this article proposes some new general models to analyze the correlated bounded continuous and categorical (nominal or/and ordinal) responses with and without non-ignorable missing values. First, we discuss regression methods for jointly analyzing continuous, nominal, and ordinal responses that we motivated by analyzing data from studies of toxicity development. Second, using the beta and Dirichlet distributions, we extend the models so that some bounded continuous responses are replaced for continuous responses. The joint distribution of the bounded continuous, nominal and ordinal variables is decomposed into a marginal multinomial distribution for the nominal variable and a conditional multivariate joint distribution for the bounded continuous and ordinal variables given the nominal variable. We estimate the regression parameters under the new general location models using the maximum-likelihood method. Sensitivity analysis is also performed to study the influence of small perturbations of the parameters of the missing mechanisms of the model on the maximal normal curvature. The proposed models are applied to two data sets: BMI, Steatosis and Osteoporosis data and Tehran household expenditure budgets. 相似文献

6.

Exponentiated Weibull regression for time-to-event data

Shahedul A. Khan 《Lifetime data analysis》2018,24(2):328-354

The Weibull, log-logistic and log-normal distributions are extensively used to model time-to-event data. The Weibull family accommodates only monotone hazard rates, whereas the log-logistic and log-normal are widely used to model unimodal hazard functions. The increasing availability of lifetime data with a wide range of characteristics motivate us to develop more flexible models that accommodate both monotone and nonmonotone hazard functions. One such model is the exponentiated Weibull distribution which not only accommodates monotone hazard functions but also allows for unimodal and bathtub shape hazard rates. This distribution has demonstrated considerable potential in univariate analysis of time-to-event data. However, the primary focus of many studies is rather on understanding the relationship between the time to the occurrence of an event and one or more covariates. This leads to a consideration of regression models that can be formulated in different ways in survival analysis. One such strategy involves formulating models for the accelerated failure time family of distributions. The most commonly used distributions serving this purpose are the Weibull, log-logistic and log-normal distributions. In this study, we show that the exponentiated Weibull distribution is closed under the accelerated failure time family. We then formulate a regression model based on the exponentiated Weibull distribution, and develop large sample theory for statistical inference. We also describe a Bayesian approach for inference. Two comparative studies based on real and simulated data sets reveal that the exponentiated Weibull regression can be valuable in adequately describing different types of time-to-event data. 相似文献

7.

Generalized symmetrical partial linear model

Julio Cezar Souza Vasconcelos Cristian Villegas 《Journal of applied statistics》2021,48(3):557

In this work, we propose a new model called generalized symmetrical partial linear model, based on the theory of generalized linear models and symmetrical distributions. In our model the response variable follows a symmetrical distribution such a normal, Student-t, power exponential, among others. Following the context of generalized linear models we consider replacing the traditional linear predictors by the more general predictors in whose case one covariate is related with the response variable in a non-parametric fashion, that we do not specified the parametric function. As an example, we could imagine a regression model in which the intercept term is believed to vary in time or geographical location. The backfitting algorithm is used for estimating the parameters of the proposed model. We perform a simulation study for assessing the behavior of the penalized maximum likelihood estimators. We use the quantile residuals for checking the assumption of the model. Finally, we analyzed real data set related with pH rivers in Ireland. 相似文献

8.

The Kumaraswamy distribution: median-dispersion re-parameterizations for regression modeling and simulation-based estimation

Pablo A. Mitnik Sunyoung Baek 《Statistical Papers》2013,54(1):177-192

The Kumaraswamy distribution is very similar to the Beta distribution, but has the important advantage of an invertible closed form cumulative distribution function. The parameterization of the distribution in terms of shape parameters and the lack of simple expressions for its mean and variance hinder, however, its utilization with modeling purposes. The paper presents two median-dispersion re-parameterizations of the Kumaraswamy distribution aimed at facilitating its use in regression models in which both the location and the dispersion parameters are functions of their own distinct sets of covariates, and in latent-variable and other models estimated through simulation-based methods. In both re-parameterizations the dispersion parameter establishes a quantile-spread order among Kumaraswamy distributions with the same median and support. The study also describes the behavior of the re-parameterized distributions, determines some of their limiting distributions, and discusses the potential comparative advantages of using them in the context of regression modeling and simulation-based estimation. 相似文献

9.

Likelihood-based discrimination between separate scale and regression models

《Journal of statistical planning and inference》2006,136(10):3539-3553

相似文献

10.

Integer autoregressive models with structural breaks

Akanksha S. Kashikar T.V. Ramanathan 《Journal of applied statistics》2013,40(12):2653-2669

Even though integer-valued time series are common in practice, the methods for their analysis have been developed only in recent past. Several models for stationary processes with discrete marginal distributions have been proposed in the literature. Such processes assume the parameters of the model to remain constant throughout the time period. However, this need not be true in practice. In this paper, we introduce non-stationary integer-valued autoregressive (INAR) models with structural breaks to model a situation, where the parameters of the INAR process do not remain constant over time. Such models are useful while modelling count data time series with structural breaks. The Bayesian and Markov Chain Monte Carlo (MCMC) procedures for the estimation of the parameters and break points of such models are discussed. We illustrate the model and estimation procedure with the help of a simulation study. The proposed model is applied to the two real biometrical data sets. 相似文献

11.

Prior Distributions for Item Parameters in IRT Models

《统计学通讯:理论与方法》2012,41(16-17):2944-2958

The focus of this article is on the choice of suitable prior distributions for item parameters within item response theory (IRT) models. In particular, the use of empirical prior distributions for item parameters is proposed. Firstly, regression trees are implemented in order to build informative empirical prior distributions. Secondly, model estimation is conducted within a fully Bayesian approach through the Gibbs sampler, which makes estimation feasible also with increasingly complex models. The main results show that item parameter recovery is improved with the introduction of empirical prior information about item parameters, also when only a small sample is available. 相似文献

12.

New regression model with four regression structures and computational aspects

Thiago G. Ramires Gauss M. Cordeiro Gilberto A. Paula Niel Hens 《统计学通讯:模拟与计算》2018,47(7):1940-1962

A new general class of exponentiated sinh Cauchy regression models for location, scale, and shape parameters is introduced and studied. It may be applied to censored data and used more effectively in survival analysis when compared with the usual models. For censored data, we employ a frequentist analysis for the parameters of the proposed model. Further, for different parameter settings, sample sizes, and censoring percentages, various simulations are performed. The extended regression model is very useful for the analysis of real data and could give more adequate fits than other special regression models. 相似文献

13.

Parametric simultaneous robust inferences for regression coefficient under generalized linear models

《Journal of Statistical Computation and Simulation》2012,82(4):850-867

In this article, the parametric robust regression approaches are proposed for making inferences about regression parameters in the setting of generalized linear models (GLMs). The proposed methods are able to test hypotheses on the regression coefficients in the misspecified GLMs. More specifically, it is demonstrated that with large samples, the normal and gamma regression models can be properly adjusted to become asymptotically valid for inferences about regression parameters under model misspecification. These adjusted regression models can provide the correct type I and II error probabilities and the correct coverage probability for continuous data, as long as the true underlying distributions have finite second moments. 相似文献

14.

Bayesian LASSO-Regularized quantile regression for linear regression models with autoregressive errors

Yuzhu Tian Silian Shen Ge Lu Manlai Tang Maozai Tian 《统计学通讯:模拟与计算》2019,48(3):777-796

Quantile regression (QR) is a natural alternative for depicting the impact of covariates on the conditional distributions of a outcome variable instead of the mean. In this paper, we investigate Bayesian regularized QR for the linear models with autoregressive errors. LASSO-penalized type priors are forced on regression coefficients and autoregressive parameters of the model. Gibbs sampler algorithm is employed to draw the full posterior distributions of unknown parameters. Finally, the proposed procedures are illustrated by some simulation studies and applied to a real data analysis of the electricity consumption. 相似文献

15.

A Bayesian cluster analysis of election results

Xavier Puig 《Journal of applied statistics》2014,41(1):73-94

A Bayesian cluster analysis for the results of an election based on multinomial mixture models is proposed. The number of clusters is chosen based on the careful comparison of the results with predictive simulations from the models, and by checking whether models capture most of the spatial dependence in the results. By implementing the analysis on five recent elections in Barcelona, the reader is walked through the choice of the best statistics and graphical displays to help chose a model and present the results. Even though the models do not use any information about the location of the areas in which the results are broken into, in the example they uncover a four-cluster structure with a strong spatial dependence, that is very stable over time and relates to the demographic composition. 相似文献

16.

A Bayesian estimation of lag lengths in distributed lag models

《Journal of Statistical Computation and Simulation》2012,82(2):415-427

Dynamic regression models are widely used because they express and model the behaviour of a system over time. In this article, two dynamic regression models, the distributed lag (DL) model and the autoregressive distributed lag model, are evaluated focusing on their lag lengths. From a classical statistics point of view, there are various methods to determine the number of lags, but none of them are the best in all situations. This is a serious issue since wrong choices will provide bad estimates for the effects of the regressors on the response variable. We present an alternative for the aforementioned problems by considering a Bayesian approach. The posterior distributions of the numbers of lags are derived under an improper prior for the model parameters. The fractional Bayes factor technique [A. O'Hagan, Fractional Bayes factors for model comparison (with discussion), J. R. Statist. Soc. B 57 (1995), pp. 99–138] is used to handle the indeterminacy in the likelihood function caused by the improper prior. The zero-one loss function is used to penalize wrong decisions. A naive method using the specified maximum number of DLs is also presented. The proposed and the naive methods are verified using simulation data. The results are promising for the method we proposed. An illustrative example with a real data set is provided. 相似文献

17.

Structural changes in multivariate regression models

David H. Moen Diego Salazar Lyle D. Bromeling 《统计学通讯:理论与方法》2013,42(8):1757-1768

This study generalizes the work of chin choy and Broemeling (1980) who investigated the change in the regression parameters of univariate linear models.

The marginal posterior distributions of the change point, the regression matrices,and the precision matrix are found with the use of a proper multivariate normal-Wishart distribution for the parameters of the model.

A numerical study is undertaken in order to gain some insight into the effect that changes in sample size and certain parameter values have on these marginal posterior distributions. 相似文献

18.

Nonstationary regression models with a lagged dependent variable

Tony S. Wirjanto Robert A. Amano 《统计学通讯:理论与方法》2013,42(7):1489-1503

This paper studies regression models with a lagged dependent variable when both the dependent and independent variables are nonstationary, and the regression model is misspecified in some dimension. In particular, we discuss the limiting properties of leastsquares estimates of the parameters in such regression models, and the limiting distributions of their test statistics. We show that the estimate of the lagged dependent variable tends to unity asymptotically independent of its true value, while the estimates of the independent variables tend to zero. The limiting distributions of their test statistics are shown to diverge with sample size. 相似文献

19.

Bayesian adaptive Lasso for quantile regression models with nonignorably missing response data

Dengke Xu Niansheng Tang 《统计学通讯:模拟与计算》2013,42(9):2727-2742

Abstract

Handling data with the nonignorably missing mechanism is still a challenging problem in statistics. In this paper, we develop a fully Bayesian adaptive Lasso approach for quantile regression models with nonignorably missing response data, where the nonignorable missingness mechanism is specified by a logistic regression model. The proposed method extends the Bayesian Lasso by allowing different penalization parameters for different regression coefficients. Furthermore, a hybrid algorithm that combined the Gibbs sampler and Metropolis-Hastings algorithm is implemented to simulate the parameters from posterior distributions, mainly including regression coefficients, shrinkage coefficients, parameters in the non-ignorable missing models. Finally, some simulation studies and a real example are used to illustrate the proposed methodology. 相似文献

20.

Nonlinear mixed-effects models with scale mixture of skew-normal distributions

Marcos Antonio Alves Pereira Cibele Maria Russo 《Journal of applied statistics》2019,46(9):1602-1620

Aiming to avoid the sensitivity in the parameters estimation due to atypical observations or skewness, we develop asymmetric nonlinear regression models with mixed-effects, which provide alternatives to the use of normal distribution and other symmetric distributions. Nonlinear models with mixed-effects are explored in several areas of knowledge, especially when data are correlated, such as longitudinal data, repeated measures and multilevel data, in particular, for their flexibility in dealing with measures of areas such as economics and pharmacokinetics. The random components of the present model are assumed to follow distributions that belong to scale mixtures of skew-normal (SMSN) distribution family, that encompasses distributions with light and heavy tails, such as skew-normal, skew-Student-t, skew-contaminated normal and skew-slash, as well as symmetrical versions of these distributions. For the parameters estimation we obtain a numerical solution via the EM algorithm and its extensions, and the Newton-Raphson algorithm. An application with pharmacokinetic data shows the superiority of the proposed models, for which the skew-contaminated normal distribution has shown to be the most adequate distribution. A brief simulation study points to good properties of the parameter vector estimators obtained by the maximum likelihood method. 相似文献