首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
In this paper we introduce a flexible extension of the Gumbel distribution called the odd log-logistic exponentiated Gumbel distribution. The new model was implemented in GAMLSS package of R software and a brief tutorial on how to use this package is presented throughout the paper. We provide a comprehensive treatment of its general mathematical properties. Further, we propose a new extended regression model considering four regression structures. We discuss estimation methods based on censored and uncensored data. Two simulation studies are presented and four real data sets are applied to illustrating the usefulness of the new model.  相似文献   

2.
We introduce a robust clustering procedure for parsimonious model-based clustering. The classical mclust framework is robustified through impartial trimming and eigenvalue-ratio constraints (the tclust framework, which is robust but not affine invariant). An advantage of our resulting mtclust approach is that eigenvalue-ratio constraints are not needed for certain model formulations, leading to affine invariant robust parsimonious clustering. We illustrate the approach via simulations and a benchmark real data example. R code for the proposed method is available at https://github.com/afarcome/mtclust.  相似文献   

3.
This article describes a full Bayesian treatment for simultaneous fixed-effect selection and parameter estimation in high-dimensional generalized linear mixed models. The approach consists of using a Bayesian adaptive Lasso penalty for signal-level adaptive shrinkage and a fast Variational Bayes scheme for estimating the posterior mode of the coefficients. The proposed approach offers several advantages over the existing methods, for example, the adaptive shrinkage parameters are automatically incorporated, no Laplace approximation step is required to integrate out the random effects. The performance of our approach is illustrated on several simulated and real data examples. The algorithm is implemented in the R package glmmvb and is made available online.  相似文献   

4.
Bayesian hierarchical spatio-temporal models are becoming increasingly important due to the increasing availability of space-time data in various domains. In this paper we develop a user friendly R package, spTDyn, for spatio-temporal modelling. It can be used to fit models with spatially varying and temporally dynamic coefficients. The former is used for modelling the spatially varying impact of explanatory variables on the response caused by spatial misalignment. This issue can arise when the covariates only vary over time, or when they are measured over a grid and hence do not match the locations of the response point-level data. The latter is to examine the temporally varying impact of explanatory variables in space-time data due, for example, to seasonality or other time-varying effects. The spTDyn package uses Markov chain Monte Carlo sampling written in C, which makes computations highly efficient, and the interface is written in R making these sophisticated modelling techniques easily accessible to statistical analysts. The models and software, and their advantages, are illustrated using temperature and ozone space-time data.  相似文献   

5.
We study the properties of the called log-beta Weibull distribution defined by the logarithm of the beta Weibull random variable (Famoye et al. in J Stat Theory Appl 4:121–136, 2005; Lee et al. in J Mod Appl Stat Methods 6:173–186, 2007). An advantage of the new distribution is that it includes as special sub-models classical distributions reported in the lifetime literature. We obtain formal expressions for the moments, moment generating function, quantile function and mean deviations. We construct a regression model based on the new distribution to predict recurrence of prostate cancer for patients with clinically localized prostate cancer treated by open radical prostatectomy. It can be applied to censored data since it represents a parametric family of models that includes as special sub-models several widely-known regression models. The regression model was fitted to a data set of 1,324 eligible prostate cancer patients. We can predict recurrence free probability after the radical prostatectomy in terms of highly significant clinical and pathological explanatory variables associated with the recurrence of the disease. The predicted probabilities of remaining free of cancer progression are calculated under two nested models.  相似文献   

6.
In this paper, we are employing the generalized linear model (GLM) in the form 𝓁ij= to decompose the symmetry model into the class of models discussed in Tomizawa (1992 Tomizawa, S. 1992. Quasi-diagonals-parameter symmetry model for square contingency tables with ordered categories. Calcutta Statist. Assoc. Bull., 39: 5361.  [Google Scholar]). In this formulation, the random component would be the observed counts f ij with an underlying Poisson distribution. This approach utilizes the non-standard log-linear model and our focus in this paper therefore relates to models that are decompositions of the complete symmetry model. That is, models that are implied by the symmetry models. We develop factor and regression variables required for the implementation of these models in SAS PROC GENMOD and SPSS PROC GENLOG. We apply this methodology to analyse the three 4×4 contingency table, one of which is the Japanese Unaided distance vision data. Results obtained in this study are consistent with those from the numerous literature on the subject. We further extend our applications to the 6×6 Brazilian social mobility data. We found that both the quasi linear diagonal-parameters symmetry (QLDPS) and the quasi 2-ratios parameter symmetry (Q2RPS) models fit the Brazilian data very well. Parsimonious models being the QLDPS and the quasi-conditional symmetry (QCS) models. The SAS and SPSS programs for implementing the models discussed in this paper are presented in Appendices A, B and C.  相似文献   

7.
In this paper, we propose a new semiparametric heteroscedastic regression model allowing for positive and negative skewness and bimodal shapes using the B-spline basis for nonlinear effects. The proposed distribution is based on the generalized additive models for location, scale and shape framework in order to model any or all parameters of the distribution using parametric linear and/or nonparametric smooth functions of explanatory variables. We motivate the new model by means of Monte Carlo simulations, thus ignoring the skewness and bimodality of the random errors in semiparametric regression models, which may introduce biases on the parameter estimates and/or on the estimation of the associated variability measures. An iterative estimation process and some diagnostic methods are investigated. Applications to two real data sets are presented and the method is compared to the usual regression methods.  相似文献   

8.
ABSTRACT

We develop splice plots as a diagnostic tool for parametric generalized linear models. Splice plots use the independence of the outcome and explanatory measures given the regression function. Plotting differences between the estimated parametric regression function and non-parametric estimates of the regression function computed in small neighborhoods of the fitted values from the parametric model can be used to assess model fit.  相似文献   

9.
10.
The simulation-extrapolation (SIMEX) approach of Cook and Stefanski (J. Am. Stat. Assoc. 89:1314–1328, 1994) has proved to be successful in obtaining reliable estimates if variables are measured with (additive) errors. In particular for nonlinear models, this approach has advantages compared to other procedures such as the instrumental variable approach if only variables measured with error are available. However, it has always been assumed that measurement errors for the dependent variable are not correlated with those related to the explanatory variables although such scenario is quite likely. In such a case the (standard) SIMEX suffers from misspecification even for the simple linear regression model. Our paper reports first results from a generalized SIMEX (GSIMEX) approach which takes account of this correlation. We also demonstrate in our simulation study that neglect of the correlation will lead to estimates which may be worse than those from the naive estimator which completely disregards measurement errors.  相似文献   

11.
This paper describes the modelling and fitting of Gaussian Markov random field spatial components within a Generalized AdditiveModel for Location, Scale and Shape (GAMLSS) model. This allows modelling of any or all the parameters of the distribution for the response variable using explanatory variables and spatial effects. The response variable distribution is allowed to be a non-exponential family distribution. A new package developed in R to achieve this is presented. We use Gaussian Markov random fields to model the spatial effect in Munich rent data and explore some features and characteristics of the data. The potential of using spatial analysis within GAMLSS is discussed. We argue that the flexibility of parametric distributions, ability to model all the parameters of the distribution and diagnostic tools of GAMLSS provide an ideal environment for modelling spatial features of data.  相似文献   

12.
A new modeling approach called ‘recursive segmentation’ is proposed to support the supervised exploration and identification of subgroups or clusters. It is based on the frameworks of recursive partitioning and the Patient Rule Induction Method (PRIM). Through combining these methods, recursive segmentation aims to exploit their respective strengths while reducing their weaknesses. Consequently, recursive segmentation can be applied in a very general way, that is in any (multivariate) regression, classification or survival (time-to-event) problem, using conditional inference, evolutionary learning or the CART algorithm, with predictor variables of any scale and with missing values. Furthermore, results of a synthetic example and a benchmark application study that comprises 26 data sets suggest that recursive segmentation achieves a competitive prediction accuracy and provides more accurate definitions of subgroups by models of less complexity as compared to recursive partitioning and PRIM. An application to the German Breast Cancer Study Group data demonstrates the improved interpretability and reliability of results produced by the new approach. The method is made publicly available through the R-package rseg (http://rseg.r-forge.r-project.org/).  相似文献   

13.
Abstract

Partially linear models attract much attention to investigate the association between predictors and the response variable when the dependency on some predictors may be nonlinear. However, the hypothesis test for significance of predictors is still challenging, especially when the number of predictors is larger than sample size. In this paper, we reconsider the test procedure of Zhong and Chen (2011 Zhong, P., and S. Chen. 2011. Tests for high-dimensional regression coefficients with factorial designs. Journal of the American Statistical Association 106 (493):26074. doi:10.1198/jasa.2011.tm10284.[Taylor & Francis Online], [Web of Science ®] [Google Scholar]) when regression models have nonlinear components, and propose a generalized U-statistic for testing the linear components of the high dimensional partially linear models. The asymptotic properties of test statistic are obtained under null and alternative hypotheses, where the effect of nonlinear components should be considered and thus is different from that in linear models. Through simulation studies, we demonstrate good finite-sample performance of the proposed test in comparison with the existing methods. The practical utility of our proposed method is illustrated by a real data example.  相似文献   

14.
We propose a Bayesian method to select groups of correlated explanatory variables in a linear regression framework. We do this by introducing in the prior distribution assigned to the regression coefficients a random matrix $G$ that encodes the group structure. The groups can thus be inferred by sampling from the posterior distribution of $G$ . We then give a graph-theoretic interpretation of this random matrix $G$ as the adjacency matrix of cliques. We discuss the extension of the groups from cliques to more general random graphs, so that the proposed approach can be viewed as a method to find networks of correlated covariates that are associated with the response.  相似文献   

15.
ABSTRACT

In this article, a new randomized response model has been proposed. The proposed model is found to be more efficient than the randomized response models studied by Singh (2010 Singh, S. (2010). Proposed optimal orthogonal new additive model (POONAM). Statistica. Anno LXX(1):7381. [Google Scholar]). The relative efficiency of the proposed model has been studied with respect to the Singh (2010 Singh, S. (2010). Proposed optimal orthogonal new additive model (POONAM). Statistica. Anno LXX(1):7381. [Google Scholar]) model. Numerical illustrations are also given in support of the present study.  相似文献   

16.
Abstract

In this paper, two bivariate models based on the proposed methods of Marshall and Olkin are introduced. In the first model, the new bivariate distribution is presented based on the proposed method of Marshall and Olkin (1967 Marshall, A. W., and I. Olkin. 1967. A multivariate exponential distribution. Journal of the American Statistical Association 62 (317):3044. doi: 10.2307/2282907.[Taylor & Francis Online], [Web of Science ®] [Google Scholar]) which has natural interpretations, and it can be applied in fatal shock models or in competing risks models. In the second model, the proposed method of Marshall and Olkin (1997 Marshall, A. W., and I. Olkin. 1997. A new method of adding a parameter to a family of distributions with application to the exponential and weibull families. Biometrika 84 (3):64152. doi: 10.1093/biomet/84.3.641.[Crossref], [Web of Science ®] [Google Scholar]) is generalized to bivariate case and a new bivariate distribution is introduced. We call these new distributions as the bivariate Gompertz (BGP) distribution and bivariate Gompertz-geometric (BGPG) distribution, respectively. Moreover, the BGP model can be obtained as a special case of the BGPG model. Then, we present various properties of the new bivariate models. In this regard, the joint and conditional density functions, the joint cumulative distribution function can be obtained in compact forms. Also, the aging properties and the bivariate hazard gradient are discussed. This model has five unknown parameters and the maximum likelihood estimators cannot be obtained in explicit form. We propose to use the EM algorithm to compute the maximum likelihood estimators of the unknown parameters, and it is computationally quite tractable. Also, Monte Carlo simulations are performed to investigate the effectiveness of the proposed algorithm. Finally, we analyze three real data sets for illustrative purposes.  相似文献   

17.
ABSTRACT

In this article, we study the local influence for the elliptical linear regression model under equality constraints. We first obtain the parameter estimators of this model using the penalized log-likelihood function and iterative techniques. Then we obtain the diagnostics under the perturbations of constant variance, responses, and explanatory variables in the spirit of Cook (1986 Cook, R.D. (1986). Assessment of local influence. J. Royal Stat. Soc. Ser. B 48(2):133169. [Google Scholar]). Finally, a numerical example on the data set of the salinity of water is given to illustrate the theoretical results.  相似文献   

18.
Application of the minimum distance (MD) estimation method to the linear regression model for estimating regression parameters is a difficult and time-consuming process due to the complexity of its distance function, and hence, it is computationally expensive. To deal with the computational cost, this paper proposes a fast algorithm which makes the best use of coordinate-wise minimization technique in order to obtain the MD estimator. R package (KoulMde) based on the proposed algorithm and written in Rcpp is available online.  相似文献   

19.
《统计学通讯:理论与方法》2012,41(13-14):2503-2511
Univariate partial least squares regression (PLS1) is a method of modeling relationships between a response variable and explanatory variables, especially when the explanatory variables are almost collinear. The purpose is to predict a future response observation, although in many applications there is an interest to understand the contributions of each explanatory variable. It is an algorithmic approach. In this article, we are going to use the algorithm presented by Helland (1988 Helland , I. S. ( 1988 ). On the structure of partial least squares regression . Commun. Statist. Simul. Computat. 17 : 581607 .[Taylor & Francis Online], [Web of Science ®] [Google Scholar]). The population PLS predictor is linked to a linear model including a Krylov design matrix and a two-step estimation procedure. For the first step, the maximum likelihood approach is applied to a specific multivariate linear model, generating tools for evaluating the information in the explanatory variables. It is shown that explicit maximum likelihood estimators of the dispersion matrix can be obtained where the dispersion matrix, besides representing the variation in the error, also includes the Krylov structured design matrix describing the mean.  相似文献   

20.
This paper revisits two bivariate Pareto models for fitting competing risks data. The first model is the Frank copula model, and the second one is a bivariate Pareto model introduced by Sankaran and Nair (1993 Sankaran, P. G., and N. U. Nair. 1993. A bivariate Pareto model and its applications to reliability. Naval Research Logistics 40 (7):10131020. doi:10.1002/1520-6750(199312)40:7%3c1013::AID-NAV3220400711%3e3.0.CO;2-7.[Crossref], [Web of Science ®] [Google Scholar]). We discuss the identifiability issues of these models and develop the maximum likelihood estimation procedures including their computational algorithms and model-diagnostic procedures. Simulations are conducted to examine the performance of the maximum likelihood estimation. Real data are analyzed for illustration.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号