Similar literature
20 similar articles found (search time: 15 ms)
1.
ABSTRACT

In this paper we propose a class of skewed t link models for analyzing binary response data with covariates. It is a class of asymmetric link models designed to improve the overall fit when commonly used symmetric links, such as the logit and probit links, do not provide the best fit available for a given binary response dataset. We develop the class of models by introducing a skewed t distribution for the underlying latent variable. For the analysis of the models, Bayesian and non-Bayesian methods are pursued using a Markov chain Monte Carlo (MCMC) sampling-based approach. The necessary theory for modelling and computation is provided. Finally, a simulation study and a real data example are used to illustrate the proposed methodology.

2.
ABSTRACT

We introduce a semi-parametric Bayesian approach based on skewed Dirichlet process priors for location parameters in the ordinal calibration problem. This approach allows the modeling of asymmetrical error distributions. Conditional posterior distributions are implemented, thus allowing the use of Markov chain Monte Carlo to generate the posterior distributions. The methodology is applied to both simulated and real data.

3.
ABSTRACT

We propose an extension of parametric product partition models (PPMs). We name our proposal nonparametric product partition models because we associate a random measure instead of a parametric kernel to each set within a random partition. Our methodology does not impose any specific form on the marginal distribution of the observations, allowing us to detect shifts of behaviour even when dealing with heavy-tailed or skewed distributions. We propose a suitable loss function and find the partition of the data having minimum expected loss. We then apply our nonparametric procedure to multiple change-point analysis and compare it with PPMs and with other methodologies that have recently appeared in the literature. Also, in the context of missing data, we exploit the product partition structure in order to estimate the distribution function of each missing value, allowing us to detect change points using the loss function mentioned above. Finally, we present applications to financial as well as genetic data.

4.
In this paper, we study statistical inference based on the Bayesian approach for regression models under the assumption that independent additive errors follow a normal, Student-t, slash, contaminated normal, Laplace or symmetric hyperbolic distribution, where both the location and dispersion parameters of the response variable distribution include nonparametric additive components approximated by B-splines. This class of models provides a rich set of symmetric distributions for the model error. Some of these distributions have heavier or lighter tails than the normal as well as different levels of kurtosis. In order to draw samples from the posterior distribution of the parameters of interest, we propose an efficient Markov chain Monte Carlo (MCMC) algorithm, which combines the Gibbs sampler and Metropolis–Hastings algorithms. The performance of the proposed MCMC algorithm is assessed through simulation experiments. We apply the proposed methodology to a real data set. The proposed methodology is implemented in the R package BayesGESM using the function gesm().

5.
Abstract

Handling data with a nonignorable missingness mechanism is still a challenging problem in statistics. In this paper, we develop a fully Bayesian adaptive Lasso approach for quantile regression models with nonignorably missing response data, where the nonignorable missingness mechanism is specified by a logistic regression model. The proposed method extends the Bayesian Lasso by allowing different penalization parameters for different regression coefficients. Furthermore, a hybrid algorithm that combines the Gibbs sampler and the Metropolis–Hastings algorithm is implemented to simulate the parameters from their posterior distributions, mainly the regression coefficients, the shrinkage coefficients, and the parameters of the nonignorable missingness model. Finally, some simulation studies and a real example are used to illustrate the proposed methodology.

6.
Abstract

In this paper, we present a flexible mechanism for constructing probability distributions on bounded intervals, based on the composition of a baseline cumulative probability function and the quantile transformation from another cumulative probability distribution. In particular, we are interested in the (0, 1) interval. The composite quantile family of probability distributions contains many models that have been proposed in the recent literature, and new probability distributions are introduced on the unit interval. The proposed methodology is illustrated with two examples analyzing a poverty dataset from Peru, from both the Bayesian and the likelihood points of view.

7.
In this paper, we consider a constructive representation of skewed distributions proposed by Ferreira and Steel (J Am Stat Assoc 101:823–829, 2006) and present its basic properties. We study five versions of skew-normal distributions in this general setting. An appropriate empirical model for a skewed distribution is introduced. In data analysis, we compare this empirical model with the other four versions of skew-normal distributions using some reasonable criteria. It is shown that the proposed empirical model has a better fit for density estimation.

8.
Recently, Gupta and Gupta [Analyzing skewed data by power-normal model, Test 17 (2008), pp. 197–210] proposed the power-normal distribution, of which the normal distribution is a special case. The power-normal distribution is a skewed distribution whose support is the whole real line. The main aim of this paper is to consider a bivariate power-normal distribution whose marginals are power-normal distributions. We obtain the proposed bivariate power-normal distribution from the Clayton copula by making a suitable transformation in both marginals. The Lindley–Singpurwalla distribution can also be used to obtain the same distribution. Different properties of this new distribution are investigated in detail. Two different estimators are proposed. One data analysis is performed for illustrative purposes. Finally, we propose some generalizations to the multivariate case along the same lines and discuss some of their properties.

9.
Cluster analysis is the automated search for groups of homogeneous observations in a data set. A popular modeling approach for clustering is based on finite normal mixture models, which assume that each cluster is modeled as a multivariate normal distribution. However, the normality assumption that each component is symmetric is often unrealistic. Furthermore, normal mixture models are not robust against outliers; they often require extra components for modeling outliers and/or give a poor representation of the data. To address these issues, we propose a new class of distributions, multivariate t distributions with the Box-Cox transformation, for mixture modeling. This class of distributions generalizes the normal distribution with the heavier-tailed t distribution, and introduces skewness via the Box-Cox transformation. As a result, this provides a unified framework to simultaneously handle outlier identification and data transformation, two interrelated issues. We describe an Expectation-Maximization algorithm for parameter estimation along with transformation selection. We demonstrate the proposed methodology with three real data sets and simulation studies. Compared with a wealth of approaches including the skew-t mixture model, the proposed t mixture model with the Box-Cox transformation performs favorably in terms of accuracy in the assignment of observations, robustness against model misspecification, and selection of the number of components.

10.
In the analysis of correlated ordered data, mixed-effect models are frequently used to control for subject heterogeneity effects. A common assumption in fitting these models is the normality of the random effects. In many cases, this is unrealistic, making the estimation results unreliable. This paper considers several flexible models for the random effects and investigates their properties in model fitting. We adopt a proportional odds logistic regression model and incorporate skewed versions of the normal, Student's t and slash distributions for the effects. Stochastic representations for the various flexible distributions are then proposed based on the mixing strategy approach. This reduces the computational burden of the MCMC technique. Furthermore, this paper addresses the identifiability restrictions and suggests a procedure to handle this issue. We analyze a real data set taken from an ophthalmic clinical trial. Model selection is performed using suitable Bayesian model selection criteria.

11.
Linear mixed models based on the normality assumption are widely used in health-related studies. Although the normality assumption leads to simple, mathematically tractable, and powerful tests, violation of the assumption may easily invalidate the statistical inference. Transformation of variables is sometimes used to make normality approximately true. In this paper we consider another approach, replacing the normal distributions in linear mixed models with skew-t distributions, which account for skewness and heavy tails in both the random effects and the errors. The full likelihood-based estimator is often difficult to use, so a 3-step estimation procedure is proposed, followed by an application to the analysis of deglutition apnea duration in normal swallows. The example shows that skew-t models often entail more reliable inference than Gaussian models for skewed data.

12.
A robust regression methodology is proposed via M-estimation. The approach adapts to the tail behavior and skewness of the distribution of the random error terms, providing for a reliable analysis under a broad class of distributions. This is accomplished by allowing the objective function, used to determine the regression parameter estimates, to be selected in a data-driven manner. The asymptotic properties of the proposed estimator are established and a numerical algorithm is provided to implement the methodology. The finite sample performance of the proposed approach is exhibited through simulation, and the approach is used to analyze two motivating datasets.

13.
In this paper, we discuss the extension of some diagnostic procedures to multivariate measurement error models with scale mixtures of skew-normal distributions (Lachos et al., Statistics 44:541–556, 2010c). This class provides a useful generalization of normal (and skew-normal) measurement error models since the random term distributions cover symmetric, asymmetric and heavy-tailed distributions, such as skew-t, skew-slash and skew-contaminated normal, among others. Inspired by the EM algorithm proposed by Lachos et al. (Statistics 44:541–556, 2010c), we develop a local influence analysis for measurement error models, following Zhu and Lee's (J R Stat Soc B 63:111–126, 2001) approach. This is because the observed data log-likelihood function associated with the proposed model is somewhat complex and Cook's well-known approach can be very difficult to apply to achieve local influence measures. Some useful perturbation schemes are also discussed. In addition, a score test for assessing the homogeneity of the skewness parameter vector is presented. Finally, the methodology is exemplified through a real data set, illustrating the usefulness of the proposed methodology.

14.
Meta-analysis refers to a quantitative method for combining results from independent studies in order to draw overall conclusions. We consider hierarchical models, including selection models, under a skewed heavy-tailed error distribution proposed originally by Chen, Dey, and Shao [M. H. Chen, D. K. Dey, Q. M. Shao, A new skewed link model for dichotomous quantal response data, J. Amer. Statist. Assoc. 94 (1999), pp. 1172–1186.] and Branco and Dey [D. Branco and D.K. Dey, A general class of multivariate skew-elliptical distributions, J. Multivariate Anal. 79, pp. 99–113.]. These rich classes of models combine the information of independent studies, allowing investigation of variability both between and within studies and incorporating weight functions. We construct a detailed computational scheme under skewed normal and skewed Student's t distributions using the MCMC method. Bayesian model selection is conducted by Bayes factor under different skewed error distributions. Finally, we illustrate our methodology using a real data example taken from Johnson [M.F. Johnson, Comparative efficacy of NaF and SMFP dentifrices in caries prevention: a meta-analysis overview, J Eur. Organ. Caries Res. 27 (1993), pp. 328–336.].

15.
Multiple imputation has emerged as a popular approach to handling data sets with missing values. For incomplete continuous variables, imputations are usually produced using multivariate normal models. However, this approach might be problematic for variables with a strongly non-normal shape, as it would generate imputations incoherent with the actual distributions and thus lead to incorrect inferences. For non-normal data, we consider a multivariate extension of Tukey's gh distribution/transformation [38] to accommodate skewness and/or kurtosis and capture the correlation among the variables. We propose an algorithm to fit the incomplete data with the model and generate imputations. We apply the method to a national data set for hospital performance on several standard quality measures, which are highly skewed to the left and substantially correlated with each other. We use Monte Carlo studies to assess the performance of the proposed approach. We discuss possible generalizations and give some advice to practitioners on how to handle non-normal incomplete data.

16.
We develop a general approach to estimation and inference for income distributions using grouped or aggregate data that are typically available in the form of population shares and class mean incomes, with unknown group bounds. We derive generic moment conditions and an optimal weight matrix that can be used for generalized method-of-moments (GMM) estimation of any parametric income distribution. Our derivation of the weight matrix and its inverse allows us to express the seemingly complex GMM objective function in a relatively simple form that facilitates estimation. We show that our proposed approach, which incorporates information on class means as well as population proportions, is more efficient than maximum likelihood estimation of the multinomial distribution, which uses only population proportions. In contrast to the earlier work of Chotikapanich, Griffiths, and Rao, and Chotikapanich, Griffiths, Rao, and Valencia, which did not specify a formal GMM framework, did not provide methodology for obtaining standard errors, and restricted the analysis to the beta-2 distribution, we provide standard errors for estimated parameters and relevant functions of them, such as inequality and poverty measures, and we provide methodology for all distributions. A test statistic for testing the adequacy of a distribution is proposed. Using eight countries/regions for the year 2005, we show how the methodology can be applied to estimate the parameters of the generalized beta distribution of the second kind (GB2), and its special-case distributions, the beta-2, Singh–Maddala, Dagum, generalized gamma, and lognormal distributions. We test the adequacy of each distribution and compare predicted and actual income shares, where the number of groups used for prediction can differ from the number used in estimation. Estimates and standard errors for inequality and poverty measures are provided. Supplementary materials for this article are available online.

17.
We propose a three-parameter distribution referred to as the reflected-shifted-truncated gamma (RSTG) distribution to model negatively skewed data. Various properties of the proposed distribution are derived. The estimation of the model parameters is approached by maximum likelihood methods and the observed information matrix is derived. Monte Carlo simulations are performed to compare the performances of the proposed methods of estimation for both small and large samples. Using information theoretic criteria, we compare the RSTG distribution to the exponential, generalized F, generalized gamma, Gompertz, log-logistic, lognormal, Rayleigh, and Weibull distributions in three negatively skewed real datasets.

18.
Partially linear models (PLMs) are an important tool in modelling economic and biometric data and are considered a flexible generalization of the linear model, obtained by including a nonparametric component of some covariate in the linear predictor. Usually, the error component is assumed to follow a normal distribution. However, theory and application (through simulation or experimentation) often generate a large number of data sets that are skewed. The objective of this paper is to extend PLMs by allowing the errors to follow a skew-normal distribution [A. Azzalini, A class of distributions which includes the normal ones, Scand. J. Statist. 12 (1985), pp. 171–178], increasing the flexibility of the model. In particular, we develop the expectation-maximization (EM) algorithm for linear regression models and diagnostic analysis via local influence as well as generalized leverage, following [H. Zhu and S. Lee, Local influence for incomplete-data models, J. R. Stat. Soc. Ser. B 63 (2001), pp. 111–126]. A simulation study is also conducted to evaluate the efficiency of the EM algorithm. Finally, a suitable transformation is applied to a data set on ragweed pollen concentration in order to fit PLMs under asymmetric distributions. An illustrative comparison is performed between normal and skew-normal errors.

19.
Abstract

The class of transmuted distributions has received a lot of attention in the recent statistical literature. In this paper, we propose a rich family of bivariate distributions whose conditionals are transmuted distributions. The new family of distributions depends on the two baseline distributions and three dependence parameters. Apart from the general properties, we also study the distribution of the concomitants of order statistics. We study specific bivariate models. Estimation methodologies are proposed. A simulation study is conducted. The usefulness of this family is established by fitting well-analyzed real life time data.

20.
ABSTRACT

In this paper, we propose a new probability model called the log-EIG distribution for lifetime data analysis. Some important properties of the proposed model and maximum likelihood estimation of its parameters are discussed. Its relationship with the exponential inverse Gaussian distribution is similar to that of the lognormal and the normal distributions. Through applications to well-known datasets, we show that the log-EIG distribution competes well with, and in some instances even provides a better fit than, the commonly used lifetime models such as the gamma, lognormal, Weibull and inverse Gaussian distributions. It can accommodate situations where an increasing failure rate model is required as well as those with a decreasing failure rate at larger times.
