1.

Sensitivity analysis for the identifiability with application to latent random effect model for the mixed data

E. Bahrami Samani 《Journal of Applied Statistics》2014,41(12):2761-2776

In this paper, we study the identifiability of a latent random effect model for mixed correlated continuous and ordinal longitudinal responses. We derive conditions for the identifiability of the covariance parameters of the responses. We also propose a sensitivity analysis to investigate perturbation from the non-identifiability of the covariance parameters, and show how some elements of the covariance structure can be used for this purpose. These elements are associated with the conditions for identifiability of the covariance parameters of the responses. The influence of small perturbations of these elements on the maximal normal curvature is also studied. The model is illustrated using medical data.

2.

General location multivariate latent variable models for mixed correlated bounded continuous, ordinal, and nominal responses with non-ignorable missing data

Elham Tabrizi Ehsan Bahrami Samani Mojtaba Ganjali 《Journal of Applied Statistics》2021,48(5):765

Using a multivariate latent variable approach, this article proposes some new general models to analyze correlated bounded continuous and categorical (nominal and/or ordinal) responses with and without non-ignorable missing values. First, we discuss regression methods for jointly analyzing continuous, nominal, and ordinal responses, motivated by data from studies of toxicity development. Second, using the beta and Dirichlet distributions, we extend the models so that some continuous responses are replaced by bounded continuous responses. The joint distribution of the bounded continuous, nominal and ordinal variables is decomposed into a marginal multinomial distribution for the nominal variable and a conditional multivariate joint distribution for the bounded continuous and ordinal variables given the nominal variable. We estimate the regression parameters under the new general location models using the maximum-likelihood method. Sensitivity analysis is also performed to study the influence of small perturbations of the parameters of the missing mechanisms of the model on the maximal normal curvature. The proposed models are applied to two data sets: BMI, Steatosis and Osteoporosis data and Tehran household expenditure budgets.

3.

Parameter estimation approaches to tackling measurement error and multicollinearity in ordinal probit models

Jing Guan Yunfeng Zhao 《Communications in Statistics - Theory and Methods》2020,49(16):3835-3859

Abstract

The regression model with an ordinal outcome has been widely used in many fields. Moreover, predictors measured with error and multicollinearity are long-standing problems that often occur in regression analysis. However, there are not many studies dealing with measurement error models with a general ordinal response, and even fewer when they suffer from multicollinearity. The purpose of this article is to estimate the parameters of ordinal probit models with measurement error and multicollinearity. First, we propose to use regression calibration and refined regression calibration to estimate parameters in ordinal probit models with measurement error. Second, we develop new methods to obtain estimators of parameters in the presence of multicollinearity and measurement error in the ordinal probit model. Furthermore, we extend all the methods to quadratic ordinal probit models and discuss the situation in ordinal logistic models. These estimators are consistent and asymptotically normally distributed under general conditions. They are easy to compute, perform well and are robust against the normality assumption for the predictor variables in our simulation studies. The proposed methods are applied to some real datasets.

4.

Concurrent generation of multivariate mixed data with variables of dissimilar types

《Journal of Statistical Computation and Simulation》2012,82(18):3595-3607

ABSTRACT

Data sets originating from a wide range of research studies are composed of multiple variables that are correlated and of dissimilar types, primarily of count, binary/ordinal and continuous attributes. The present paper builds on previous work on multivariate data generation and develops a framework for generating multivariate mixed data with a pre-specified correlation matrix. The generated data consist of components that are marginally count, binary, ordinal and continuous, where the count and continuous variables follow the generalized Poisson and normal distributions, respectively. The use of the generalized Poisson distribution provides a flexible mechanism which allows for the under- and over-dispersed count variables generally encountered in practice. A step-by-step algorithm is provided and its performance is evaluated using simulated and real-data scenarios.
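The idea of generating correlated variables of dissimilar types can be illustrated with a much simpler device than the paper's algorithm: draw a latent bivariate normal pair and cut one coordinate into ordinal levels. The sketch below is only that, a latent-normal thresholding illustration; the function name `draw_mixed_pair` and its parameters are hypothetical, it does not cover count variables or the generalized Poisson distribution, and the correlation of the resulting pair generally differs from the latent correlation `rho`, so it does not by itself hit a pre-specified correlation matrix.

```python
import math
import random
from statistics import NormalDist


def draw_mixed_pair(rho, cum_probs, rng):
    """One (continuous, ordinal) pair from a latent bivariate normal.

    rho is the latent correlation (an assumption of this sketch);
    cum_probs are the cumulative probabilities of the ordinal levels,
    excluding the final 1.0.
    """
    z1 = rng.gauss(0.0, 1.0)
    # Second latent coordinate, correlated with z1 at level rho.
    z2 = rho * z1 + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, 1.0)
    # Thresholds chosen so the ordinal margins match cum_probs.
    cuts = [NormalDist().inv_cdf(p) for p in cum_probs]
    level = sum(z2 > c for c in cuts)  # number of thresholds exceeded
    return z1, level
```

For example, `draw_mixed_pair(0.6, [0.3, 0.7], random.Random(1))` returns a standard normal value together with an ordinal level in {0, 1, 2} whose marginal probabilities are 0.3, 0.4 and 0.3.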

5.

Instrumental variable estimation in ordinal probit models with mismeasured predictors

Jing Guan Hongjian Cheng Kenneth A. Bollen D. Roland Thomas Liqun Wang 《Revue canadienne de statistique》2019,47(4):653-667

Researchers in the medical, health, and social sciences routinely encounter ordinal variables such as self‐reports of health or happiness. When modelling ordinal outcome variables, it is common to have covariates, for example, attitudes, family income, retrospective variables, measured with error. As is well known, ignoring even random error in covariates can bias coefficients and hence prejudice the estimates of effects. We propose an instrumental variable approach to the estimation of a probit model with an ordinal response and mismeasured predictor variables. We obtain likelihood‐based and method of moments estimators that are consistent and asymptotically normally distributed under general conditions. These estimators are easy to compute, perform well and are robust against the normality assumption for the measurement errors in our simulation studies. The proposed method is applied to both simulated and real data. The Canadian Journal of Statistics 47: 653–667; 2019 © 2019 Statistical Society of Canada

6.

Latent variable models with ordinal categorical covariates

Wai-Yin Poon Hai-Bin Wang 《Statistics and Computing》2012,22(5):1135-1154

We propose a general latent variable model for multivariate ordinal categorical variables, in which both the responses and the covariates are ordinal, to assess the effect of the covariates on the responses and to model the covariance structure of the response variables. A fully Bayesian approach is employed to analyze the model. The Gibbs sampler is used to simulate the joint posterior distribution of the latent variables and the parameters, and the parameter expansion and reparameterization techniques are used to speed up the convergence procedure. The proposed model and method are demonstrated by simulation studies and a real data example.

7.

A note on sequential ML estimates and their asymptotic covariances

U. Küsters 《Statistical Papers》1990,31(1):131-145

A marginal and sequential maximum likelihood estimation method is described which can be used instead of full information maximum likelihood estimation if the latter method is infeasible. It is shown that the sequential procedure yields strongly consistent and asymptotically normal estimates under relatively general regularity conditions. It is shown that the covariance matrix of the sequential ML estimator does not coincide with the inverse of the Fisher information matrix. Hence, the corrected covariance matrix is derived. The application of the sequential procedure to the multivariate probit model with dichotomous, ordered categorical, single-sided censored and double-sided censored endogenous variables is included. This research was partially supported by a dissertation grant of the Studienstiftung des Deutschen Volkes. Comments and suggestions on earlier drafts by Gerhard Arminger, Giorgio Calzolari, Bernd Kortzen and an anonymous referee are gratefully acknowledged.

8.

A mixture latent variable model for modeling mixed data in heterogeneous populations and its applications

Leila Amiri Mojtaba Khazaei Mojtaba Ganjali 《AStA Advances in Statistical Analysis》2018,102(1):95-115

Latent variable models are widely used for joint modeling of mixed data including nominal, ordinal, count and continuous data. In this paper, we consider a latent variable model for jointly modeling relationships between mixed binary, count and continuous variables with some observed covariates. We assume that, given a latent variable, the mixed variables of interest are independent and that the count and continuous variables have Poisson and normal distributions, respectively. As such data may be extracted from different subpopulations, unobserved heterogeneity has to be taken into account. A mixture distribution is considered for the distribution of the latent variable, which accounts for the heterogeneity. The generalized EM algorithm, which uses the Newton–Raphson algorithm inside the EM algorithm, is used to compute the maximum likelihood estimates of the parameters. The standard errors of the maximum likelihood estimates are computed by using the supplemented EM algorithm. Analysis of the primary biliary cirrhosis data is presented as an application of the proposed model.

9.

Two-Part Models for Analysis of Agatston Scores with Possible Proportionality Constraints

Cong Han Richard Kronmal 《Communications in Statistics - Theory and Methods》2013,42(1):99-111

ABSTRACT

Logit-linear and probit-linear two-part models can be used to analyze data that are a mixture of zeros and positive continuous responses. The slopes in the linear part of a model can be constrained to be proportional to the slopes in the logit or probit part. In this article, it is shown that implementing such a constraint will decrease (in Loewner ordering) the asymptotic covariance matrix of the maximum likelihood estimates. A case study is provided using coronary artery calcification data from the Multi-Ethnic Study of Atherosclerosis.

10.

Accelerating Monte Carlo Markov chain convergence for cumulative-link generalized linear models

Mary Kathryn Cowles 《Statistics and Computing》1996,6(2):101-111

The ordinal probit, univariate or multivariate, is a generalized linear model (GLM) structure that arises frequently in such disparate areas of statistical applications as medicine and econometrics. Despite the straightforwardness of its implementation using the Gibbs sampler, the ordinal probit may present challenges in obtaining satisfactory convergence. We present a multivariate Hastings-within-Gibbs update step for generating latent data and bin boundary parameters jointly, instead of individually from their respective full conditionals. When the latent data are parameters of interest, this algorithm substantially improves Gibbs sampler convergence for large datasets. We also discuss Monte Carlo Markov chain (MCMC) implementation of cumulative logit (proportional odds) and cumulative complementary log-log (proportional hazards) models with latent data.
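For orientation, the baseline that the paper improves on (drawing each latent value from its own full conditional) can be sketched as follows. This is the standard data-augmentation step for an ordinal probit with unit error variance, not the joint Hastings-within-Gibbs update proposed in the paper; the function name `sample_latent`, the 0-based level coding and the clamping constant are assumptions of this sketch.

```python
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()


def sample_latent(mean, level, cutpoints, rng):
    """Draw z ~ N(mean, 1) truncated to the bin of the observed level.

    cutpoints are the interior bin boundaries in increasing order; level j
    (0-based) corresponds to the interval (cutpoints[j - 1], cutpoints[j]],
    with -inf and +inf as the outer boundaries.
    """
    lo = STD_NORMAL.cdf(cutpoints[level - 1] - mean) if level > 0 else 0.0
    hi = (STD_NORMAL.cdf(cutpoints[level] - mean)
          if level < len(cutpoints) else 1.0)
    u = lo + (hi - lo) * rng.random()       # uniform on the bin's CDF range
    u = min(max(u, 1e-12), 1.0 - 1e-12)     # keep inv_cdf away from 0 and 1
    return mean + STD_NORMAL.inv_cdf(u)
```

The inverse-CDF draw keeps the example free of external dependencies; in the far tails the clamping makes it only approximate, so a dedicated truncated-normal sampler would be preferable in real use.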

11.

On marginal likelihood inference for the intra-class correlation coefficient

M. Safiul Haq V. Ming Ng 《Communications in Statistics - Theory and Methods》2013,42(2):179-189

A p-component set of responses has been constructed by a location-scale transformation of a p-component set of error variables, the covariance matrix of the set of error variables being of intra-class covariance structure: all variances being unity, and all covariances being equal to ρ. A sample of size n has been described as a conditional structural model, conditional on the value of the intra-class correlation coefficient ρ. The conditional technique of structural inference provides the marginal likelihood function of ρ based on the standardized residuals. For the normal case, the marginal likelihood function of ρ is seen to depend on the standardized residuals through the sample intra-class correlation coefficient. By the likelihood modulation technique, the nonnull distribution of the sample intra-class correlation coefficient has also been obtained.
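The intra-class covariance structure described here (unit variances, one common covariance ρ) is easy to write down explicitly. The sketch below builds the matrix and uses the standard closed-form eigenvalues, 1 + (p - 1)ρ once and 1 - ρ with multiplicity p - 1, to check which values of ρ give a positive definite matrix, namely -1/(p - 1) < ρ < 1. The function names are illustrative and not taken from the paper.

```python
def intraclass_cov(p, rho):
    """p x p matrix with unit variances and common covariance rho."""
    return [[1.0 if i == j else rho for j in range(p)] for i in range(p)]


def intraclass_eigenvalues(p, rho):
    """Closed form: 1 + (p - 1) * rho once, 1 - rho repeated p - 1 times."""
    return [1.0 + (p - 1) * rho] + [1.0 - rho] * (p - 1)


def is_valid_rho(p, rho):
    """The matrix is positive definite iff every eigenvalue is positive."""
    return all(ev > 0 for ev in intraclass_eigenvalues(p, rho))
```

With p = 4, for instance, `is_valid_rho(4, -0.3)` holds while `is_valid_rho(4, -0.5)` does not, because the lower bound is -1/3.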

12.

Parsimonious Estimation of the Covariance Matrix in Multinomial Probit Models

Edward Cripps Denzil G. Fiebig 《Econometric Reviews》2013,32(2):146-157

This article presents a Bayesian analysis of a multinomial probit model by building on previous work that specified priors on identified parameters. The main contribution of our article is to propose a prior on the covariance matrix of the latent utilities that permits elements of the inverse of the covariance matrix to be identically zero. This allows a parsimonious representation of the covariance matrix when such parsimony exists. The methodology is applied to both simulated and real data, and its ability to obtain more efficient estimators of the covariance matrix and regression coefficients is assessed using simulated data.

13.

A Selection Procedure for the Number of Signals in Presence of Colored Noise

Pinyuen Chen Lifang Hsu 《Communications in Statistics - Theory and Methods》2013,42(10):1741-1757

Ranking and selection theory is used to estimate the number of signals present in colored noise. The data structure follows the well-known MUSIC (MUltiple SIgnal Classification) model. We deal with the eigenvalues of a covariance matrix, using the MUSIC model and colored noise. The data matrix can be written as the product of two matrices. The first matrix is the sample covariance matrix of the observed vectors. The second matrix is the inverse of the sample covariance matrix of reference vectors. We propose a multi-step selection procedure to construct a confidence interval on the number of signals present in a data set. Properties of this procedure will be stated and proved. Those properties will be used to compute the required parameters (procedure constants). Numerical examples are given to illustrate our theory.

14.

Joint modeling of mixed skewed continuous and ordinal longitudinal responses: a Bayesian approach

M. Teimourian M. Ganjali D. Berridge 《Journal of Applied Statistics》2015,42(10):2233-2256

In this paper, a joint model for analyzing multivariate mixed ordinal and continuous responses, where continuous outcomes may be skewed, is presented. For modeling the discrete ordinal responses, a continuous latent variable approach is considered, and for describing continuous responses, a skew-normal mixed effects model is used. A Bayesian approach using Markov Chain Monte Carlo (MCMC) is adopted for parameter estimation. Some simulation studies are performed to illustrate the proposed approach. The results of the simulation studies show that the use of separate models, or of the normal distributional assumption for shared random effects and within-subject errors of continuous and ordinal variables, instead of joint modeling under a skew-normal distribution, leads to biased parameter estimates. The approach is used for analyzing a part of the British Household Panel Survey (BHPS) data set. Annual income and life satisfaction are considered as the continuous and the ordinal longitudinal responses, respectively. The annual income variable is severely skewed; therefore, the use of the normality assumption for the continuous response does not yield acceptable results. The results of the data analysis show that gender, marital status, educational levels and the amount of money spent on leisure have a significant effect on annual income, while marital status has the highest impact on life satisfaction.

15.

Data reconciliation of nonnormal observations with nonlinear constraints

Oliver Cencic Rudolf Frühwirth 《Journal of Applied Statistics》2018,45(13):2411-2428

This paper presents a new method for the reconciliation of data described by arbitrary continuous probability distributions, with the focus on nonlinear constraints. The main idea, already applied to linear constraints in a previous paper, is to restrict the joint prior probability distribution of the observed variables with model constraints to get a joint posterior probability distribution. Because in general the posterior probability density function cannot be calculated analytically, it is shown that sampling from the posterior distribution by a Markov chain Monte Carlo (MCMC) method has decisive advantages. From the resulting sample of observed and unobserved variables various characteristics of the posterior distribution can be estimated, such as the mean, the full covariance matrix, marginal posterior densities, as well as marginal moments, quantiles, and HPD intervals. The procedure is illustrated by examples from material flow analysis and chemical engineering.

16.

Investigating the sensitivity of Gaussian processes to the choice of their correlation function and prior specifications

《Journal of Statistical Computation and Simulation》2012,82(8):681-699

A Gaussian process (GP) can be thought of as an infinite collection of random variables with the property that any subset, say of dimension n, of these variables has a multivariate normal distribution of dimension n, mean vector β and covariance matrix Σ [O'Hagan, A., 1994, Kendall's Advanced Theory of Statistics, Vol. 2B, Bayesian Inference (John Wiley & Sons, Inc.)]. The elements of the covariance matrix are routinely specified through the multiplication of a common variance by a correlation function. It is important to use a correlation function that provides a valid (positive definite) covariance matrix. Further, it is well known that the smoothness of a GP is directly related to the specification of its correlation function. Also, from a Bayesian point of view, a prior distribution must be assigned to the unknowns of the model. Therefore, when using a GP to model a phenomenon, the researcher faces two challenges: the need to specify a correlation function and a prior distribution for its parameters. In the literature there are many classes of correlation functions which provide a valid covariance structure. Also, there are many suggestions of prior distributions to be used for the parameters involved in these functions. We aim to investigate how sensitive GPs are to the (sometimes arbitrary) choices of their correlation functions. For this, we have simulated 25 sets of data, each of size 64, over the square [0, 5]×[0, 5] with a specific correlation function and fixed values of the GP's parameters. We then fit different correlation structures to these data, with different prior specifications, and check the performance of the adjusted models using different model comparison criteria.
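The construction "covariance = common variance × correlation function" and the simulation of one data set of size 64 over [0, 5]×[0, 5] can be sketched as below. Several choices here are assumptions of the sketch rather than details from the paper: the 8 × 8 regular grid, the exponential correlation function exp(-d/φ), the parameter values, and all function names. The Cholesky factorization doubles as the validity check, since it fails when the matrix is not positive definite.

```python
import math
import random


def grid_points(side=8, lower=0.0, upper=5.0):
    """side x side regular grid over [lower, upper]^2 (64 points by default)."""
    step = (upper - lower) / (side - 1)
    return [(lower + i * step, lower + j * step)
            for i in range(side) for j in range(side)]


def exp_covariance(points, sigma2=1.0, phi=1.0):
    """Covariance = common variance * exponential correlation exp(-d / phi)."""
    return [[sigma2 * math.exp(-math.dist(a, b) / phi) for b in points]
            for a in points]


def cholesky(matrix):
    """Lower-triangular factor; raises ValueError if not positive definite."""
    n = len(matrix)
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = matrix[i][j] - sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                if s <= 0.0:
                    raise ValueError("matrix is not positive definite")
                lower[i][j] = math.sqrt(s)
            else:
                lower[i][j] = s / lower[j][j]
    return lower


def simulate_gp(points, mean=0.0, sigma2=1.0, phi=1.0, rng=random):
    """One draw: mean + L z, with z standard normal and L L^T the covariance."""
    factor = cholesky(exp_covariance(points, sigma2, phi))
    z = [rng.gauss(0.0, 1.0) for _ in points]
    return [mean + sum(factor[i][k] * z[k] for k in range(i + 1))
            for i in range(len(points))]
```

Calling `simulate_gp(grid_points(), rng=random.Random(seed))` for 25 different seeds would give 25 data sets of size 64 in the spirit of the study design; swapping `exp_covariance` for another correlation family is where the sensitivity question arises.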

17.

Latent variable model for mixed correlated power series and ordinal longitudinal responses with non ignorable missing values

F. Razie M. Ganjali 《Communications in Statistics - Theory and Methods》2017,46(12):5738-5753

We propose a joint model based on a latent variable for analyzing mixed power series and ordinal longitudinal data with and without missing values. A bivariate probit regression model is used for the missing mechanisms. Random effects are used to take into account the correlation between longitudinal responses. A full likelihood-based approach is used to yield maximum-likelihood estimates of the model parameters. Our model is applied to a medical data set, obtained from an observational study on women where the correlated responses are the ordinal response of osteoporosis of the spine and the power series response of the number of joint damages. Sensitivity analysis is also performed to study the influence of small perturbations of the parameters of the missing mechanisms and overdispersion of the model on likelihood displacement.

18.

Using a mixture model for multiple imputation in the presence of outliers: the 'Healthy for life' project

Michael R. Elliott Nicolas Stettler 《Journal of the Royal Statistical Society. Series C, Applied statistics》2007,56(1):63-78

Summary. We consider the problem of obtaining population-based inference in the presence of missing data and outliers in the context of estimating the prevalence of obesity and body mass index measures from the 'Healthy for life' study. Identifying multiple outliers in a multivariate setting is problematic because of problems such as masking, in which groups of outliers inflate the covariance matrix in a fashion that prevents their identification when included, and swamping, in which outliers skew covariances in a fashion that makes non-outlying observations appear to be outliers. We develop a latent class model that assumes that each observation belongs to one of K unobserved latent classes, with each latent class having a distinct covariance matrix. We consider the latent class covariance matrix with the largest determinant to form an 'outlier class'. By separating the covariance matrix for the outliers from the covariance matrices for the remainder of the data, we avoid the problems of masking and swamping. As did Ghosh-Dastidar and Schafer, we use a multiple-imputation approach, which allows us simultaneously to conduct inference after removing cases that appear to be outliers and to promulgate uncertainty in the outlier status through the model inference. We extend the work of Ghosh-Dastidar and Schafer by embedding the outlier class in a larger mixture model, consider penalized likelihood and posterior predictive distributions to assess model choice and model fit, and develop the model in a fashion to account for the complex sample design. We also consider the repeated sampling properties of the multiple imputation removal of outliers.

19.

A Non-Gaussian Spatial Process Model for Opacity of Flocculated Paper

Patrick E. Brown Peter J. Diggle Robin Henderson 《Scandinavian Journal of Statistics》2003,30(2):355-368

ABSTRACT. Product quality in the paper-making industry can be assessed by opacity of a linear trace through continuous production sheets, summarized in spectral form. We adopt a class of non-Gaussian stochastic models for continuous spatial variation to describe data of this type. The model has flexible covariance structure, physically interpretable parameters and allows several scales of variation for the underlying process. We derive the spectral properties of the model, and develop methods of parameter estimation based on maximum likelihood in the frequency domain. The methods are illustrated using sample data from a UK mill.

20.

A distance-based rounding strategy for post-imputation ordinal data

Hakan Demirtas 《Journal of Applied Statistics》2010,37(3):489-500

Multiple imputation has emerged as a widely used model-based approach in dealing with incomplete data in many application areas. Gaussian and log-linear imputation models are fairly straightforward to implement for continuous and discrete data, respectively. However, in missing data settings which include a mix of continuous and discrete variables, correct specification of the imputation model could be a daunting task owing to the lack of flexible models for the joint distribution of variables of different nature. This complication, along with the accessibility of software packages that are capable of carrying out multiple imputation under the assumption of joint multivariate normality, appears to encourage applied researchers to pragmatically treat the discrete variables as continuous for imputation purposes, and subsequently round the imputed values to the nearest observed category. In this article, I introduce a distance-based rounding approach for ordinal variables in the presence of continuous ones. The first step of the proposed rounding process is predicated upon creating indicator variables that correspond to the ordinal levels, followed by jointly imputing all variables under the assumption of multivariate normality. The imputed values are then converted to the ordinal scale based on their Euclidean distances to a set of indicators, with minimal distance corresponding to the closest match. I compare the performance of this technique to crude rounding via commonly accepted accuracy and precision measures with simulated data sets.
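The final conversion step described above (map each imputed indicator vector to the ordinal level whose indicator pattern is nearest in Euclidean distance) can be sketched as follows. The coding is an assumption of this sketch: K levels are represented by K - 1 indicators with level 0 as the all-zeros reference, and ties go to the lowest level. The abstract does not specify either detail, and the function names are hypothetical.

```python
import math


def indicator_rows(n_levels):
    """Reference coding: level 0 is all zeros, level j sets indicator j - 1."""
    return [[1.0 if k == j - 1 else 0.0 for k in range(n_levels - 1)]
            for j in range(n_levels)]


def round_by_distance(imputed, n_levels):
    """Return the ordinal level whose indicator row is closest to `imputed`."""
    rows = indicator_rows(n_levels)
    distances = [math.dist(imputed, row) for row in rows]
    return distances.index(min(distances))  # first minimum wins on ties
```

For a three-level variable, an imputed indicator vector such as `[0.8, 0.3]` is closest to `[1.0, 0.0]` and is therefore assigned level 1, whereas `[0.1, -0.2]` falls back to the reference level 0.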