期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Dynamic latent trait models with mixed hidden Markov structure for mixed longitudinal outcomes

Yue Zhang Kiros Berhane 《Journal of applied statistics》2016,43(4):704-720

We propose a general Bayesian joint modeling approach to model mixed longitudinal outcomes from the exponential family for taking into account any differential misclassification that may exist among categorical outcomes. Under this framework, outcomes observed without measurement error are related to latent trait variables through generalized linear mixed effect models. The misclassified outcomes are related to the latent class variables, which represent unobserved real states, using mixed hidden Markov models (MHMMs). In addition to enabling the estimation of parameters in prevalence, transition and misclassification probabilities, MHMMs capture cluster level heterogeneity. A transition modeling structure allows the latent trait and latent class variables to depend on observed predictors at the same time period and also on latent trait and latent class variables at previous time periods for each individual. Simulation studies are conducted to make comparisons with traditional models in order to illustrate the gains from the proposed approach. The new approach is applied to data from the Southern California Children Health Study to jointly model questionnaire-based asthma state and multiple lung function measurements in order to gain better insight about the underlying biological mechanism that governs the inter-relationship between asthma state and lung function development. 相似文献

2.

Multivariate Latent Growth Models for Mixed Data with Covariate Effects

《统计学通讯:理论与方法》2012,41(16-17):3079-3093

The paper presents an extension of a new class of multivariate latent growth models (Bianconcini and Cagnone, 2012) to allow for covariate effects on manifest, latent variables and random effects. The new class of models combines: (i) multivariate latent curves that describe the temporal behavior of the responses, and (ii) a factor model that specifies the relationship between manifest and latent variables. Based on the Generalized Linear and Latent Variable Model framework (Bartholomew and Knott, 1999), the response variables are assumed to follow different distributions of the exponential family, with item-specific linear predictors depending on both latent variables and measurement errors. A full maximum likelihood method is used to estimate all the model parameters simultaneously. Data coming from the Data WareHouse of the University of Bologna are used to illustrate the methodology. 相似文献

3.

Semiparametric transformation models with Bayesian P-splines

Xin-Yuan Song Zhao-Hua Lu 《Statistics and Computing》2012,22(5):1085-1098

In this paper, we aim to develop a semiparametric transformation model. Nonparametric transformation functions are modeled with Bayesian P-splines. The transformed variables can be fitted to a general nonlinear mixed model, including linear or nonlinear regression models, mixed effect models, factor analysis models, and other latent variable models as special cases. Markov chain Monte Carlo algorithms are implemented to estimate transformation functions and unknown quantities in the model. The performance of the developed methodology is demonstrated with a simulation study. Its application to a real study on polydrug use is presented. 相似文献

4.

Bayesian analysis for confirmatory factor model with finite-dimensional Dirichlet prior mixing

Xia Yemao Pan Maolin 《统计学通讯:理论与方法》2017,46(9):4599-4619

Confirmatory factor analysis (CFA) model is a useful multivariate statistical tool for interpreting relationships between latent variables and manifest variables. Often statistical results based on a single CFA are seriously distorted when data set takes on heterogeneity. To address the heterogeneity resulting from the multivariate responses, we propose a Bayesian semiparametric modeling for CFA. The approach relies on using a prior over the space of mixing distributions with finite components. Blocked Gibbs sampler is implemented to cope with the posterior analysis. Results obtained from a simulation study and a real data set are presented to illustrate the methodology. 相似文献

5.

A Bayesian analysis on time series structural equation models

Yi-Fu Wang Tsai-Hung Fan 《Journal of statistical planning and inference》2011,141(6):2071-2078

Structural equation models (SEM) have been extensively used in behavioral, social, and psychological research to model relations between the latent variables and the observations. Most software packages for the fitting of SEM rely on frequentist methods. Traditional models and software are not appropriate for analysis of the dependent observations such as time-series data. In this study, a structural equation model with a time series feature is introduced. A Bayesian approach is used to solve the model with the aid of the Markov chain Monte Carlo method. Bayesian inferences as well as prediction with the proposed time series structural equation model can also reveal certain unobserved relationships among the observations. The approach is successfully employed using real Asian, American and European stock return data. 相似文献

6.

Generalized Linear Factor Models: A New Local EM Estimation Algorithm

Mohamed Saidane Xavier Bry Christian Lavergne 《统计学通讯:理论与方法》2013,42(16):2944-2958

In this article, a general approach to latent variable models based on an underlying generalized linear model (GLM) with factor analysis observation process is introduced. We call these models Generalized Linear Factor Models (GLFM). The observations are produced from a general model framework that involves observed and latent variables that are assumed to be distributed in the exponential family. More specifically, we concentrate on situations where the observed variables are both discretely measured (e.g., binomial, Poisson) and continuously distributed (e.g., gamma). The common latent factors are assumed to be independent with a standard multivariate normal distribution. Practical details of training such models with a new local expectation-maximization (EM) algorithm, which can be considered as a generalized EM-type algorithm, are also discussed. In conjunction with an approximated version of the Fisher score algorithm (FSA), we show how to calculate maximum likelihood estimates of the model parameters, and to yield inferences about the unobservable path of the common factors. The methodology is illustrated by an extensive Monte Carlo simulation study and the results show promising performance. 相似文献

7.

Semiparametric latent variable regression models for spatiotemporal modelling of mobile source particles in the greater Boston area 总被引：1，自引：0，他引：1

Alexandros Gryparis Brent A. Coull Joel Schwartz Helen H. Suh 《Journal of the Royal Statistical Society. Series C, Applied statistics》2007,56(2):183-209

Summary. Traffic particle concentrations show considerable spatial variability within a metropolitan area. We consider latent variable semiparametric regression models for modelling the spatial and temporal variability of black carbon and elemental carbon concentrations in the greater Boston area. Measurements of these pollutants, which are markers of traffic particles, were obtained from several individual exposure studies that were conducted at specific household locations as well as 15 ambient monitoring sites in the area. The models allow for both flexible non-linear effects of covariates and for unexplained spatial and temporal variability in exposure. In addition, the different individual exposure studies recorded different surrogates of traffic particles, with some recording only outdoor concentrations of black or elemental carbon, some recording indoor concentrations of black carbon and others recording both indoor and outdoor concentrations of black carbon. A joint model for outdoor and indoor exposure that specifies a spatially varying latent variable provides greater spatial coverage in the area of interest. We propose a penalized spline formulation of the model that relates to generalized kriging of the latent traffic pollution variable and leads to a natural Bayesian Markov chain Monte Carlo algorithm for model fitting. We propose methods that allow us to control the degrees of freedom of the smoother in a Bayesian framework. Finally, we present results from an analysis that applies the model to data from summer and winter separately. 相似文献

8.

An extended exponentiated-G-negative binomial family with threshold effect

Josemar Rodrigues Gauss M. Cordeiro Jorge Bazan 《Journal of applied statistics》2014,41(9):2056-2074

In this paper, we formulate a very flexible family of models which unifies most recent lifetime distributions. The main idea is to obtain a cumulative distribution function to transform the baseline distribution with an activation mechanism characterized by a latent threshold variable. The new family has a strong biological interpretation from the competitive risks point of view and the Box–Cox transformation provides an elegant manner to interpret the effect on the baseline distribution to obtain this alternative model. Several structural properties of the new model are investigated. A Bayesian analysis using Markov Chain Monte Carlo procedure is developed to illustrate with a real data the usefulness of the proposed family. 相似文献

9.

Bayesian factor analysis for spatially correlated data: application to cancer incidence data in Scotland

Maura Mezzetti 《Statistical Methods and Applications》2012,21(1):49-74

A hierarchical Bayesian factor model for multivariate spatially correlated data is proposed. Multiple cancer incidence data in Scotland are jointly analyzed, looking for common components, able to detect etiological factors of diseases hidden behind the data. The proposed method searches factor scores incorporating a dependence within observations due to a geographical structure. The great flexibility of the Bayesian approach allows the inclusion of prior opinions about adjacent regions having highly correlated observable and latent variables. The proposed model is an extension of a model proposed by Rowe (2003a) and starts from the introduction of separable covariance matrix for the observations. A Gibbs sampling algorithm is implemented to sample from the posterior distributions. 相似文献

10.

Bayesian finite mixtures with an unknown number of components: The allocation sampler

Agostino Nobile Alastair T. Fearnside 《Statistics and Computing》2007,17(2):147-162

A new Markov chain Monte Carlo method for the Bayesian analysis of finite mixture distributions with an unknown number of components is presented. The sampler is characterized by a state space consisting only of the number of components and the latent allocation variables. Its main advantage is that it can be used, with minimal changes, for mixtures of components from any parametric family, under the assumption that the component parameters can be integrated out of the model analytically. Artificial and real data sets are used to illustrate the method and mixtures of univariate and of multivariate normals are explicitly considered. The problem of label switching, when parameter inference is of interest, is addressed in a post-processing stage. 相似文献

11.

The independent factor analysis approach to latent variable modelling

Angela Montanari 《Statistics》2013,47(4):397-416

Independent factor analysis (IFA) has recently been proposed in the signal processing literature as a way to model a set of observed variables through linear combinations of latent independent variables and a noise term. A peculiarity of the method is that it defines a probability density function for the latent variables by mixtures of Gaussians. The aim of this paper is to cast the method into a more rigorous statistical framework and to propose some developments. In the first part, we present the IFA model in its population version, address identifiability issues and draw some parallels between the IFA model and the ordinary factor analysis (FA) one. Then we show that the IFA model may be reinterpreted as an independent component analysis-based rotation of an ordinary FA solution. We also give evidence that the IFA model represents a special case of mixture of factor analysers. In the second part, we address inferential issues, also deriving the standard errors for the model parameter estimates and providing model selection criteria. Finally, we present some empirical results on real data sets. 相似文献

12.

A hierarchical model for space–time surveillance data on meningococcal disease incidence

Leonhard Knorr-Held Sylvia Richardson 《Journal of the Royal Statistical Society. Series C, Applied statistics》2003,52(2):169-183

Summary. We describe a model-based approach to analyse space–time surveillance data on meningococcal disease. Such data typically comprise a number of time series of disease counts, each representing a specific geographical area. We propose a hierarchical formulation, where latent parameters capture temporal, seasonal and spatial trends in disease incidence. We then add—for each area—a hidden Markov model to describe potential additional (autoregressive) effects of the number of cases at the previous time point. Different specifications for the functional form of this autoregressive term are compared which involve the number of cases in the same or in neighbouring areas. The two states of the Markov chain can be interpreted as representing an 'endemic' and a 'hyperendemic' state. The methodology is applied to a data set of monthly counts of the incidence of meningococcal disease in the 94 départements of France from 1985 to 1997. Inference is carried out by using Markov chain Monte Carlo simulation techniques in a fully Bayesian framework. We emphasize that a central feature of our model is the possibility of calculating—for each region and each time point—the posterior probability of being in a hyperendemic state, adjusted for global spatial and temporal trends, which we believe is of particular public health interest. 相似文献

13.

Latent variable models with ordinal categorical covariates

Wai-Yin Poon Hai-Bin Wang 《Statistics and Computing》2012,22(5):1135-1154

We propose a general latent variable model for multivariate ordinal categorical variables, in which both the responses and the covariates are ordinal, to assess the effect of the covariates on the responses and to model the covariance structure of the response variables. A?fully Bayesian approach is employed to analyze the model. The Gibbs sampler is used to simulate the joint posterior distribution of the latent variables and the parameters, and the parameter expansion and reparameterization techniques are used to speed up the convergence procedure. The proposed model and method are demonstrated by simulation studies and a real data example. 相似文献

14.

Modeling a mixture of ordinal and continuous repeated measures

《Journal of Statistical Computation and Simulation》2012,82(10):873-886

We study the correlation structure for a mixture of ordinal and continuous repeated measures using a Bayesian approach. We assume a multivariate probit model for the ordinal variables and a normal linear regression for the continuous variables, where latent normal variables underlying the ordinal data are correlated with continuous variables in the model. Due to the probit model assumption, we are required to sample a covariance matrix with some of the diagonal elements equal to one. The key computational idea is to use parameter-extended data augmentation, which involves applying the Metropolis-Hastings algorithm to get a sample from the posterior distribution of the covariance matrix incorporating the relevant restrictions. The methodology is illustrated through a simulated example and through an application to data from the UCLA Brain Injury Research Center. 相似文献

15.

Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations 总被引：7，自引：0，他引：7

Håvard Rue Sara Martino Nicolas Chopin 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2009,71(2):319-392

Summary. Structured additive regression models are perhaps the most commonly used class of models in statistical applications. It includes, among others, (generalized) linear models, (generalized) additive models, smoothing spline models, state space models, semiparametric regression, spatial and spatiotemporal models, log-Gaussian Cox processes and geostatistical and geoadditive models. We consider approximate Bayesian inference in a popular subset of structured additive regression models, latent Gaussian models , where the latent field is Gaussian, controlled by a few hyperparameters and with non-Gaussian response variables. The posterior marginals are not available in closed form owing to the non-Gaussian response variables. For such models, Markov chain Monte Carlo methods can be implemented, but they are not without problems, in terms of both convergence and computational time. In some practical applications, the extent of these problems is such that Markov chain Monte Carlo sampling is simply not an appropriate tool for routine analysis. We show that, by using an integrated nested Laplace approximation and its simplified version, we can directly compute very accurate approximations to the posterior marginals. The main benefit of these approximations is computational: where Markov chain Monte Carlo algorithms need hours or days to run, our approximations provide more precise estimates in seconds or minutes. Another advantage with our approach is its generality, which makes it possible to perform Bayesian analysis in an automatic, streamlined way, and to compute model comparison criteria and various predictive measures so that models can be compared and the model under study can be challenged. 相似文献

16.

Bayesian density regression for discrete outcomes

Georgios Papageorgiou 《Australian & New Zealand Journal of Statistics》2019,61(3):336-359

We develop Bayesian models for density regression with emphasis on discrete outcomes. The problem of density regression is approached by considering methods for multivariate density estimation of mixed scale variables, and obtaining conditional densities from the multivariate ones. The approach to multivariate mixed scale outcome density estimation that we describe represents discrete variables, either responses or covariates, as discretised versions of continuous latent variables. We present and compare several models for obtaining these thresholds in the challenging context of count data analysis where the response may be over‐ and/or under‐dispersed in some of the regions of the covariate space. We utilise a nonparametric mixture of multivariate Gaussians to model the directly observed and the latent continuous variables. The paper presents a Markov chain Monte Carlo algorithm for posterior sampling, sufficient conditions for weak consistency, and illustrations on density, mean and quantile regression utilising simulated and real datasets. 相似文献

17.

Bayesian Analysis of Ordinal Survey Data Using the Dirichlet Process to Account for Respondent Personality Traits

Saman Muthukumarana Tim B. Swartz 《统计学通讯:模拟与计算》2013,42(1):82-98

This article presents a Bayesian latent variable model used to analyze ordinal response survey data by taking into account the characteristics of respondents. The ordinal response data are viewed as multivariate responses arising from continuous latent variables with known cut-points. Each respondent is characterized by two parameters that have a Dirichlet process as their joint prior distribution. The proposed mechanism adjusts for classes of personalities. The model is applied to student survey data in course evaluations. Goodness-of-fit (GoF) procedures are developed for assessing the validity of the model. The proposed GoF procedures are simple, intuitive, and do not seem to be a part of current Bayesian practice. 相似文献

18.

Isotone additive latent variable models

Sylvain Sardy Maria-Pia Victoria-Feser 《Statistics and Computing》2012,22(2):647-659

For manifest variables with additive noise and for a given number of latent variables with an assumed distribution, we propose to nonparametrically estimate the association between latent and manifest variables. Our estimation is a two step procedure: first it employs standard factor analysis to estimate the latent variables as theoretical quantiles of the assumed distribution; second, it employs the additive models’ backfitting procedure to estimate the monotone nonlinear associations between latent and manifest variables. The estimated fit may suggest a different latent distribution or point to nonlinear associations. We show on simulated data how, based on mean squared errors, the nonparametric estimation improves on factor analysis. We then employ the new estimator on real data to illustrate its use for exploratory data analysis. 相似文献

19.

Bayesian Tobit quantile regression with single-index models

《Journal of Statistical Computation and Simulation》2012,82(6):1247-1263

Based on the Bayesian framework of utilizing a Gaussian prior for the univariate nonparametric link function and an asymmetric Laplace distribution (ALD) for the residuals, we develop a Bayesian treatment for the Tobit quantile single-index regression model (TQSIM). With the location-scale mixture representation of the ALD, the posterior inferences of the latent variables and other parameters are achieved via the Markov Chain Monte Carlo computation method. TQSIM broadens the scope of applicability of the Tobit models by accommodating nonlinearity in the data. The proposed method is illustrated by two simulation examples and a labour supply dataset. 相似文献

20.

有序响应变量的贝叶斯模型选择及其在COPD疾病防治中的应用

赵为华王玲胡丹青冯俊丰《统计研究》2020,37(3):85-93

慢性阻塞性肺病(COPD)是一种发病率、死亡率都非常高的疾病，且COPD的诊断和严重程度分级依赖于肺功能的检查，但是由于肺功能检查仪器价格昂贵，使得这项检查在很多经济欠发达地区尤其是农村基层医院并没有普及。本文基于有序响应变量模型致力于研究一种便于基层和社区使用的可以初步判别COPD病情的模型，以期提高我国基层和社区的COPD 防治水平。利用贝叶斯变量选择方法和数据增强的潜变量策略得到了易于实施的Gibbs后验抽样算法。数值模拟分析进一步说明了本文提出的有序响应变量贝叶斯模型选择方法的有效性，实例分析得到了易于判别COPD严重程度的稀疏模型。相似文献