期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Semiparametric Estimation with Profile Algorithm for Longitudinal Binary Data

Suliadi Suliadi Noor Akma Ibrahim Isa Daud 《统计学通讯:模拟与计算》2013,42(1):138-152

This article considers analyzing longitudinal binary data semiparametrically and proposing GEE-Smoothing spline in the estimation of parametric and nonparametric components. The method is an extension of the parametric generalized estimating equation to semiparametric. The nonparametric component is estimated by smoothing spline approach, i.e., natural cubic spline. We use profile algorithm in the estimation of both parametric and nonparametric components. Properties of the estimators are evaluated by simulation. 相似文献

2.

EM Estimation for Finite Mixture Models with Known Mixture Component Size

Chen Teel Allan R. Sampson 《统计学通讯:模拟与计算》2015,44(6):1545-1556

We consider the use of an EM algorithm for fitting finite mixture models when mixture component size is known. This situation can occur in a number of settings, where individual membership is unknown but aggregate membership is known. When the mixture component size, i.e., the aggregate mixture component membership, is known, it is common practice to treat only the mixing probability as known. This approach does not, however, entirely account for the fact that the number of observations within each mixture component is known, which may result in artificially incorrect estimates of parameters. By fully capitalizing on the available information, the proposed EM algorithm shows robustness to the choice of starting values and exhibits numerically stable convergence properties. 相似文献

3.

Mixed Graphical Models with Missing Data and the Partial Imputation EM Algorithm 总被引：2，自引：0，他引：2

Zhi Geng Kang Wan & Feng Tao 《Scandinavian Journal of Statistics》2000,27(3):433-444

In this paper we discuss graphical models for mixed types of continuous and discrete variables with incomplete data. We use a set of hyperedges to represent an observed data pattern. A hyperedge is a set of variables observed for a group of individuals. In a mixed graph with two types of vertices and two types of edges, dots and circles represent discrete and continuous variables respectively. A normal graph represents a graphical model and a hypergraph represents an observed data pattern. In terms of the mixed graph, we discuss decomposition of mixed graphical models with incomplete data, and we present a partial imputation method which can be used in the EM algorithm and the Gibbs sampler to speed their convergence. For a given mixed graphical model and an observed data pattern, we try to decompose a large graph into several small ones so that the original likelihood can be factored into a product of likelihoods with distinct parameters for small graphs. For the case that a graph cannot be decomposed due to its observed data pattern, we can impute missing data partially so that the graph can be decomposed. 相似文献

4.

Marginal Correlation in Longitudinal Binary Data Based on Generalized Linear Mixed Models

Tony Vangeneugden Geert Molenberghs Annouschka Laenen Helena Geys Caroline Beunckens Cristina Sotto 《统计学通讯:理论与方法》2013,42(19):3540-3557

This work aims at investigating marginal correlation within and between longitudinal data sequences. Useful and intuitive approximate expressions are derived based on generalized linear mixed models. Data from four double-blind randomized clinical trials are used to estimate the intra-class coefficient of reliability for a binary response. Additionally, the correlation between such a binary response and a continuous response is derived to evaluate the criterion validity of the binary response variable and the established continuous response variable. 相似文献

5.

Modeling Bimodal Discrete Data Using Conway-Maxwell-Poisson Mixture Models

Pragya Sur Galit Shmueli Smarajit Bose Paromita Dubey 《商业与经济统计学杂志》2013,31(3):352-365

Bimodal truncated count distributions are frequently observed in aggregate survey data and in user ratings when respondents are mixed in their opinion. They also arise in censored count data, where the highest category might create an additional mode. Modeling bimodal behavior in discrete data is useful for various purposes, from comparing shapes of different samples (or survey questions) to predicting future ratings by new raters. The Poisson distribution is the most common distribution for fitting count data and can be modified to achieve mixtures of truncated Poisson distributions. However, it is suitable only for modeling equidispersed distributions and is limited in its ability to capture bimodality. The Conway–Maxwell–Poisson (CMP) distribution is a two-parameter generalization of the Poisson distribution that allows for over- and underdispersion. In this work, we propose a mixture of CMPs for capturing a wide range of truncated discrete data, which can exhibit unimodal and bimodal behavior. We present methods for estimating the parameters of a mixture of two CMP distributions using an EM approach. Our approach introduces a special two-step optimization within the M step to estimate multiple parameters. We examine computational and theoretical issues. The methods are illustrated for modeling ordered rating data as well as truncated count data, using simulated and real examples. 相似文献

6.

Multivariate Latent Growth Models for Mixed Data with Covariate Effects

《统计学通讯:理论与方法》2012,41(16-17):3079-3093

The paper presents an extension of a new class of multivariate latent growth models (Bianconcini and Cagnone, 2012) to allow for covariate effects on manifest, latent variables and random effects. The new class of models combines: (i) multivariate latent curves that describe the temporal behavior of the responses, and (ii) a factor model that specifies the relationship between manifest and latent variables. Based on the Generalized Linear and Latent Variable Model framework (Bartholomew and Knott, 1999), the response variables are assumed to follow different distributions of the exponential family, with item-specific linear predictors depending on both latent variables and measurement errors. A full maximum likelihood method is used to estimate all the model parameters simultaneously. Data coming from the Data WareHouse of the University of Bologna are used to illustrate the methodology. 相似文献

7.

Weak Instrumental Variables Models for Longitudinal Data

Zongwu Cai Henong Li 《Econometric Reviews》2013,32(4):361-389

This article considers the estimation and testing of a within-group two-stage least squares (TSLS) estimator for instruments with varying degrees of weakness in a longitudinal (panel) data model. We show that adding the repeated cross-sectional information into a regression model can improve the estimation in weak instruments. Moreover, the consistency and limiting distribution of the TSLS estimator are established when both N and T tend to infinity. Some asymptotically pivotal tests are extended to a longitudinal data model and their asymptotic properties are examined. A Monte Carlo experiment is conducted to evaluate the finite sample performance of the proposed estimators. 相似文献

8.

Models for longitudinal data with censored changepoints

Christopher H. Jackson Linda D. Sharples 《Journal of the Royal Statistical Society. Series C, Applied statistics》2004,53(1):149-162

Summary. In longitudinal studies of biological markers, different individuals may have different underlying patterns of response. In some applications, a subset of individuals experiences latent events, causing an instantaneous change in the level or slope of the marker trajectory. The paper presents a general mixture of hierarchical longitudinal models for serial biomarkers. Interest centres both on the time of the event and on levels of the biomarker before and after the event. In observational studies where marker series are incomplete, the latent event can be modelled by a survival distribution. Risk factors for the occurrence of the event can be investigated by including covariates in the survival distribution. A combination of Gibbs, Metropolis–Hastings and reversible jump Markov chain Monte Carlo sampling is used to fit the models to serial measurements of forced expiratory volume from lung transplant recipients. 相似文献

9.

Moments of Mixture Periodic Autoregressive Models

M. Bentarzi M. Merzougui 《统计学通讯:理论与方法》2013,42(22):3937-3947

This article deals with the study of some properties of a mixture periodically correlated autoregressive (MPAR _S) time series model, which extends the mixture time invariant parameter autoregressive (MAR) model, that has recently received a considerable interest from many economic time series analysts, to mixture periodic parameter autoregressive model. The aim behind this extension is to make the model able to capture, in addition to all features captured by the classical MAR model, the periodicity feature exhibited by the autocovariance structure of many encountered financial and environmental time series with eventual multimodal distributions. Our main contribution here is obtaining of the second moment periodically stationary condition for a MPAR _S(K; 2,…, 2) model, furthermore the closed-form of the second moment is obtained. 相似文献

10.

Generalized Linear Factor Models: A New Local EM Estimation Algorithm

Mohamed Saidane Xavier Bry Christian Lavergne 《统计学通讯:理论与方法》2013,42(16):2944-2958

In this article, a general approach to latent variable models based on an underlying generalized linear model (GLM) with factor analysis observation process is introduced. We call these models Generalized Linear Factor Models (GLFM). The observations are produced from a general model framework that involves observed and latent variables that are assumed to be distributed in the exponential family. More specifically, we concentrate on situations where the observed variables are both discretely measured (e.g., binomial, Poisson) and continuously distributed (e.g., gamma). The common latent factors are assumed to be independent with a standard multivariate normal distribution. Practical details of training such models with a new local expectation-maximization (EM) algorithm, which can be considered as a generalized EM-type algorithm, are also discussed. In conjunction with an approximated version of the Fisher score algorithm (FSA), we show how to calculate maximum likelihood estimates of the model parameters, and to yield inferences about the unobservable path of the common factors. The methodology is illustrated by an extensive Monte Carlo simulation study and the results show promising performance. 相似文献

11.

Smooth-Threshold GEE Variable Selection in High-Dimensional Partially Linear Models with Longitudinal Data

Ruiqin Tian Liugen Xue 《统计学通讯:模拟与计算》2015,44(7):1720-1734

We consider the problem of variable selection in high-dimensional partially linear models with longitudinal data. A variable selection procedure is proposed based on the smooth-threshold generalized estimating equation (SGEE). The proposed procedure automatically eliminates inactive predictors by setting the corresponding parameters to be zero, and simultaneously estimates the nonzero regression coefficients by solving the SGEE. We establish the asymptotic properties in a high-dimensional framework where the number of covariates p_n increases as the number of clusters n increases. Extensive Monte Carlo simulation studies are conducted to examine the finite sample performance of the proposed variable selection procedure. 相似文献

12.

An EM and a Stochastic Version of the EM Algorithm for Nonparametric Hidden Semi-Markov Models

Sonia Malefaki Nikolaos Limnios 《统计学通讯:模拟与计算》2013,42(2):240-261

The Hidden semi-Markov models (HSMMs) were introduced to overcome the constraint of a geometric sojourn time distribution for the different hidden states in the classical hidden Markov models. Several variations of HSMMs were proposed that model the sojourn times by a parametric or a nonparametric family of distributions. In this article, we concentrate our interest on the nonparametric case where the duration distributions are attached to transitions and not to states as in most of the published papers in HSMMs. Therefore, it is worth noticing that here we treat the underlying hidden semi-Markov chain in its general probabilistic structure. In that case, Barbu and Limnios (2008 Barbu , V. , Limnios , N. ( 2008 ). Semi-Markov Chains and Hidden Semi-Markov Models Toward Applications: Their Use in Reliability and DNA Analysis . New York : Springer . [Google Scholar]) proposed an Expectation–Maximization (EM) algorithm in order to estimate the semi-Markov kernel and the emission probabilities that characterize the dynamics of the model. In this article, we consider an improved version of Barbu and Limnios' EM algorithm which is faster than the original one. Moreover, we propose a stochastic version of the EM algorithm that achieves comparable estimates with the EM algorithm in less execution time. Some numerical examples are provided which illustrate the efficient performance of the proposed algorithms. 相似文献

13.

Asymptotic Normality for the Partially Linear EV Models with Longitudinal Data

Qiang Liu 《统计学通讯:理论与方法》2013,42(7):1149-1158

In this article, we consider a partially linear EV regression model under longitudinal data. By using a weighted kernel method and modified least-squared method, the estimators of unknown parameter, the unknown function are constructed and the asymptotic normality of the estimators are derived. Simulation studies are conducted to illustrate the finite-sample performance of the proposed method. 相似文献

14.

Variable Selection in Semiparametric Quantile Modeling for Longitudinal Data

Kangning Wang 《统计学通讯:理论与方法》2013,42(11):2243-2266

We propose a penalized quantile regression for partially linear varying coefficient (VC) model with longitudinal data to select relevant non parametric and parametric components simultaneously. Selection consistency and oracle property are established. Furthermore, if linear part and VC part are unknown, we propose a new unified method, which can do three types of selections: separation of varying and constant effects, selection of relevant variables, and it can be carried out conveniently in one step. Consistency in the three types of selections and oracle property in estimation are established as well. Simulation studies and real data analysis also confirm our method. 相似文献

15.

A Mixture Model for Longitudinal Partially Ranked Data

Brian Francis Regina Dittrich Reinhold Hatzinger Les Humphreys 《统计学通讯:理论与方法》2014,43(4):722-734

This article discusses the use of mixture models in the analysis of longitudinal partially ranked data, where respondents, for example, choose only the preferred and second preferred out of a set of items. To model such data we convert it to a set of paired comparisons. Covariates can be incorporated into the model. We use a nonparametric mixture to account for unmeasured variability in individuals over time. The resulting multi-valued mass points can be interpreted as latent classes of the items. The work is illustrated by two questions on (post)materialism in three sweeps of the British Household Panel Survey. 相似文献

16.

Modeling Longitudinal Obesity Data with Intermittent Missingness Using a New Latent Variable Model

Li Qin Lisa Weissfeld Marsha D. Marcus Michele D. Levine Feng Dai 《统计学通讯:模拟与计算》2016,45(6):2018-2031

We propose a latent variable model for informative missingness in longitudinal studies which is an extension of latent dropout class model. In our model, the value of the latent variable is affected by the missingness pattern and it is also used as a covariate in modeling the longitudinal response. So the latent variable links the longitudinal response and the missingness process. In our model, the latent variable is continuous instead of categorical and we assume that it is from a normal distribution. The EM algorithm is used to obtain the estimates of the parameter we are interested in and Gauss–Hermite quadrature is used to approximate the integration of the latent variable. The standard errors of the parameter estimates can be obtained from the bootstrap method or from the inverse of the Fisher information matrix of the final marginal likelihood. Comparisons are made to the mixed model and complete-case analysis in terms of a clinical trial dataset, which is Weight Gain Prevention among Women (WGPW) study. We use the generalized Pearson residuals to assess the fit of the proposed latent variable model. 相似文献

17.

Using Convex Combination of Copulas and EM Algorithm for Credibility

Lixin Song Fang Liu 《统计学通讯:理论与方法》2014,43(6):1222-1233

In this article, using longitudinal data, we develop the theory of credibility by copula model. The convex combination of copulas is used to describe the dependencies among claims. Finally, for comparing with the results of a single copula, using EM algorithm, some simulations of Massachusetts automobile claims are presented. 相似文献

18.

On Mixture Periodic Vector Autoregressive Models

M. Bentarzi L. Djeddou 《统计学通讯:模拟与计算》2013,42(10):2325-2352

This article deals with the study of some properties of a mixture periodically correlated n-variate vector autoregressive (MPVAR) time series model, which extends the mixture time invariant parameter n-vector autoregressive (MVAR) model that has been recently studied by Fong et al. (2007 Fong, P.W., Li, W.K., Yau, C.W., Wong, C.S. (2007). On a mixture vector autoregressive model. The Canadian Journal of Statistics 35:135–150.[Crossref], [Web of Science ®] , [Google Scholar]). Our main contributions here are, on the one side, the obtaining of the second moment periodically stationary condition for a n-variate MPVAR_S(n; K; 2, …, 2) model; furthermore, the closed-form of the second moment is obtained and, on the other side, the estimation, via the Expectation-Maximization (EM) algorithm, of the coefficient matrices and the error variance matrix. 相似文献

19.

Empirical Likelihood Inferences for Semiparametric Varying-Coefficient Partially Linear Models with Longitudinal Data

Peixin Zhao Liugen Xue 《统计学通讯:理论与方法》2013,42(11):1898-1914

In this article, empirical likelihood inferences for semiparametric varying-coefficient partially linear models with longitudinal data are investigated. We propose a groupwise empirical likelihood procedure to handle the inter-series dependence of the longitudinal data. By using residual-adjustment, an empirical likelihood ratio function for the nonparametric component is constructed, and a nonparametric version Wilks' phenomenons is proved. Compared with methods based on normal approximations, the empirical likelihood does not require consistent estimators for the asymptotic variance and bias. A simulation study is undertaken to assess the finite sample performance of the proposed confidence regions. 相似文献

20.

Comparison of Mixed-Effects Models for Skew-Normal Responses with an Application to AIDS Data: A Bayesian Approach

Yangxin Huang Getachew Dagne 《统计学通讯:模拟与计算》2013,42(6):1268-1287

The potency of antiretroviral agents in AIDS clinical trials can be assessed on the basis of a viral response such as viral decay rate or change in viral load (number of HIV RNA copies in plasma). Linear, nonlinear, and nonparametric mixed-effects models have been proposed to estimate such parameters in viral dynamic models. However, there are two critical questions that stand out: whether these models achieve consistent estimates for viral decay rates, and which model is more appropriate for use in practice. Moreover, one often assumes that a model random error is normally distributed, but this assumption may be unrealistic, obscuring important features of within- and among-subject variations. In this article, we develop a skew-normal (SN) Bayesian linear mixed-effects (SN-BLME) model, an SN Bayesian nonlinear mixed-effects (SN-BNLME) model, and an SN Bayesian semiparametric nonlinear mixed-effects (SN-BSNLME) model that relax the normality assumption by considering model random error to have an SN distribution. We compare the performance of these SN models, and also compare their performance with the corresponding normal models. An AIDS dataset is used to test the proposed models and methods. It was found that there is a significant incongruity in the estimated viral decay rates. The results indicate that SN-BSNLME model is preferred to the other models, implying that an arbitrary data truncation is not necessary. The findings also suggest that it is important to assume a model with an SN distribution in order to achieve reasonable results when the data exhibit skewness. 相似文献