期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Estimation in nonlinear mixed-effects models using heavy-tailed distributions 总被引：1，自引：0，他引：1

Cristian Meza Felipe Osorio Rolando De?la?Cruz 《Statistics and Computing》2012,22(1):121-139

Nonlinear mixed-effects models are very useful to analyze repeated measures data and are used in a variety of applications. Normal distributions for random effects and residual errors are usually assumed, but such assumptions make inferences vulnerable to the presence of outliers. In this work, we introduce an extension of a normal nonlinear mixed-effects model considering a subclass of elliptical contoured distributions for both random effects and residual errors. This elliptical subclass, the scale mixtures of normal (SMN) distributions, includes heavy-tailed multivariate distributions, such as Student-t, the contaminated normal and slash, among others, and represents an interesting alternative to outliers accommodation maintaining the elegance and simplicity of the maximum likelihood theory. We propose an exact estimation procedure to obtain the maximum likelihood estimates of the fixed-effects and variance components, using a stochastic approximation of the EM algorithm. We compare the performance of the normal and the SMN models with two real data sets. 相似文献

2.

Double hierarchical generalized linear models (with discussion) 总被引：2，自引：0，他引：2

Youngjo Lee John A. Nelder 《Journal of the Royal Statistical Society. Series C, Applied statistics》2006,55(2):139-185

Summary. We propose a class of double hierarchical generalized linear models in which random effects can be specified for both the mean and dispersion. Heteroscedasticity between clusters can be modelled by introducing random effects in the dispersion model, as is heterogeneity between clusters in the mean model. This class will, among other things, enable models with heavy-tailed distributions to be explored, providing robust estimation against outliers. The h -likelihood provides a unified framework for this new class of models and gives a single algorithm for fitting all members of the class. This algorithm does not require quadrature or prior probabilities. 相似文献

3.

Using a mixture model for multiple imputation in the presence of outliers: the 'Healthy for life' project

Michael R. Elliott Nicolas Stettler 《Journal of the Royal Statistical Society. Series C, Applied statistics》2007,56(1):63-78

Summary. We consider the problem of obtaining population-based inference in the presence of missing data and outliers in the context of estimating the prevalence of obesity and body mass index measures from the 'Healthy for life' study. Identifying multiple outliers in a multivariate setting is problematic because of problems such as masking, in which groups of outliers inflate the covariance matrix in a fashion that prevents their identification when included, and swamping, in which outliers skew covariances in a fashion that makes non-outlying observations appear to be outliers. We develop a latent class model that assumes that each observation belongs to one of K unobserved latent classes, with each latent class having a distinct covariance matrix. We consider the latent class covariance matrix with the largest determinant to form an 'outlier class'. By separating the covariance matrix for the outliers from the covariance matrices for the remainder of the data, we avoid the problems of masking and swamping. As did Ghosh-Dastidar and Schafer, we use a multiple-imputation approach, which allows us simultaneously to conduct inference after removing cases that appear to be outliers and to promulgate uncertainty in the outlier status through the model inference. We extend the work of Ghosh-Dastidar and Schafer by embedding the outlier class in a larger mixture model, consider penalized likelihood and posterior predictive distributions to assess model choice and model fit, and develop the model in a fashion to account for the complex sample design. We also consider the repeated sampling properties of the multiple imputation removal of outliers. 相似文献

4.

Bayesian analysis of multivariate t linear mixed models with missing responses at random

《Journal of Statistical Computation and Simulation》2012,82(17):3594-3612

The multivariate t linear mixed model (MtLMM) has been recently proposed as a robust tool for analysing multivariate longitudinal data with atypical observations. Missing outcomes frequently occur in longitudinal research even in well controlled situations. As a powerful alternative to the traditional expectation maximization based algorithm employing single imputation, we consider a Bayesian analysis of the MtLMM to account for the uncertainties of model parameters and missing outcomes through multiple imputation. An inverse Bayes formulas sampler coupled with Metropolis-within-Gibbs scheme is used to effectively draw the posterior distributions of latent data and model parameters. The techniques for multiple imputation of missing values, estimation of random effects, prediction of future responses, and diagnostics of potential outliers are investigated as well. The proposed methodology is illustrated through a simulation study and an application to AIDS/HIV data. 相似文献

5.

Mixture of linear mixed models using multivariate t distribution

《Journal of Statistical Computation and Simulation》2012,82(4):771-787

Linear mixed models are widely used when multiple correlated measurements are made on each unit of interest. In many applications, the units may form several distinct clusters, and such heterogeneity can be more appropriately modelled by a finite mixture linear mixed model. The classical estimation approach, in which both the random effects and the error parts are assumed to follow normal distribution, is sensitive to outliers, and failure to accommodate outliers may greatly jeopardize the model estimation and inference. We propose a new mixture linear mixed model using multivariate t distribution. For each mixture component, we assume the response and the random effects jointly follow a multivariate t distribution, to conveniently robustify the estimation procedure. An efficient expectation conditional maximization algorithm is developed for conducting maximum likelihood estimation. The degrees of freedom parameters of the t distributions are chosen data adaptively, for achieving flexible trade-off between estimation robustness and efficiency. Simulation studies and an application on analysing lung growth longitudinal data showcase the efficacy of the proposed approach. 相似文献

6.

Bayesian inference for the variance components in general mixed linear models

Muthiya Rajagopalan Lyle Broemeling 《统计学通讯:理论与方法》2013,42(6):701-723

The general mixed linear model, containing both the fixed and random effects, is considered. Using gamma priors for the variance components, the conditional posterior distributions of the fixed effects and the variance components, conditional on the random effects, are obtained. Using the normal approximation for the multiple t distribution, approximations are obtained for the posterior distributions of the variance components in infinite series form. The same approximation Is used to obtain closed expressions for the moments of the variance components. An example is considered to illustrate the procedure and a numerical study examines the closeness of the approximations. 相似文献

7.

Bayesian Longitudinal Data Analysis with Mixed Models and Thick-tailed Distributions using MCMC

G. J. M. Rosa D. Gianola C. R. Padovani 《Journal of applied statistics》2004,31(7):855-873

Linear mixed effects models are frequently used to analyse longitudinal data, due to their flexibility in modelling the covariance structure between and within observations. Further, it is easy to deal with unbalanced data, either with respect to the number of observations per subject or per time period, and with varying time intervals between observations. In most applications of mixed models to biological sciences, a normal distribution is assumed both for the random effects and for the residuals. This, however, makes inferences vulnerable to the presence of outliers. Here, linear mixed models employing thick-tailed distributions for robust inferences in longitudinal data analysis are described. Specific distributions discussed include the Student-t, the slash and the contaminated normal. A Bayesian framework is adopted, and the Gibbs sampler and the Metropolis-Hastings algorithms are used to carry out the posterior analyses. An example with data on orthodontic distance growth in children is discussed to illustrate the methodology. Analyses based on either the Student-t distribution or on the usual Gaussian assumption are contrasted. The thick-tailed distributions provide an appealing robust alternative to the Gaussian process for modelling distributions of the random effects and of residuals in linear mixed models, and the MCMC implementation allows the computations to be performed in a flexible manner. 相似文献

8.

Exponential-Bound Property of Estimators and Variable Selection in Generalized Additive Models

Xiaoming Wang 《统计学通讯:理论与方法》2013,42(6):1105-1122

In this article, utilizing a scale mixture of skew-normal distribution in which mixing random variable is assumed to follow a mixture model with varying weights for each observation, we introduce a generalization of skew-normal linear regression model with the aim to provide resistant results. This model, which also includes the skew-slash distribution in a particular case, allows us to accommodate and detect outlying observations under the skew-normal linear regression model. Inferences about the model are carried out through the empirical Bayes approach. The conditions for propriety of the posterior and for existence of posterior moments are given under the standard noninformative priors for regression and scale parameters as well as proper prior for skewness parameter. Then, for Bayesian inference, a Markov chain Monte Carlo method is described. Since posterior results depend on the prior hyperparameters, we estimate them adopting the empirical Bayes method as well as using a Monte Carlo EM algorithm. Furthermore, to identify possible outliers, we also apply the Bayes factor obtained through the generalized Savage-Dickey density ratio. Examining the proposed approach on simulated instance and real data, it is found to provide not only satisfactory parameter estimates rather allow identifying outliers favorably. 相似文献

9.

Identification of outliers in a one-way random effects model

Jürgen?Wellmann Email author Ursula?Gather 《Statistical Papers》2003,44(3):335-348

We distinguish between three types of outliers in a one-way random effects model. These are formally described in terms of their position relative to the main part of the observations. We propose simple rules for identifying such outliers and give an example which involves median-based statistics. 相似文献

10.

Bayesian analysis of multivariate mortality data with large families

M.-H. Chen D. K. Dey & D. Sinha 《Journal of the Royal Statistical Society. Series C, Applied statistics》2000,49(1):129-144

This paper presents a Bayesian method for the analysis of toxicological multivariate mortality data when the discrete mortality rate for each family of subjects at a given time depends on familial random effects and the toxicity level experienced by the family. Our aim is to model and analyse one set of such multivariate mortality data with large family sizes: the potassium thiocyanate (KSCN) tainted fish tank data of O'Hara Hines. The model used is based on a discretized hazard with additional time-varying familial random effects. A similar previous study (using sodium thiocyanate (NaSCN)) is used to construct a prior for the parameters in the current study. A simulation-based approach is used to compute posterior estimates of the model parameters and mortality rates and several other quantities of interest. Recent tools in Bayesian model diagnostics and variable subset selection have been incorporated to verify important modelling assumptions regarding the effects of time and heterogeneity among the families on the mortality rate. Further, Bayesian methods using predictive distributions are used for comparing several plausible models. 相似文献

11.

Bayesian analysis of dynamic panel data by penalized quantile regression

Ali Aghamohammadi 《Statistical Methods and Applications》2018,27(1):91-108

Existing literature on quantile regression for panel data models with individual effects advocates the application of penalization to reduce the dynamic panel bias and increase the efficiency of the estimators. In this paper, we consider penalized quantile regression for dynamic panel data with random effects from a Bayesian perspective, where the penalty involves an adaptive Lasso shrinkage of the random effects. We also address the role of initial conditions in dynamic panel data models, emphasizing joint modeling of start-up and subsequent responses. For posterior inference, an efficient Gibbs sampler is developed to simulate the parameters from the posterior distributions. Through simulation studies and analysis of a real data set, we assess the performance of the proposed Bayesian method. 相似文献

12.

Semiparametric Bayesian classification with longitudinal markers 总被引：1，自引：0，他引：1

Rolando De la Cruz-Mesía Fernando A. Quintana Peter Müller 《Journal of the Royal Statistical Society. Series C, Applied statistics》2007,56(2):119-137

Summary. We analyse data from a study involving 173 pregnant women. The data are observed values of the β human chorionic gonadotropin hormone measured during the first 80 days of gestational age, including from one up to six longitudinal responses for each woman. The main objective in this study is to predict normal versus abnormal pregnancy outcomes from data that are available at the early stages of pregnancy. We achieve the desired classification with a semiparametric hierarchical model. Specifically, we consider a Dirichlet process mixture prior for the distribution of the random effects in each group. The unknown random-effects distributions are allowed to vary across groups but are made dependent by using a design vector to select different features of a single underlying random probability measure. The resulting model is an extension of the dependent Dirichlet process model, with an additional probability model for group classification. The model is shown to perform better than an alternative model which is based on independent Dirichlet processes for the groups. Relevant posterior distributions are summarized by using Markov chain Monte Carlo methods. 相似文献

13.

Density-Tempered Marginalized Sequential Monte Carlo Samplers

Jin-Chuan Duan Andras Fulop 《商业与经济统计学杂志》2013,31(2):192-202

We propose a density-tempered marginalized sequential Monte Carlo (SMC) sampler, a new class of samplers for full Bayesian inference of general state-space models. The dynamic states are approximately marginalized out using a particle filter, and the parameters are sampled via a sequential Monte Carlo sampler over a density-tempered bridge between the prior and the posterior. Our approach delivers exact draws from the joint posterior of the parameters and the latent states for any given number of state particles and is thus easily parallelizable in implementation. We also build into the proposed method a device that can automatically select a suitable number of state particles. Since the method incorporates sample information in a smooth fashion, it delivers good performance in the presence of outliers. We check the performance of the density-tempered SMC algorithm using simulated data based on a linear Gaussian state-space model with and without misspecification. We also apply it on real stock prices using a GARCH-type model with microstructure noise. 相似文献

14.

Adaptive Bayesian Procedures Using Random Series Priors

下载免费PDF全文

Weining Shen Subhashis Ghosal 《Scandinavian Journal of Statistics》2015,42(4):1194-1213

We consider a general class of prior distributions for nonparametric Bayesian estimation which uses finite random series with a random number of terms. A prior is constructed through distributions on the number of basis functions and the associated coefficients. We derive a general result on adaptive posterior contraction rates for all smoothness levels of the target function in the true model by constructing an appropriate ‘sieve’ and applying the general theory of posterior contraction rates. We apply this general result on several statistical problems such as density estimation, various nonparametric regressions, classification, spectral density estimation and functional regression. The prior can be viewed as an alternative to the commonly used Gaussian process prior, but properties of the posterior distribution can be analysed by relatively simpler techniques. An interesting approximation property of B‐spline basis expansion established in this paper allows a canonical choice of prior on coefficients in a random series and allows a simple computational approach without using Markov chain Monte Carlo methods. A simulation study is conducted to show that the accuracy of the Bayesian estimators based on the random series prior and the Gaussian process prior are comparable. We apply the method on Tecator data using functional regression models. 相似文献

15.

On the difference in inference and prediction between the joint and independent f-error models for seemingly unrelated regressions

Jeanne Kowalski Jose R. Mendoza-Blanco Xim M. Tu Leon J. Gleser 《统计学通讯:理论与方法》2013,42(9):2119-2140

We consider likelihood and Bayesian inferences for seemingly unrelated (linear) regressions for the joint niultivariate terror (e.g. Zellner, 1976) and the independent t-error (e.g. Maronna, 1976) models. For likelihood inference, the scale matrix and the shape parameter for the joint terror model cannot be consistently estimated because of the lack of adequate information to identify the latter. The joint terror model also yields the same MLEs for the regression coefficients and the scale matrix as for the independent normal error model. which are not robust against outliers. Further, linear hypotheses with respect

to the regression coefficients also give rise to the same mill distributions AS for the independent normal error model, though the MLE has a non-normal limiting distribution. In contrast to the striking similarities between the joint t-error and the independent normal error models, the independent f-error model yields AiLEs that are lubust against uuthers. Since the MLE of the shape parameter reflects the tails of the data distributions, this model extends the independent normal error model for modeling data distributions with relatively t hicker tails. These differences are also discussed with respect to the posterior and predictive distributions for Bayesian inference. 相似文献

16.

A bayesian wls approach to generalized linear models

Peyton Cook Lyle D. Broemeling 《统计学通讯:理论与方法》2013,42(12):3323-3347

We use a Bayesian approach to fitting a linear regression model to transformations of the natural parameter for the exponential class of distributions. The usual Bayesian approach is to assume that a linear model exactly describes the relationship among the natural parameters. We assume only that a linear model is approximately in force. We approximate the theta-links by using a linear model obtained by minimizing the posterior expectation of a loss function.While some posterior results can be obtained analytically considerable generality follows from an exact Monte Carlo method for obtaining random samples of parameter values or functions of parameter values from their respective posterior distributions. The approach that is presented is justified for small samples, requires only one-dimensional numerical integrations, and allows for the use of regression matrices with less than full column rank. Two numerical examples are provided. 相似文献

17.

Topics on Dynamic Panel Data Models with Random Effects Using Semi-Parametric Bayesian Approach

Iraj Kazemi Reyhaneh Rikhtehgaran 《统计学通讯:理论与方法》2014,43(8):1630-1648

A common assumption in fitting panel data models is normality of stochastic subject effects. This can be extremely restrictive, making vague most potential features of true distributions. The objective of this article is to propose a modeling strategy, from a semi-parametric Bayesian perspective, to specify a flexible distribution for the random effects in dynamic panel data models. This is addressed here by assuming the Dirichlet process mixture model to introduce Dirichlet process prior for the random-effects distribution. We address the role of initial conditions in dynamic processes, emphasizing on joint modeling of start-up and subsequent responses. We adopt Gibbs sampling techniques to approximate posterior estimates. These important topics are illustrated by a simulation study and also by testing hypothetical models in two empirical contexts drawn from economic studies. We use modified versions of information criteria to compare the fitted models. 相似文献

18.

A non-iterative Bayesian sampling algorithm for censored Student-t linear regression models

《Journal of Statistical Computation and Simulation》2012,82(16):3337-3355

ABSTRACT

In this paper, we consider an effective Bayesian inference for censored Student-t linear regression model, which is a robust alternative to the usual censored Normal linear regression model. Based on the mixture representation of the Student-t distribution, we propose a non-iterative Bayesian sampling procedure to obtain independently and identically distributed samples approximately from the observed posterior distributions, which is different from the iterative Markov Chain Monte Carlo algorithm. We conduct model selection and influential analysis using the posterior samples to choose the best fitted model and to detect latent outliers. We illustrate the performance of the procedure through simulation studies, and finally, we apply the procedure to two real data sets, one is the insulation life data with right censoring and the other is the wage rates data with left censoring, and we get some interesting results. 相似文献

19.

Bayesian estimates in a one-way ANOVA random effects model

Guorui Bian 《Australian & New Zealand Journal of Statistics》2002,44(1):99-108

Bayesian estimators of variance components are developed, based on posterior mean and posterior mode, respectively, in a one-way ANOVA random effects model with independent prior distributions. The formulas for the proposed estimators are simple. The estimators give sensible results for 'badly-behaved' datasets, where the standard unbiased estimates are negative. They are markedly robust as compared to the existing estimators such as the maximum likelihood estimators and the maximum posterior density estimators. 相似文献

20.

A Bayesian Hierarchical Model for Multiple Comparisons in Mixed Models

Qie Li 《统计学通讯:理论与方法》2013,42(23):5071-5090

We propose a Bayesian hierarchical model for multiple comparisons in mixed models where the repeated measures on subjects are described with the subject random effects. The model facilitates inferences in parameterizing the successive differences of the population means, and for them, we choose independent prior distributions that are mixtures of a normal distribution and a discrete distribution with its entire mass at zero. For the other parameters, we choose conjugate or vague priors. The performance of the proposed hierarchical model is investigated in the simulated and two real data sets, and the results illustrate that the proposed hierarchical model can effectively conduct a global test and pairwise comparisons using the posterior probability that any two means are equal. A simulation study is performed to analyze the type I error rate, the familywise error rate, and the test power. The Gibbs sampler procedure is used to estimate the parameters and to calculate the posterior probabilities. 相似文献