期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

C. S. Wong & W. K. Li 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2000,62(1):95-115

We generalize the Gaussian mixture transition distribution (GMTD) model introduced by Le and co-workers to the mixture autoregressive (MAR) model for the modelling of non-linear time series. The models consist of a mixture of K stationary or non-stationary AR components. The advantages of the MAR model over the GMTD model include a more full range of shape changing predictive distributions and the ability to handle cycles and conditional heteroscedasticity in the time series. The stationarity conditions and autocorrelation function are derived. The estimation is easily done via a simple EM algorithm and the model selection problem is addressed. The shape changing feature of the conditional distributions makes these models capable of modelling time series with multimodal conditional distributions and with heteroscedasticity. The models are applied to two real data sets and compared with other competing models. The MAR models appear to capture features of the data better than other competing models do. 相似文献

2.

Discretizing a compound distribution with application to categorical modelling

Monique Graf Desislava Nedyalkova 《Statistics》2017,51(3):685-710

Many probability distributions can be represented as compound distributions. Consider some parameter vector as random. The compound distribution is the expected distribution of the variable of interest given the random parameters. Our idea is to define a partition of the domain of definition of the random parameters, so that we can represent the expected density of the variable of interest as a finite mixture of conditional densities. We then model the mixture probabilities of the conditional densities using information on population categories, thus modifying the original overall model. We thus obtain specific models for sub-populations that stem from the overall model. The distribution of a sub-population of interest is thus completely specified in terms of mixing probabilities. All characteristics of interest can be derived from this distribution and the comparison between sub-populations easily proceeds from the comparison of the mixing probabilities. A real example based on EU-SILC data is given. Then the methodology is investigated through simulation. 相似文献

3.

Case-deletion Influence Measures for the Data from Multivariate t Distributions

Feng-Chang Xie Bo-Cheng Wei Jin-Guan Lin 《Journal of applied statistics》2007,34(8):907-921

For the data from multivariate t distributions, it is very hard to make an influence analysis based on the probability density function since its expression is intractable. In this paper, we present a technique for influence analysis based on the mixture distribution and EM algorithm. In fact, the multivariate t distribution can be considered as a particular Gaussian mixture by introducing the weights from the Gamma distribution. We treat the weights as the missing data and develop the influence analysis for the data from multivariate t distributions based on the conditional expectation of the complete-data log-likelihood function in the EM algorithm. Several case-deletion measures are proposed for detecting influential observations from multivariate t distributions. Two numerical examples are given to illustrate our methodology. 相似文献

4.

Mixture of linear mixed models using multivariate t distribution

《Journal of Statistical Computation and Simulation》2012,82(4):771-787

Linear mixed models are widely used when multiple correlated measurements are made on each unit of interest. In many applications, the units may form several distinct clusters, and such heterogeneity can be more appropriately modelled by a finite mixture linear mixed model. The classical estimation approach, in which both the random effects and the error parts are assumed to follow normal distribution, is sensitive to outliers, and failure to accommodate outliers may greatly jeopardize the model estimation and inference. We propose a new mixture linear mixed model using multivariate t distribution. For each mixture component, we assume the response and the random effects jointly follow a multivariate t distribution, to conveniently robustify the estimation procedure. An efficient expectation conditional maximization algorithm is developed for conducting maximum likelihood estimation. The degrees of freedom parameters of the t distributions are chosen data adaptively, for achieving flexible trade-off between estimation robustness and efficiency. Simulation studies and an application on analysing lung growth longitudinal data showcase the efficacy of the proposed approach. 相似文献

5.

Bivariate Kumaraswamy distribution: properties and a new method to generate bivariate classes

Wagner Barreto-Souza Artur J. Lemonte 《Statistics》2013,47(6):1321-1342

In this paper, we introduce a bivariate Kumaraswamy (BVK) distribution whose marginals are Kumaraswamy distributions. The cumulative distribution function of this bivariate model has absolutely continuous and singular parts. Representations for the cumulative and density functions are presented and properties such as marginal and conditional distributions, product moments and conditional moments are obtained. We show that the BVK model can be obtained from the Marshall and Olkin survival copula and obtain a tail dependence measure. The estimation of the parameters by maximum likelihood is discussed and the Fisher information matrix is determined. We propose an EM algorithm to estimate the parameters. Some simulations are presented to verify the performance of the direct maximum-likelihood estimation and the proposed EM algorithm. We also present a method to generate bivariate distributions from our proposed BVK distribution. Furthermore, we introduce a BVK distribution which has only an absolutely continuous part and discuss some of its properties. Finally, a real data set is analysed for illustrative purposes. 相似文献

6.

A bivariate mixture of negative binomial distributions and its applications

Deepak Singh 《统计学通讯:理论与方法》2020,49(17):4162-4177

Abstract

We construct a new bivariate mixture of negative binomial distributions which represents over-dispersed data more efficiently. This is an extension of a univariate mixture of beta and negative binomial distributions. Characteristics of this joint distribution are studied including conditional distributions. Some properties of the correlation coefficient are explored. We demonstrate the applicability of our proposed model by fitting to three real data sets with correlated count data. A comparison is made with some previously used models to show the effectiveness of the new model. 相似文献

7.

Identification of longitudinal biomarkers for survival by a score test derived from a joint model of longitudinal and competing risks data

Feng-Shou Ko 《Journal of applied statistics》2014,41(10):2270-2281

In this paper, we consider joint modelling of repeated measurements and competing risks failure time data. For competing risks time data, a semiparametric mixture model in which proportional hazards model are specified for failure time models conditional on cause and a multinomial model for the marginal distribution of cause conditional on covariates. We also derive a score test based on joint modelling of repeated measurements and competing risks failure time data to identify longitudinal biomarkers or surrogates for a time to event outcome in competing risks data. 相似文献

8.

Bayesian analysis of time series Poisson data

Man-Suk Oh Yong Bin Lim 《Journal of applied statistics》2001,28(2):259-271

This paper provides a practical simulation-based Bayesian analysis of parameter-driven models for time series Poisson data with the AR(1) latent process. The posterior distribution is simulated by a Gibbs sampling algorithm. Full conditional posterior distributions of unknown variables in the model are given in convenient forms for the Gibbs sampling algorithm. The case with missing observations is also discussed. The methods are applied to real polio data from 1970 to 1983. 相似文献

9.

Two-stage hierarchical modeling for analysis of subpopulations in conditional distributions

Chervoneva I Zhan T Iglewicz B Hauck WW Birk DE 《Journal of applied statistics》2012,39(2):445-460

In this work, we develop modeling and estimation approach for the analysis of cross-sectional clustered data with multimodal conditional distributions where the main interest is in analysis of subpopulations. It is proposed to model such data in a hierarchical model with conditional distributions viewed as finite mixtures of normal components. With a large number of observations in the lowest level clusters, a two-stage estimation approach is used. In the first stage, the normal mixture parameters in each lowest level cluster are estimated using robust methods. Robust alternatives to the maximum likelihood estimation are used to provide stable results even for data with conditional distributions such that their components may not quite meet normality assumptions. Then the lowest level cluster-specific means and standard deviations are modeled in a mixed effects model in the second stage. A small simulation study was conducted to compare performance of finite normal mixture population parameter estimates based on robust and maximum likelihood estimation in stage 1. The proposed modeling approach is illustrated through the analysis of mice tendon fibril diameters data. Analyses results address genotype differences between corresponding components in the mixtures and demonstrate advantages of robust estimation in stage 1. 相似文献

10.

Quantile smoothing in financial time series 总被引：1，自引：1，他引：0

Klaus Abberger 《Statistical Papers》1997,38(2):125-148

Various parametric models have been designed to analyze volatility in time series of financial market data. For maximum likelihood estimation these parametric methods require the assumption of a known conditional distribution. In this paper we examine the conditional distribution of daily DAX returns with the help of nonparametric methods. We use kernel estimators for conditional quantiles resulting from a kernel estimation of conditional distributions. This work was financially supported by the Deutsche Forschungsgemeinschaft 相似文献

11.

A Semiparametric Bayesian Approach to Multivariate Longitudinal Data

Ghosh P Hanson T 《Australian & New Zealand Journal of Statistics》2010,52(3):275-288

We extend the standard multivariate mixed model by incorporating a smooth time effect and relaxing distributional assumptions. We propose a semiparametric Bayesian approach to multivariate longitudinal data using a mixture of Polya trees prior distribution. Usually, the distribution of random effects in a longitudinal data model is assumed to be Gaussian. However, the normality assumption may be suspect, particularly if the estimated longitudinal trajectory parameters exhibit multimodality and skewness. In this paper we propose a mixture of Polya trees prior density to address the limitations of the parametric random effects distribution. We illustrate the methodology by analyzing data from a recent HIV-AIDS study. 相似文献

12.

Zero-modified power series distribution and its Hurdle distribution version

K. S. Conceição F. Louzada E. S. Helou 《Journal of Statistical Computation and Simulation》2017,87(9):1842-1862

This paper presents a new family of distributions for count data, the so called zero-modified power series (ZMPS), which is an extension of the power series (PS) distribution family, whose support starts at zero. This extension consists in modifying the probability of observing zero of each PS distribution, enabling the new zero-modified distribution to appropriately accommodate data which have any amount of zero observations (for instance, zero-inflated or zero-deflated data). The Hurdle distribution version of the ZMPS distribution is presented. PS distributions included in the proposed ZMPS family are the Poisson, Generalized Poisson, Geometric, Binomial, Negative Binomial and Generalized Negative Binomial distributions. The paper also describes the properties and particularities of the new distribution family for count data. The distribution parameters are estimated via maximum likelihood method and the use of the new family is illustrated in three real data sets. We emphasize that the new distribution family can accommodate sets of count data without any previous knowledge on the characteristic of zero-inflation or zero-deflation present in the data. 相似文献

13.

General results for the beta Weibull distribution

Gauss M. Cordeiro Edwin M.M. Ortega 《Journal of Statistical Computation and Simulation》2013,83(6):1082-1114

In this paper, we study some mathematical properties of the beta Weibull (BW) distribution, which is a quite flexible model in analysing positive data. It contains the Weibull, exponentiated exponential, exponentiated Weibull and beta exponential distributions as special sub-models. We demonstrate that the BW density can be expressed as a mixture of Weibull densities. We provide their moments and two closed-form expressions for their moment-generating function. We examine the asymptotic distributions of the extreme values. Explicit expressions are derived for the mean deviations, Bonferroni and Lorenz curves, reliability and two entropies. The density of the BW-order statistics is a mixture of Weibull densities and two closed-form expressions are derived for their moments. The estimation of the parameters is approached by two methods: moments and maximum likelihood. We compare the performances of the estimates obtained from both the methods by simulation. The expected information matrix is derived. For the first time, we introduce a log-BW regression model to analyse censored data. The usefulness of the BW distribution is illustrated in the analysis of three real data sets. 相似文献

14.

A comparison of nonparametric priors in hierarchical mixture modelling for AFT regression

Raffaele Argiento Alessandra Guglielmi Antonio Pievatolo 《Journal of statistical planning and inference》2009,139(12):3989-4005

We will pursue a Bayesian nonparametric approach in the hierarchical mixture modelling of lifetime data in two situations: density estimation, when the distribution is a mixture of parametric densities with a nonparametric mixing measure, and accelerated failure time (AFT) regression modelling, when the same type of mixture is used for the distribution of the error term. The Dirichlet process is a popular choice for the mixing measure, yielding a Dirichlet process mixture model for the error; as an alternative, we also allow the mixing measure to be equal to a normalized inverse-Gaussian prior, built from normalized inverse-Gaussian finite dimensional distributions, as recently proposed in the literature. Markov chain Monte Carlo techniques will be used to estimate the predictive distribution of the survival time, along with the posterior distribution of the regression parameters. A comparison between the two models will be carried out on the grounds of their predictive power and their ability to identify the number of components in a given mixture density. 相似文献

15.

Mixture inverse Gaussian for unobserved heterogeneity in the autoregressive conditional duration model

Emilio Gómez–Déniz 《统计学通讯:理论与方法》2017,46(18):9007-9025

In this paper, we assume that the duration of a process has two different intrinsic components or phases which are independent. The first is the time it takes for a trade to be initiated in the market (for example, the time during which agents obtain knowledge about the market in which they are operating and accumulate information, which is coherent with Brownian motion) and the second is the subsequent time required for the trade to develop into a complete duration. Of course, if the first time is zero then the trade is initiated immediately and no initial knowledge is required. If we assume a specific compound Bernoulli distribution for the first time and an inverse Gaussian distribution for the second, the resulting convolution model has a mixture of an inverse Gaussian distribution with its reciprocal, which allows us to specify and test the unobserved heterogeneity in the autoregressive conditional duration (ACD) model.

Our proposals make it possible not only to capture various density shapes of the durations but also easily to accommodate the behaviour of the tail of the distribution and the non monotonic hazard function. The proposed model is easy to fit and characterizes the behaviour of the conditional durations reasonably well in terms of statistical criteria based on point and density forecasts. 相似文献

16.

A new look at the inverse Gaussian distribution with applications to insurance and economic data

Antonio Punzo 《Journal of applied statistics》2019,46(7):1260-1287

Insurance and economic data are often positive, and we need to take into account this peculiarity in choosing a statistical model for their distribution. An example is the inverse Gaussian (IG), which is one of the most famous and considered distributions with positive support. With the aim of increasing the use of the IG distribution on insurance and economic data, we propose a convenient mode-based parameterization yielding the reparametrized IG (rIG) distribution; it allows/simplifies the use of the IG distribution in various branches of statistics, and we give some examples. In nonparametric statistics, we define a smoother based on rIG kernels. By construction, the estimator is well-defined and does not allocate probability mass to unrealistic negative values. We adopt likelihood cross-validation to select the smoothing parameter. In robust statistics, we propose the contaminated IG distribution, a heavy-tailed generalization of the rIG distribution to accommodate mild outliers. Finally, for model-based clustering and semiparametric density estimation, we present finite mixtures of rIG distributions. We use the EM algorithm to obtain maximum likelihood estimates of the parameters of the mixture and contaminated models. We use insurance data about bodily injury claims, and economic data about incomes of Italian households, to illustrate the models. 相似文献

17.

Model Selection Using Conditional Densities

Alberto Contreras-Cristán 《统计学通讯:模拟与计算》2013,42(3):455-469

In this article we propose a new method to select a discrete model f(x; θ), based on the conditional density of a sample given the value of a sufficient statistic for θ. The main idea is to work with a broad family of discrete distributions, called the family of power series distribution, for which there is a common sufficient statistic for the parameter of interest. The proposed method uses the maximum conditional density in order to select the best model.

We compare our proposal with the usual methodology based on Bayes factors. We provide several examples that show that our proposal works fine in most instances. Bayes factors are strongly dependent on the prior information about the parameters. Since our method does not require the specification of a prior distribution, it provides a useful alternative to Bayes factors. 相似文献

18.

Conditional Prior Proposals in Dynamic Models 总被引：2，自引：0，他引：2

Leonhard Knorr-Held 《Scandinavian Journal of Statistics》1999,26(1):129-144

ABSTRACT. Dynamic models extend state space models to non-normal observations. This paper suggests a specific hybrid Metropolis–Hastings algorithm as a simple device for Bayesian inference via Markov chain Monte Carlo in dynamic models. Hastings proposals from the (conditional) prior distribution of the unknown, time-varying parameters are used to update the corresponding full conditional distributions. It is shown through simulated examples that the methodology has optimal performance in situations where the prior is relatively strong compared to the likelihood. Typical examples include smoothing priors for categorical data. A specific blocking strategy is proposed to ensure good mixing and convergence properties of the simulated Markov chain. It is also shown that the methodology is easily extended to robust transition models using mixtures of normals. The applicability is illustrated with an analysis of a binomial and a binary time series, known in the literature. 相似文献

19.

A Monte Carlo Markov chain algorithm for a class of mixture time series models

John W. Lau Mike K. P. So 《Statistics and Computing》2011,21(1):69-81

This article generalizes the Monte Carlo Markov Chain (MCMC) algorithm, based on the Gibbs weighted Chinese restaurant (gWCR) process algorithm, for a class of kernel mixture of time series models over the Dirichlet process. This class of models is an extension of Lo’s (Ann. Stat. 12:351–357, 1984) kernel mixture model for independent observations. The kernel represents a known distribution of time series conditional on past time series and both present and past latent variables. The latent variables are independent samples from a Dirichlet process, which is a random discrete (almost surely) distribution. This class of models includes an infinite mixture of autoregressive processes and an infinite mixture of generalized autoregressive conditional heteroskedasticity (GARCH) processes. 相似文献

20.

Modeling proportions and marginal counts simultaneously for clustered multinomial data with random cluster sizes

Guohua Yan Renjun Ma 《Journal of applied statistics》2016,43(6):1074-1087

Clustered multinomial data with random cluster sizes commonly appear in health, environmental and ecological studies. Traditional approaches for analyzing clustered multinomial data contemplate two assumptions. One of these assumptions is that cluster sizes are fixed, whereas the other demands cluster sizes to be positive. Randomness of the cluster sizes may be the determinant of the within-cluster correlation and between-cluster variation. We propose a baseline-category mixed model for clustered multinomial data with random cluster sizes based on Poisson mixed models. Our orthodox best linear unbiased predictor approach to this model depends only on the moment structure of unobserved distribution-free random effects. Our approach also consolidates the marginal and conditional modeling interpretations. Unlike the traditional methods, our approach can accommodate both random and zero cluster sizes. Two real-life multinomial data examples, crime data and food contamination data, are used to manifest our proposed methodology. 相似文献