Global concavity of the likelihood function is proved by means of an inequality involving the trigamma function. The computation of maximum likelihood estimates is discussed.

In estimating the proportion ‘cured’ after adjuvant treatment, a population of cancer patients can be assumed to be a mixture of two Gompertz subpopulations, those who will die of other causes with no evidence of disease relapse and those who will die of their primary cancer. Estimates of the parameters of the component dying of other causes can be obtained from census data, whereas maximum likelihood estimates for the proportion cured and for the parameters of the component of patients dying of cancer can be obtained from follow-up data.

This paper examines, through simulation of follow-up data, the feasibility of maximum likelihood estimation of a mixture of two Gompertz distributions when censoring occurs. Means, variances and mean square errors of the maximum likelihood estimates, together with the estimated asymptotic variance-covariance matrix, are obtained from the simulated samples. The relationship of these variances to sample size, proportion censored, mixing proportion and population parameters is considered.

Moderate sample sizes typical of cooperative trials yield clinically acceptable estimates. Both increasing the sample size and decreasing the proportion of censored data decrease the variances and covariances of the parameter estimates. Useful results can be obtained with data which are as much as 50% censored. Moreover, if the sample size is sufficiently large, survival data which are as much as 70% censored can yield satisfactory results.
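
The likelihood being maximized in such a study can be sketched as follows. This is only an illustration under stated assumptions, not the authors' code: the hazard parameterization h(t) = b·exp(c·t), the function names, and the argument layout are all hypothetical. An observed death contributes the mixture density and a censored time contributes the mixture survival function.

```python
import math

def gompertz_survival(t, b, c):
    """S(t) for the assumed hazard h(t) = b * exp(c * t), with b > 0, c > 0."""
    return math.exp(-(b / c) * (math.exp(c * t) - 1.0))

def gompertz_density(t, b, c):
    """f(t) = h(t) * S(t) under the same assumed parameterization."""
    return b * math.exp(c * t) * gompertz_survival(t, b, c)

def mixture_loglik(times, events, p, other, cancer):
    """Censored log-likelihood of a two-component Gompertz mixture.

    times  : follow-up times
    events : 1 if death was observed, 0 if the time is censored
    p      : mixing proportion of the 'other causes' (cured) component
    other  : (b, c) of the component dying of other causes
    cancer : (b, c) of the component dying of the primary cancer
    """
    total = 0.0
    for t, d in zip(times, events):
        if d:
            # Observed death: contributes the mixture density.
            value = (p * gompertz_density(t, *other)
                     + (1.0 - p) * gompertz_density(t, *cancer))
        else:
            # Censored: contributes the mixture survival probability.
            value = (p * gompertz_survival(t, *other)
                     + (1.0 - p) * gompertz_survival(t, *cancer))
        total += math.log(value)
    return total
```

In the setting described above, the `other` parameters would be held fixed at census-based values while `p` and `cancer` are passed to a numerical optimizer.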

In this paper, we consider a mixture of two uniform distributions and derive L-moment estimators of its parameters. Three possible ways of mixing two uniforms, namely with neither overlap nor gap, with overlap, and with gap, are studied. The performance of these L-moment estimators in terms of bias and efficiency is compared to that obtained by means of the conventional method of moments (MM), modified maximum likelihood (MML) method and the usual maximum likelihood (ML) method. Intensive simulations reveal that MML estimators are the best in most of the cases, and the L-moment estimators are less subject to bias in estimation for some mixtures and more efficient in most of the cases than the conventional MM estimators. The L-moment estimators are, in some cases, more efficient than the ML and MML estimators.
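
To make the idea of an L-moment estimator concrete, here is a minimal sketch for a single uniform distribution only. The mixture estimators derived in the paper are not reproduced here, and the function names are hypothetical. It uses the standard facts that Uniform(a, b) has first L-moment (a + b)/2 and second L-moment (b - a)/6, and matches them to the sample L-moments.

```python
def sample_l_moments(data):
    """First two sample L-moments via probability-weighted moments."""
    xs = sorted(data)
    n = len(xs)
    b0 = sum(xs) / n
    # b1 weights the i-th order statistic (0-based) by i / (n - 1).
    b1 = sum(i * x for i, x in enumerate(xs)) / (n * (n - 1))
    return b0, 2.0 * b1 - b0

def uniform_l_moment_fit(data):
    """Estimate (a, b) of one Uniform(a, b) by matching L-moments:
    l1 = (a + b) / 2 and l2 = (b - a) / 6."""
    l1, l2 = sample_l_moments(data)
    return l1 - 3.0 * l2, l1 + 3.0 * l2
```

Because sample L-moments are linear in the order statistics, they are less sensitive to extreme observations than conventional moments.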

The skew-normal and the skew-t distributions are parametric families which are currently under intense investigation since they provide a more flexible formulation compared to the classical normal and t distributions by introducing a parameter which regulates their skewness. While these families enjoy attractive formal properties from the probability viewpoint, a practical problem with their usage in applications is the possibility that the maximum likelihood estimate of the parameter which regulates skewness diverges. This situation has vanishing probability for increasing sample size, but for finite samples it occurs with non-negligible probability, and its occurrence has unpleasant effects on the inferential process. Methods for overcoming this problem have been put forward both in the classical and in the Bayesian formulation, but their applicability is restricted to simple situations. We formulate a proposal based on the idea of penalized likelihood, which has connections with some of the existing methods, but it applies more generally, including the multivariate case.
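
The divergence problem can be seen in a small sketch. It assumes the simplest case, with location fixed at 0 and scale fixed at 1, so that the density is 2·φ(x)·Φ(αx); the sample values and function names are made up for illustration. When every observation is positive, each term Φ(αx) increases with α, so the log-likelihood has no finite maximizer.

```python
import math

def std_normal_cdf(z):
    """Standard normal CDF written with erfc."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))

def skew_normal_loglik(alpha, sample):
    """Log-likelihood of the skewness parameter alpha, with location 0
    and scale 1 held fixed (an assumption made for this illustration)."""
    total = 0.0
    for x in sample:
        log_phi = -0.5 * x * x - 0.5 * math.log(2.0 * math.pi)
        total += math.log(2.0) + log_phi + math.log(std_normal_cdf(alpha * x))
    return total

# Hypothetical all-positive sample: the profile keeps increasing in alpha.
sample = [0.2, 0.5, 1.1, 1.7]
profile = [skew_normal_loglik(a, sample) for a in (0.0, 1.0, 2.0, 5.0, 10.0)]
```

A penalized likelihood subtracts a term that grows with |α|, which restores a finite maximizer in samples like this one.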

This paper deals with the problem of maximum likelihood estimation for a mixture of skew Student-t-normal distributions, which is a novel model-based tool for clustering heterogeneous (multiple groups) data in the presence of skewed and heavy-tailed outcomes. We present two analytically simple EM-type algorithms for iteratively computing the maximum likelihood estimates. The observed information matrix is derived for obtaining the asymptotic standard errors of parameter estimates. A small simulation study is conducted to demonstrate the superiority of the skew Student-t-normal distribution compared to the skew t distribution. The proposed methodology is particularly useful for analyzing multimodal asymmetric data as produced by major biotechnological platforms like flow cytometry. We provide such an application with the help of an illustrative example.

Maximum likelihood estimation under constraints in the Wishart class of distributions is considered. It provides a unified approach to estimation in a variety of problems concerning covariance matrices. Virtually all covariance structures can be translated to constraints on the covariances. This includes covariance matrices with given structure such as linearly patterned covariance matrices, covariance matrices with zeros, independent covariance matrices and structurally dependent covariance matrices. The methodology followed in this paper provides a useful and simple approach to directly obtain the exact maximum likelihood estimates. These maximum likelihood estimates are obtained via an estimation procedure for the exponential class using constraints.

When functional data are not homogeneous, for example, when there are multiple classes of functional curves in the dataset, traditional estimation methods may fail. In this article, we propose a new estimation procedure for the mixture of Gaussian processes, to incorporate both functional and inhomogeneous properties of the data. Our method can be viewed as a natural extension of high-dimensional normal mixtures. However, the key difference is that smoothed structures are imposed for both the mean and covariance functions. The model is shown to be identifiable, and can be estimated efficiently by a combination of ideas from the expectation-maximization (EM) algorithm, kernel regression, and functional principal component analysis. Our methodology is empirically justified by Monte Carlo simulations and illustrated by an analysis of a supermarket dataset.

This article investigates maximum a-posteriori (MAP) estimation of autoregressive model parameters when the innovations (errors) follow a finite mixture of distributions that, in turn, are scale-mixtures of skew-normal distributions (SMSN), an attractive and extremely flexible family of probability distributions. The proposed model can fit different types of data which can be associated with different noise levels, and provides robust modelling with great flexibility to accommodate skewness, heavy tails, multimodality and stationarity simultaneously. Also, the existence of convenient hierarchical representations of the SMSN random variables allows us to develop an EM-type algorithm to compute the MAP estimates. A comprehensive simulation study is then conducted to illustrate the superior performance of the proposed method. The new methodology is also applied to annual barley yields data.

This article considers the maximum likelihood estimation (MLE) of a class of stationary and invertible vector autoregressive fractionally integrated moving-average (VARFIMA) processes considered in Equation (26) of Luceño [A fast likelihood approximation for vector general linear processes with long series: Application to fractional differencing, Biometrika 83 (1996), pp. 603–614] or Model A of Lobato [Consistency of the averaged cross-periodogram in long memory series, J. Time Ser. Anal. 18 (1997), pp. 137–155] where each component y_{i,t} is a fractionally integrated process of order d_i, i=1, …, r. Under the conditions outlined in Assumption 1 of this article, the conditional likelihood function of this class of VARFIMA models can be efficiently and exactly calculated with a conditional likelihood Durbin–Levinson (CLDL) algorithm proposed herein. This CLDL algorithm is based on the multivariate Durbin–Levinson algorithm of Whittle [On the fitting of multivariate autoregressions and the approximate canonical factorization of a spectral density matrix, Biometrika 50 (1963), pp. 129–134] and the conditional likelihood principle of Box and Jenkins [Time Series Analysis, Forecasting, and Control, 2nd ed., Holden-Day, San Francisco, CA]. Furthermore, the conditions in the aforementioned Assumption 1 are general enough to include the model considered in Andersen et al. [Modeling and forecasting realized volatility, Econometrica 71 (2003), pp. 579–625] for describing the behaviour of realized volatility and the model studied in Haslett and Raftery [Space–time modelling with long-memory dependence: Assessing Ireland's wind power resource, Appl. Statist. 38 (1989), pp. 1–50] for spatial data as its special cases. 
As the computational cost of implementing the CLDL algorithm is much lower than that of using the algorithms proposed in Sowell [Maximum likelihood estimation of fractionally integrated time series models, Working paper, Carnegie-Mellon University], we are thus able to conduct a Monte Carlo experiment to investigate the finite sample performance of the CLDL algorithm for 3-dimensional VARFIMA processes with a sample size of 400. The simulation results are very satisfactory and reveal the great potential of the CLDL method for empirical applications.
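
For orientation, the classical univariate Durbin–Levinson recursion, on which multivariate and conditional variants build, can be sketched as below. This is not the CLDL algorithm of the article: it is the textbook scalar version that evaluates the exact Gaussian log-likelihood of a zero-mean stationary series from its autocovariances, and the function name and argument layout are hypothetical.

```python
import math

def durbin_levinson_loglik(x, acov):
    """Exact Gaussian log-likelihood of a zero-mean stationary series x,
    given autocovariances acov[0], ..., acov[len(x) - 1]."""
    n = len(x)
    phi = []       # coefficients of the best linear predictor from the past
    v = acov[0]    # one-step prediction error variance
    loglik = 0.0
    for t in range(n):
        # Predict x[t] from x[t-1], ..., x[0]; the prediction is 0 when t == 0.
        pred = sum(phi[j] * x[t - 1 - j] for j in range(t))
        err = x[t] - pred
        loglik -= 0.5 * (math.log(2.0 * math.pi * v) + err * err / v)
        k = t + 1
        if k < n:
            # Partial autocorrelation at lag k, then the order-k coefficients.
            num = acov[k] - sum(phi[j] * acov[k - 1 - j] for j in range(k - 1))
            pacf = num / v
            phi = [phi[j] - pacf * phi[k - 2 - j] for j in range(k - 1)] + [pacf]
            v *= 1.0 - pacf * pacf
    return loglik
```

The recursion needs on the order of n² operations instead of the n³ required to factor the full covariance matrix, which is the kind of saving that makes repeated likelihood evaluation in a Monte Carlo study practical.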

This paper deals with the maximum likelihood estimation of parameters when the sample (x1, …, xn) may have k spuriously generated observations from another distribution, say G≠F, where F is the distribution of the target population. If G is stochastically larger than F, then these k observations may give rise to k extreme observations or ‘outliers’. This situation is often described by a so-called ‘k-outlier model’ in which, in addition to the parameters involved in F and G, the set ν={ν1,…,νk} of indices for which xνj, j=1,…,k, come from G is also unknown.

The paper shows that many estimation methods, including ML, moments, even-points, empirical c.f. and minimum chi-square, can be regarded as scoring procedures using weighted sums of the discrepancies between observed and expected frequencies. The nature of the weights is investigated for many classes of distributions; the study of approximations to the weights clarifies the relationships between estimation methods, and also leads to useful formulae for initial values for ML iteration.

The estimation of the parameter of a mixed model analysis of variance by maximum likelihood methods is discussed. The functional iteration method is studied and found to have good computational properties. The estimates are studied via Monte Carlo techniques and their small sample properties are observed; it is found that the MLEs may be biased but that they have good mean square error properties.

Unobservable individual effects in models of duration cause estimation bias that affects the structural parameters as well as the duration dependence. The maximum penalized likelihood estimator is examined as an estimator for the survivor model with heterogeneity. Proofs of the existence and uniqueness of the maximum penalized likelihood estimator in duration models with general forms of unobserved heterogeneity are provided. Some small sample evidence on the behavior of the maximum penalized likelihood estimator is given. The maximum penalized likelihood estimator is shown to be computationally feasible and to provide reasonable estimates in most cases.

In this paper we study the interaction between the estimation of the fractional differencing parameter d of ARFIMA models and the common practice of instantaneous transformation of the observed time series. To this end, we first discuss the effect of a nonlinear transformation of the data on the identification of the process and on the estimate of d. We then propose a joint estimation of the Box-Cox parameter and d by means of a modified normalized version of the Whittle likelihood. Next, the variance-covariance matrix of the parameter estimates is obtained. Finally, a Monte Carlo study is performed in order to check the behaviour of the proposed estimators in finite samples. The paper is the result of joint research by the two authors. In this version of the work, A. D'Elia wrote Sects. 2, 3, 4, while D. Piccolo wrote Sects. 1, 5, 6.

Mixtures of factor analyzers (MFAs) have been widely used to cluster high-dimensional data. However, the traditional estimation method is based on normality assumptions for the random terms and is thus sensitive to outliers. In this article, we introduce a robust estimation procedure for MFAs using the trimmed likelihood estimator. We use a simulation study and a real data application to demonstrate the robustness of the trimmed estimation procedure and compare it with the traditional normality-based maximum likelihood estimate.

Nonparametric maximum likelihood estimation of bivariate survival probabilities is developed for interval censored survival data. We restrict our attention to the situation where response times within pairs are not distinguishable, and the univariate survival distribution is the same for any individual within any pair. Campbell's (1981) model is modified to incorporate this restriction. Existence and uniqueness of maximum likelihood estimators are discussed. This methodology is illustrated with a bivariate life table analysis of an angioplasty study where each patient undergoes two procedures.

The estimation of variance components of a heteroscedastic random model is discussed in this paper. Maximum likelihood (ML) estimation is described for one-way heteroscedastic random models. The proportionality condition, that cell variance is proportional to the cell sample size, is used to eliminate the effect of heteroscedasticity. The algebraic expressions of the estimators are obtained for the model. It is seen that the algebraic expressions of the estimators depend mainly on the inverse of the variance-covariance matrix of the observation vector. So, the variance-covariance matrix is obtained and the formulae for the inversions are given. A Monte Carlo study is conducted. Five different variance patterns with different numbers of cells are considered in this study. For each variance pattern, 1000 Monte Carlo samples are drawn. Then the Monte Carlo biases and Monte Carlo MSEs of the estimators of variance components are calculated. With respect to both bias and MSE, the ML estimators of variance components are found to be sufficiently good.

Randomized response techniques are designed to obtain usable data on sensitive issues while protecting the privacy of individuals. In this paper, based on repeating the randomized response technique, a new technique called repeated randomized response is introduced to increase both the protection of privacy and the efficiency of the estimator for the proportion of a sensitive attribute. By using this technique, the proportion of academic cheating is estimated among students of Shahid Chamran University of Ahvaz, Ahvaz, Iran.
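
As background, the classic single-round design of Warner can be sketched as follows. This is not the repeated technique introduced in the paper, and the function name and arguments are hypothetical. Each respondent answers the sensitive question with known probability p and its complement otherwise, so the probability of a "yes" is λ = pπ + (1 − p)(1 − π), which can be inverted for the sensitive proportion π.

```python
def warner_estimate(yes_count, n, p):
    """Warner's estimator of a sensitive proportion and its plug-in variance.

    yes_count : number of 'yes' answers among n respondents
    p         : design probability of being asked the sensitive question
                (must differ from 0.5, otherwise pi is not identifiable)
    """
    if abs(2.0 * p - 1.0) < 1e-12:
        raise ValueError("p must differ from 0.5")
    lam = yes_count / n
    pi_hat = (lam - (1.0 - p)) / (2.0 * p - 1.0)
    var_hat = lam * (1.0 - lam) / (n * (2.0 * p - 1.0) ** 2)
    return pi_hat, var_hat
```

The variance grows quickly as p approaches 0.5, which is the privacy and efficiency trade-off that refinements of the technique try to improve.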

The maximum likelihood estimator (MLE) for the survival function S_T under the proportional hazards model of censorship is derived and shown to differ from the Abdushukurov-Cheng-Lin estimator when the class of allowable distributions includes all continuous and discrete distributions. The estimators are compared via an example. The MLE is calculated using a Newton-Raphson iterative procedure and implemented via a FORTRAN algorithm.
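
For comparison, the Abdushukurov-Cheng-Lin estimator, as it is usually stated, raises the empirical survival function of the observed times to the power of the uncensored fraction. A minimal sketch, with a hypothetical function name and without the paper's MLE or its Newton-Raphson iteration:

```python
def acl_survival(times, events, t):
    """Abdushukurov-Cheng-Lin estimate of S(t) in its commonly quoted form.

    times  : observed times, each the minimum of lifetime and censoring time
    events : 1 if the lifetime was observed, 0 if censored
    """
    n = len(times)
    uncensored_fraction = sum(1 for d in events if d) / n
    surviving_fraction = sum(1 for z in times if z > t) / n
    return surviving_fraction ** uncensored_fraction
```

The form follows from the proportional hazards model of censorship, under which the survival function of the observed times is a power of the lifetime survival function.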

Maximum likelihood estimation of a mean and a covariance matrix whose structure is constrained only to general positive semi-definiteness is treated in this paper. Necessary and sufficient conditions for the local optimality of mean and covariance matrix estimates are given. Observations are assumed to be independent. When the observations are also assumed to be identically distributed, the optimality conditions are used to obtain the mean and covariance matrix solutions in closed form. For the nonidentically distributed observation case, a general numerical technique which integrates scoring and Newton's iterations to solve the optimality condition equations is presented, and convergence performance is examined.
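
For the identically distributed case, the familiar closed form can be sketched as below. The sketch assumes multivariate normal observations, which the abstract does not state explicitly, and the function name is hypothetical: the estimates are the sample mean and the scatter matrix divided by n rather than n − 1.

```python
def gaussian_mle(rows):
    """Closed-form ML estimates of mean and covariance for i.i.d. rows,
    assuming a multivariate normal model (an assumption of this sketch)."""
    n = len(rows)
    d = len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(d)]
    # Divide by n, not n - 1: this is the ML estimate, not the unbiased one.
    cov = [[sum((r[i] - mean[i]) * (r[j] - mean[j]) for r in rows) / n
            for j in range(d)]
           for i in range(d)]
    return mean, cov
```

With fewer observations than dimensions the resulting matrix is singular, so the estimate is only positive semi-definite rather than positive definite.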

