期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A generalization of the Solis-Wets method

Miguel de Carvalho 《Journal of statistical planning and inference》2012,142(3):633-644

In this paper we focus on the application of global stochastic optimization methods to extremum estimators. We propose a general stochastic method—the master method—which includes several stochastic optimization algorithms as a particular case. The proposed method is sufficiently general to include the Solis-Wets method, the improving hit-and-run algorithm, and a stochastic version of the zigzag algorithm. A matrix formulation of the master method is presented and some specific results are given for the stochastic zigzag algorithm. Convergence of the proposed method is established under a mild set of conditions, and a simple regression model is used to illustrate the method. 相似文献

2.

An improved SAEM algorithm for maximum likelihood estimation in mixtures of non linear mixed effects models

Marc Lavielle Cyprien Mbogning 《Statistics and Computing》2014,24(5):693-707

We propose a new methodology for maximum likelihood estimation in mixtures of non linear mixed effects models (NLMEM). Such mixtures of models include mixtures of distributions, mixtures of structural models and mixtures of residual error models. Since the individual parameters inside the NLMEM are not observed, we propose to combine the EM algorithm usually used for mixtures models when the mixture structure concerns an observed variable, with the Stochastic Approximation EM (SAEM) algorithm, which is known to be suitable for maximum likelihood estimation in NLMEM and also has nice theoretical properties. The main advantage of this hybrid procedure is to avoid a simulation step of unknown group labels required by a “full” version of SAEM. The resulting MSAEM (Mixture SAEM) algorithm is now implemented in the Monolix software. Several criteria for classification of subjects and estimation of individual parameters are also proposed. Numerical experiments on simulated data show that MSAEM performs well in a general framework of mixtures of NLMEM. Indeed, MSAEM provides an estimator close to the maximum likelihood estimator in very few iterations and is robust with regard to initialization. An application to pharmacokinetic (PK) data demonstrates the potential of the method for practical applications. 相似文献

3.

A- and D-optimal designs for a log contrast model for experiments with mixtures

Ling-Yau Chan Ying-Nan Guan 《Journal of applied statistics》2001,28(5):537-546

A- and D-optimal designs are investigated for a log contrast model suggested by Aitchison & Bacon-Shone for experiments with mixtures. It is proved that when the number of mixture components q is an even integer, A- and D-optimal designs are identical; and when q is an odd integer, A- and D-optimal designs are different, but they share some common support points and are very close to each other in efficiency. Optimal designs with a minimum number of support points are also constructed for 3, 4, 5 and 6 mixture components. 相似文献

4.

Design and analysis of mixture experiments with process variable

Upendra Kumar Pradhan Krishan Lal Sukanta Dash K. N. Singh 《统计学通讯:理论与方法》2017,46(1):259-270

A mixture experiment is an experiment in which the response is assumed to depend on the relative proportions of the ingredients present in the mixture and not on the total amount of the mixture. In such experiment process, variables do not form any portion of the mixture but the levels changed could affect the blending properties of the ingredients. Sometimes, the mixture experiments are costly and the experiments are to be conducted in less number of runs. Here, a general method for construction of efficient mixture experiments in a minimum number of runs by the method for projection of efficient response surface design onto the constrained region is obtained. The efficient designs with a less number of runs have been constructed for 3rd, 4th, and 5th component of mixture experiments with one process variable. 相似文献

5.

Bayesian temporal density estimation with autoregressive species sampling models

Youngin Jo Seongil Jo Yung-Seop Lee Jaeyong Lee 《Journal of the Korean Statistical Society》2018,47(3):248-262

We propose a novel Bayesian nonparametric (BNP) model, which is built on a class of species sampling models, for estimating density functions of temporal data. In particular, we introduce species sampling mixture models with temporal dependence. To accommodate temporal dependence, we define dependent species sampling models by modeling random support points and weights through an autoregressive model, and then we construct the mixture models based on the collection of these dependent species sampling models. We propose an algorithm to generate posterior samples and present simulation studies to compare the performance of the proposed models with competitors that are based on Dirichlet process mixture models. We apply our method to the estimation of densities for the price of apartment in Seoul, the closing price in Korea Composite Stock Price Index (KOSPI), and climate variables (daily maximum temperature and precipitation) of around the Korean peninsula. 相似文献

6.

The specification of vector autoregressive moving average models

《Journal of Statistical Computation and Simulation》2012,82(8):547-565

In this paper we propose a new identification method based on the residual white noise autoregressive criterion (Pukkila et al., 1990) to select the order of VARMA structures. Results from extensive simulation experiments based on different model structures with varying number of observations and number of component series are used to demonstrate the performance of this new procedure. We also use economic and business data to compare the model structures selected by this order selection method with those identified in other published studies. 相似文献

7.

Comparing and generating Latin Hypercube designs in Kriging models

Giovanni Pistone Grazia Vicario 《AStA Advances in Statistical Analysis》2010,94(4):353-366

In Computer Experiments (CE), a careful selection of the design points is essential for predicting the system response at untried points, based on the values observed at tried points. In physical experiments, the protocol is based on Design of Experiments, a methodology whose basic principles are questioned in CE. When the responses of a CE are modeled as jointly Gaussian random variables with their covariance depending on the distance between points, the use of the so called space-filling designs (random designs, stratified designs and Latin Hypercube designs) is a common choice, because it is expected that the nearer the untried point is to the design points, the better is the prediction. In this paper we focus on the class of Latin Hypercube (LH) designs. The behavior of various LH designs is examined according to the Gaussian assumption with exponential correlation, in order to minimize the total prediction error at the points of a regular lattice. In such a special case, the problem is reduced to an algebraic statistical model, which is solved using both symbolic algebraic software and statistical software. We provide closed-form computation of the variance of the Gaussian linear predictor as a function of the design, in order to make a comparison between LH designs. In principle, the method applies to any number of factors and any number of levels, and also to classes of designs other than LHs. In our current implementation, the applicability is limited by the high computational complexity of the algorithms involved. 相似文献

8.

Unsupervised learning of regression mixture models with unknown number of components

《Journal of Statistical Computation and Simulation》2012,82(12):2308-2334

ABSTRACT

We propose a new unsupervised learning algorithm to fit regression mixture models with unknown number of components. The developed approach consists in a penalized maximum likelihood estimation carried out by a robust expectation–maximization (EM)-like algorithm. We derive it for polynomial, spline, and B-spline regression mixtures. The proposed learning approach is unsupervised: (i) it simultaneously infers the model parameters and the optimal number of the regression mixture components from the data as the learning proceeds, rather than in a two-fold scheme as in standard model-based clustering using afterward model selection criteria, and (ii) it does not require accurate initialization unlike the standard EM for regression mixtures. The developed approach is applied to curve clustering problems. Numerical experiments on simulated and real data show that the proposed algorithm performs well and provides accurate clustering results, and confirm its benefit for practical applications. 相似文献

9.

A factor model approach for the joint segmentation with between‐series correlation

Xavier Collilieux Emilie Lebarbier Stphane Robin 《Scandinavian Journal of Statistics》2019,46(3):686-705

We consider the detection of changes in the mean of a set of time series. The breakpoints are allowed to be series specific, and the series are assumed to be correlated. The correlation between the series is supposed to be constant along time but is allowed to take an arbitrary form. We show that such a dependence structure can be encoded in a factor model. Thanks to this representation, the inference of the breakpoints can be achieved via dynamic programming, which remains one the most efficient algorithms. We propose a model selection procedure to determine both the number of breakpoints and the number of factors. This proposed method is implemented in the FASeg R package, which is available on the CRAN. We demonstrate the performances of our procedure through simulation experiments and present an application to geodesic data. 相似文献

10.

Hybrid Dirichlet mixture models for functional data

Sonia Petrone Michele Guindani Alan E. Gelfand 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2009,71(4):755-782

Summary. In functional data analysis, curves or surfaces are observed, up to measurement error, at a finite set of locations, for, say, a sample of n individuals. Often, the curves are homogeneous, except perhaps for individual-specific regions that provide heterogeneous behaviour (e.g. 'damaged' areas of irregular shape on an otherwise smooth surface). Motivated by applications with functional data of this nature, we propose a Bayesian mixture model, with the aim of dimension reduction, by representing the sample of n curves through a smaller set of canonical curves. We propose a novel prior on the space of probability measures for a random curve which extends the popular Dirichlet priors by allowing local clustering: non-homogeneous portions of a curve can be allocated to different clusters and the n individual curves can be represented as recombinations (hybrids) of a few canonical curves. More precisely, the prior proposed envisions a conceptual hidden factor with k -levels that acts locally on each curve. We discuss several models incorporating this prior and illustrate its performance with simulated and real data sets. We examine theoretical properties of the proposed finite hybrid Dirichlet mixtures, specifically, their behaviour as the number of the mixture components goes to ∞ and their connection with Dirichlet process mixtures. 相似文献

11.

Bank Business Models at Zero Interest Rates

André Lucas Julia Schaumburg Bernd Schwaab 《商业与经济统计学杂志》2013,31(3):542-555

We propose a novel observation-driven finite mixture model for the study of banking data. The model accommodates time-varying component means and covariance matrices, normal and Student’s t distributed mixtures, and economic determinants of time-varying parameters. Monte Carlo experiments suggest that units of interest can be classified reliably into distinct components in a variety of settings. In an empirical study of 208 European banks between 2008Q1–2015Q4, we identify six business model components and discuss how their properties evolve over time. Changes in the yield curve predict changes in average business model characteristics. 相似文献

12.

Design considerations for small experiments and simple logistic regression

《Journal of Statistical Computation and Simulation》2012,82(1):81-91

Inference for a generalized linear model is generally performed using asymptotic approximations for the bias and the covariance matrix of the parameter estimators. For small experiments, these approximations can be poor and result in estimators with considerable bias. We investigate the properties of designs for small experiments when the response is described by a simple logistic regression model and parameter estimators are to be obtained by the maximum penalized likelihood method of Firth [Firth, D., 1993, Bias reduction of maximum likelihood estimates. Biometrika, 80, 27–38]. Although this method achieves a reduction in bias, we illustrate that the remaining bias may be substantial for small experiments, and propose minimization of the integrated mean square error, based on Firth's estimates, as a suitable criterion for design selection. This approach is used to find locally optimal designs for two support points. 相似文献

13.

Orthogonally Blocked Mixture Experiments in Ellipsoidal Restricted Regions

Philip Prescott 《统计学通讯:理论与方法》2013,42(5):763-784

In practical situations involving mixtures formed from several ingredients, interest is sometimes centered on the response in an ellipsoidal neighborhood around a standard formulation. We show that standard, orthogonally blocked, response surface designs, defined on a q ? 1 dimensional unit sphere, may be transformed into similarly orthogonally blocked q-ingredient mixture designs defined within an ellipsoid centered at the standard formulation. The method is illustrated using several examples of mixture experiments with three, four, and five ingredients, arranged in two, three, or four orthogonal blocks, obtained by projecting standard central composite designs and Box–Behnken designs into the ellipsoidal mixture region. Rotations of the resulting designs within the ellipsoidal regions are also considered. 相似文献

14.

Discrepancy for uniform design of experiments with mixtures

Jian-Hui Ning Yong-Dao Zhou Kai-Tai Fang 《Journal of statistical planning and inference》2011,141(4):1487-1496

The uniform design is a kind of space filling design that is robust against the model specification. The uniform design has been widely applied to experiments with mixtures. In this paper, we propose a new discrepancy DM₂-discrepancy as a new criterion to measure the uniformity of designs with mixtures. A computational formula of the new discrepancy, by the functional method, is also given. This property overcome the main disadvantage of the discrepancies proposed before. 相似文献

15.

Nonparametric boundary detection

Jianbin Chen Igor Zurbenko 《统计学通讯:理论与方法》2013,42(12):2999-3014

We propose a two-step nonparametric method for detecting the boundary curve of an object in an image. First we treat boundary points as change-points on lines across the image, and identify them by the one-sided kernel smoothing method. After obtaining potential boundary points, we use the principal curve method to smooth these points in order to obtain an estimate of smooth boundary curve, Computer simulations are provided to illustrate the effectiveness of the method. 相似文献

16.

Nearly optimal orthogonally blocked desings for a quadratic mixture model with q components

Philip Prescott 《统计学通讯:理论与方法》2013,42(10):2559-2580

In experiments with mixtures involving process variables, orthogonal block designs may be used to allow estimation of the parameters of the mixture components independently of estimation of the parameters of the process variables. In the class of orthogonally blocked designs based on pairs of suitably chosen Latin squares, the optimal designs consist primarily of binary blends of the mixture components, regardless of how many ingredients are available for the mixture. This paper considers ways of modifying these optimal designs so that some or all of the runs used in the experiment include a minimum proportion of each mixture ingredient. The designs considered are nearly optimal in the sense that the experimental points are chosen to follow ridges of maxima in the optimality criteria. Specific designs are discussed for mixtures involving three and four components and distinctions are identified for different designs with the same optimality properties. The ideas presented for these specific designs are readily extended to mixtures with q>4 components. 相似文献

17.

A Class of Multidimensional Latent Class IRT Models for Ordinal Polytomous Item Responses

Silvia Bacci Francesco Bartolucci Michela Gnaldi 《统计学通讯:理论与方法》2014,43(4):787-800

We propose a class of multidimensional Item Response Theory models for polytomously-scored items with ordinal response categories. This class extends an existing class of multidimensional models for dichotomously-scored items in which the latent abilities are represented by a random vector assumed to have a discrete distribution, with support points corresponding to different latent classes in the population. In the proposed approach, we allow for different parameterizations for the conditional distribution of the response variables given the latent traits, which depend on the type of link function and the constraints imposed on the item parameters. Moreover, we suggest a strategy for model selection that is based on a series of steps consisting of selecting specific features, such as the dimension of the model (number of latent traits), the number of latent classes, and the specific parameterization. In order to illustrate the proposed approach, we analyze a dataset from a study on anxiety and depression on a sample of oncological patients. 相似文献

18.

Mixtures of Gaussian copula factor analyzers for clustering high dimensional data

《Journal of the Korean Statistical Society》2019,48(3):480-492

Mixtures of factor analyzers is a useful model-based clustering method which can avoid the curse of dimensionality in high-dimensional clustering. However, this approach is sensitive to both diverse non-normalities of marginal variables and outliers, which are commonly observed in multivariate experiments. We propose mixtures of Gaussian copula factor analyzers (MGCFA) for clustering high-dimensional clustering. This model has two advantages; (1) it allows different marginal distributions to facilitate fitting flexibility of the mixture model, (2) it can avoid the curse of dimensionality by embedding the factor-analytic structure in the component-correlation matrices of the mixture distribution.An EM algorithm is developed for the fitting of MGCFA. The proposed method is free of the curse of dimensionality and allows any parametric marginal distribution which fits best to the data. It is applied to both synthetic data and a microarray gene expression data for clustering and shows its better performance over several existing methods. 相似文献

19.

Optimal multi-criteria designs for Fourier regression models

《Journal of statistical planning and inference》2001,96(2):387-401

Riccomagno, Schwabe and Wynn (RSW) (1997) have given a necessary and sufficient condition for obtaining a complete Fourier regression model with a design based on lattice points that is D-optimal. However, in practice, the number of factors to be considered may be large, or the experimental data may be restricted or not homogeneous. To address these difficulties we extend the results of RSW to obtain a sufficient condition for an incomplete interaction Fourier model design based on lattice points that is D-, A-, E- and G-optimal. We also propose an algorithm for finding such optimal designs that requires fewer design points than those obtained using RSW's generators when the underlying model is a complete interaction model. 相似文献

20.

Semiparametric Estimation of a Two-component Mixture Model where One Component is known 总被引：1，自引：0，他引：1

LAURENT BORDES CÉLINE DELMAS PIERRE VANDEKERKHOVE 《Scandinavian Journal of Statistics》2006,33(4):733-752

Abstract. We consider a two-component mixture model where one component distribution is known while the mixing proportion and the other component distribution are unknown. These kinds of models were first introduced in biology to study the differences in expression between genes. The various estimation methods proposed till now have all assumed that the unknown distribution belongs to a parametric family. In this paper, we show how this assumption can be relaxed. First, we note that generally the above model is not identifiable, but we show that under moment and symmetry conditions some 'almost everywhere' identifiability results can be obtained. Where such identifiability conditions are fulfilled we propose an estimation method for the unknown parameters which is shown to be strongly consistent under mild conditions. We discuss applications of our method to microarray data analysis and to the training data problem. We compare our method to the parametric approach using simulated data and, finally, we apply our method to real data from microarray experiments. 相似文献