Similar Documents (20 results)
1.
A model for directional data in q dimensions is studied. The data are assumed to arise from a distribution with a density on a sphere of q - 1 dimensions. The density is unimodal and rotationally symmetric, but otherwise of unknown form. The posterior distribution of the unknown mode (mean direction) is derived, and small-sample posterior inference is discussed. The posterior mean of the density is also given. A numerical method for evaluating posterior quantities based on sampling a Markov chain is introduced. This method is generally applicable to problems involving unknown monotone functions.

2.
It is shown that the exact null distribution of the likelihood ratio criterion for the sphericity test in the p-variate normal case and the marginal distribution of the first component of a (p - 1)-variate generalized Dirichlet model with a given set of parameters are identical. The exact distribution of the likelihood ratio criterion so obtained has a general form for every p. A novel idea is introduced here through which the complicated exact null distribution of the sphericity test criterion in multivariate statistical analysis is converted into an easily tractable marginal density in a generalized Dirichlet model. This provides a direct and simple method for computing p-values. The computation of p-values and a table of critical points corresponding to p = 3 and 4 are also presented.
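As a concrete illustration of the criterion under study, the following Python sketch computes the classical likelihood ratio criterion for sphericity, W = |S| / (tr(S)/p)^p, from a data matrix; the paper's contribution, the exact null distribution via the generalized Dirichlet identity, is not reproduced here, and the data are simulated.

```python
import numpy as np

def sphericity_criterion(X):
    """Likelihood ratio criterion W = |S| / (tr(S)/p)**p for H0: Sigma = sigma^2 I.

    X is an (n, p) data matrix assumed multivariate normal; W lies in (0, 1],
    with values near 1 supporting sphericity.
    """
    n, p = X.shape
    S = np.cov(X, rowvar=False)                       # p x p sample covariance
    return np.linalg.det(S) / (np.trace(S) / p) ** p

rng = np.random.default_rng(0)
print(sphericity_criterion(rng.normal(size=(200, 3))))   # spherical data: W near 1
```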

3.
When the area under the ROC curve is used to assess the accuracy of a diagnostic test, it is important to detect and locate multiple abnormalities per image. The approach presented here accounts for this by adopting a statistical model that allows for correlation between the reader scores of several regions of interest (ROIs).

The ROI method of partitioning the image is taken. The readers give a score to each ROI in the image, and the statistical model accounts for the correlation between the scores of the ROIs of an image when estimating test accuracy. The test accuracy is given by Pr[Y > Z] + (1/2)Pr[Y = Z], where Y is an ordinal diagnostic measurement of an affected ROI and Z is the diagnostic measurement of an unaffected ROI. This measure of test accuracy is equivalent to the area under the ROC curve. The parameters are those of a multinomial distribution, and a Bayesian method of inference based on that distribution is adopted for estimating the test accuracy.

Using a multinomial model for the test results, a Bayesian method based on the predictive distribution of future diagnostic scores is employed to find the test accuracy. By resampling from the posterior distribution of the model parameters, samples from the posterior distribution of test accuracy are also generated. Using these samples, the posterior mean, standard deviation, and credible intervals are calculated in order to estimate the area under the ROC curve. This approach is illustrated by estimating the area under the ROC curve for a study of the diagnostic accuracy of magnetic resonance angiography for diagnosis of arterial atherosclerotic stenosis. A generalization to multiple readers and/or modalities is proposed.

A Bayesian approach to estimating test accuracy is easy to perform with standard software packages and has the advantage of efficiently incorporating information from prior related imaging studies.
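The accuracy measure Pr[Y > Z] + (1/2)Pr[Y = Z] and the posterior sampling step can be sketched in a few lines of Python. The ROI counts below are hypothetical, a uniform Dirichlet prior is assumed, and the correlation between ROIs of the same image, which the paper models explicitly, is ignored in this sketch.

```python
import numpy as np

rng = np.random.default_rng(1)
# hypothetical counts of ordinal scores 1..5 for affected (y) and unaffected (z) ROIs
n_y = np.array([2, 3, 10, 20, 25])
n_z = np.array([30, 15, 10, 4, 1])

draws = []
for _ in range(2000):
    p_y = rng.dirichlet(n_y + 1)                    # posterior draw of multinomial cells
    p_z = rng.dirichlet(n_z + 1)
    gt = np.sum(np.tril(np.outer(p_y, p_z), -1))    # Pr[Y > Z]: Y's score exceeds Z's
    eq = np.sum(p_y * p_z)                          # Pr[Y = Z]: ties
    draws.append(gt + 0.5 * eq)

draws = np.array(draws)
# posterior mean, standard deviation, and 95% credible interval for the AUC
print(draws.mean(), draws.std(), np.quantile(draws, [0.025, 0.975]))
```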

4.
A key question for understanding the cross-section of expected equity returns is the following: which factors, from a given collection of factors, are risk factors; equivalently, which factors are in the stochastic discount factor (SDF)? Though the SDF is unobserved, assumptions about which factors (from the available set) are in the SDF restrict the joint distribution of the factors in specific ways, as a consequence of the economic theory of asset pricing. A different starting collection of factors in the SDF leads to a different set of restrictions on the joint distribution of factors. The conditional distribution of equity returns has the same restricted form regardless of what is assumed about the factors in the SDF, as long as the factors are traded; hence the distribution of asset returns is irrelevant for isolating the risk factors. The restricted factor models are distinct (nonnested) and do not arise by omitting or including a variable from a full model, thus precluding analysis by standard statistical variable-selection methods, such as those based on the lasso and its variants. Instead, we develop what we call a Bayesian model scan strategy in which each factor is allowed to enter or not enter the SDF and the resulting restricted models (of which there are 114,674 in our empirical study) are simultaneously confronted with the data. We use a Student-t distribution for the factors, model-specific independent Student-t distributions for the location parameters, a training sample to fix prior locations, and a creative way to arrive at the joint distribution of several other model-specific parameters from a single prior distribution. This allows our method to be essentially a scalable, tuned, black-box method that can be applied across our large model space with little to no user intervention. The model marginal likelihoods, and the implied posterior model probabilities, are compared with the prior probability of 1/114,674 for each model to find the best-supported model, and thus the factors most likely to be in the SDF. We provide detailed simulation evidence of the high finite-sample accuracy of the method. Our empirical study with 13 leading factors reveals that the highest marginal likelihood model is a Student-t distributed factor model with 5 degrees of freedom and 8 risk factors.
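The mechanics of a model scan, enumerating every inclusion/exclusion pattern, computing a marginal likelihood for each restricted model, and converting these to posterior model probabilities under a uniform prior, can be illustrated with a toy Gaussian stand-in. Everything here is a simplification: five factors rather than 13, a factor's mean set to zero when it is excluded, and conjugate normal marginal likelihoods instead of the paper's Student-t specification.

```python
import itertools
import numpy as np

def log_marglik(x, in_sdf):
    """Toy per-factor log marginal likelihood: mean fixed at 0 when the
    factor is excluded; N(0, 1) prior on the mean when included. Unit
    variance throughout -- a placeholder, not the paper's model."""
    n = len(x)
    if not in_sdf:
        return -0.5 * n * np.log(2 * np.pi) - 0.5 * np.sum(x ** 2)
    # integrate the N(mu, 1) likelihood against the N(0, 1) prior on mu
    s, xbar = np.sum(x ** 2), np.mean(x)
    v_post = 1.0 / (n + 1.0)
    return (-0.5 * n * np.log(2 * np.pi) + 0.5 * np.log(v_post)
            - 0.5 * s + 0.5 * v_post * (n * xbar) ** 2)

rng = np.random.default_rng(2)
K = 5                                        # 5 toy factors (the paper uses 13)
true = [True, True, False, False, False]     # which factors truly carry a premium
F = rng.normal(loc=[0.3 * t for t in true], size=(600, K))

models = list(itertools.product([False, True], repeat=K))   # 2**K candidate models
logml = np.array([sum(log_marglik(F[:, k], m[k]) for k in range(K)) for m in models])
post = np.exp(logml - logml.max())
post /= post.sum()                           # posterior probs under a uniform model prior
print(models[int(post.argmax())], post.max())
```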

5.
Large-scale simultaneous hypothesis testing appears in many areas. A well-known inference method is to control the false discovery rate. One popular approach is to model the z-scores derived from the individual t-tests and then use this model to control the false discovery rate. We propose a heteroscedastic contaminated normal mixture to describe the distribution of z-scores and design an EM-test for testing homogeneity in this class of mixture models. The proposed EM-test can be used to investigate whether a collection of z-scores has arisen from a single normal distribution or whether a heteroscedastic contaminated normal mixture is more appropriate. We show that the EM-test statistic has a shifted mixture of chi-squared distributions as its limiting distribution. Simulation results show that the proposed testing procedure has accurate type-I error rates and significantly larger power than its competitors under a variety of model specifications. A real-data example is analysed to exemplify the application of the proposed method.
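A plain EM fit of a two-component contaminated normal mixture to z-scores, together with a crude likelihood ratio against the homogeneous normal fit, conveys the flavor of the problem. This is only a sketch: the EM-test of the paper adds penalty terms and a specific iteration scheme that are not reproduced here, and the simulated data are hypothetical.

```python
import numpy as np
from scipy.stats import norm

def fit_contaminated_normal(z, n_iter=200):
    """EM fit of (1 - pi) * N(mu, s1^2) + pi * N(mu, s2^2) to z-scores
    (shared mean, different variances), plus 2 * (loglik gain) over a
    single normal fit. A sketch, not the paper's penalized EM-test."""
    z = np.asarray(z, float)
    pi_, mu, s1, s2 = 0.1, z.mean(), z.std(), 3.0 * z.std()
    for _ in range(n_iter):
        d1 = (1 - pi_) * norm.pdf(z, mu, s1)
        d2 = pi_ * norm.pdf(z, mu, s2)
        w = d2 / (d1 + d2)                       # E-step: contamination weights
        pi_ = w.mean()                           # M-step updates
        mu = np.average(z, weights=(1 - w) / s1**2 + w / s2**2)
        s1 = np.sqrt(np.average((z - mu) ** 2, weights=1 - w))
        s2 = np.sqrt(np.average((z - mu) ** 2, weights=w))
    ll1 = norm.logpdf(z, z.mean(), z.std()).sum()        # homogeneous normal fit
    ll2 = np.log((1 - pi_) * norm.pdf(z, mu, s1) + pi_ * norm.pdf(z, mu, s2)).sum()
    return pi_, mu, s1, s2, 2 * (ll2 - ll1)

rng = np.random.default_rng(3)
z = np.concatenate([rng.normal(0, 1, 1900), rng.normal(0, 3, 100)])  # 5% contamination
print(fit_contaminated_normal(z))
```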

6.
A method of regularized discriminant analysis for discrete data, denoted DRDA, is proposed. This method is related to the regularized discriminant analysis conceived by Friedman (1989) in a Gaussian framework for continuous data. Here, we are concerned with discrete data and consider the classification problem using the multinomial distribution. DRDA is designed for the small-sample, high-dimensional setting. The method occupies an intermediate position between multinomial discrimination, the first-order independence model, and kernel discrimination. DRDA is characterized by two parameters, the values of which are calculated by minimizing a sample-based estimate of future misclassification risk by cross-validation. The first parameter is a complexity parameter which defines class-conditional probabilities as a convex combination of those derived from the full multinomial model and the first-order independence model. The second parameter is a smoothing parameter associated with the discrete kernel of Aitchison and Aitken (1976). The optimal complexity parameter is calculated first; then, holding this parameter fixed, the optimal smoothing parameter is determined. A modified approach, in which the smoothing parameter is chosen first, is discussed. The performance of the method is compared with that of other classical methods through applications to data.
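The complexity parameter's role can be sketched directly: for one class, form cell probabilities as a convex combination of the full multinomial estimate and the product-of-marginals (first-order independence) estimate. The kernel-smoothing step and the cross-validated choice of both parameters are omitted, and the direction of the convex weight lam here is an assumption of this sketch.

```python
import numpy as np

def drda_class_probs(counts, lam):
    """Class-conditional cell probabilities: lam * full multinomial estimate
    + (1 - lam) * first-order independence estimate, for one class's
    contingency table of discrete predictors."""
    counts = np.asarray(counts, float)
    p_full = counts / counts.sum()               # full multinomial model
    p_indep = np.ones_like(counts)
    for ax in range(counts.ndim):                # product of the one-way marginals
        marg = counts.sum(axis=tuple(a for a in range(counts.ndim) if a != ax))
        shape = [1] * counts.ndim
        shape[ax] = -1
        p_indep = p_indep * (marg / marg.sum()).reshape(shape)
    return lam * p_full + (1 - lam) * p_indep

table = np.array([[20, 5], [3, 12]])             # toy class: two binary predictors
print(drda_class_probs(table, lam=0.5))
```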

7.
A five-parameter extended fatigue life model called the McDonald–Birnbaum–Saunders (McBS) distribution is proposed. It extends the Birnbaum–Saunders and beta Birnbaum–Saunders [G.M. Cordeiro and A.J. Lemonte, The β-Birnbaum–Saunders distribution: An improved distribution for fatigue life modeling, Comput. Statist. Data Anal. 55 (2011), pp. 1445–1461] distributions, as well as the new Kumaraswamy–Birnbaum–Saunders distribution. We obtain the ordinary moments, generating function, mean deviations, and quantile function. The method of maximum likelihood is used to estimate the model parameters, and its usefulness is illustrated with an application to a real fatigue data set. Further, we propose a new extended regression model based on the logarithm of the McBS distribution. This model can be very useful in the analysis of real data and can give more realistic fits than other special regression models.

8.
In this paper, a discrete counterpart of the general class of continuous beta-G distributions is introduced. A discrete analog of the beta generalized exponential distribution of Barreto-Souza et al. [2], an important special case of the proposed class, is studied. This new distribution contains some previously known discrete distributions as well as two new models. The hazard rate function of the new model can be increasing, decreasing, bathtub-shaped, or upside-down bathtub-shaped. Some distributional and moment properties of the new distribution, as well as its order statistics, are discussed. Estimation of the parameters is illustrated using the maximum likelihood method and, finally, the model is examined with a real data set.
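One standard way to build such a discrete analog is the discretization P(X = k) = G(k + 1) - G(k) applied to the continuous beta-G cdf. The sketch below uses the beta generalized exponential as the parent, with baseline cdf W(x) = (1 - exp(-lam*x))**alpha; whether this matches the paper's exact parameterization is an assumption.

```python
import numpy as np
from scipy.stats import beta

def dbge_pmf(k, a, b, lam, alpha):
    """pmf of a discrete analog of the beta generalized exponential:
        P(X = k) = G(k + 1) - G(k),  k = 0, 1, 2, ...
    where G(x) = I_{W(x)}(a, b) is the beta-G cdf with generalized
    exponential baseline W(x) = (1 - exp(-lam * x))**alpha."""
    k = np.asarray(k, float)
    W = lambda x: (1.0 - np.exp(-lam * x)) ** alpha
    return beta.cdf(W(k + 1), a, b) - beta.cdf(W(k), a, b)

k = np.arange(10)
p = dbge_pmf(k, a=2.0, b=1.5, lam=0.3, alpha=1.2)   # hypothetical parameters
print(p, p.sum())                                    # partial sums approach 1
```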

9.
We introduce a new distribution, namely the Marshall–Olkin Fréchet distribution. The probability density and hazard rate functions are derived and their shape properties are considered. Expressions for the nth moments are given. Various results with respect to quantiles, Rényi entropy, and order statistics are obtained. The unknown parameters of the new distribution are estimated using the maximum likelihood method with three different iterative procedures. The model is applied to a real data set on survival times.

[Supplementary materials are available for this article. Go to the publisher's online edition of Communications in Statistics—Theory and Methods for the following free supplemental resource: a file that allows random variables from the MOF distribution to be generated.]
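For illustration, an inverse-cdf sampler for a Marshall–Olkin Fréchet distribution can be written in a few lines, assuming the usual Marshall–Olkin survival transform S_MO(x) = alpha*S(x) / (1 - (1 - alpha)*S(x)) applied to the Fréchet baseline F(x) = exp(-(sigma/x)**lam); the authors' own generator is the one provided in the supplementary file.

```python
import numpy as np

def rvs_mo_frechet(alpha, lam, sigma, size, rng=None):
    """Inverse-cdf sampler under the assumed Marshall-Olkin parameterization.

    Draw a target survival level v ~ U(0, 1), invert the MO transform to get
    the baseline survival s, then invert the Frechet cdf.
    """
    rng = rng or np.random.default_rng()
    v = rng.uniform(size=size)                        # v plays the role of S_MO(x)
    s = v / (alpha + v * (1 - alpha))                 # baseline survival S(x)
    return sigma * (-np.log1p(-s)) ** (-1.0 / lam)    # x with F(x) = 1 - s

print(rvs_mo_frechet(alpha=2.0, lam=3.0, sigma=1.0, size=5,
                     rng=np.random.default_rng(4)))
```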

10.
In some statistical applications, the data may not be considered a random sample of the whole population, and some subjects have a lower probability of belonging to the sample. Consequently, statistical inference for such data sets usually yields biased estimates. In such situations, the length-biased version of the original random variable, a special weighted distribution, often produces better inferences. Here, an alternative weighted distribution based on the mean residual life is suggested to treat the biasedness. The Rayleigh distribution is applied in many real applications, so the proposed method of weighting is used to produce a new lifetime distribution based on the Rayleigh model. In addition, statistical properties of the proposed distribution are investigated. A simulation study and a real data set illustrate that the mean residual weighted Rayleigh distribution gives a better fit than both the original and the length-biased Rayleigh distributions.
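The two weighting schemes compared in this paper can be sketched numerically for the Rayleigh model: the length-biased density x f(x)/E[X] has a closed form, while the mean-residual-life weighted density, proportional to m(x) f(x), is normalized here by quadrature. This illustrates the weighting idea under an assumed unit scale, not the paper's closed-form parameterization.

```python
import numpy as np
from scipy import integrate

def rayleigh_pdf(x, s=1.0):
    return (x / s**2) * np.exp(-x**2 / (2 * s**2))

def rayleigh_sf(x, s=1.0):
    return np.exp(-x**2 / (2 * s**2))

def mrl(x, s=1.0):
    """Mean residual life m(x) = integral_x^inf S(t) dt / S(x), numerically."""
    num, _ = integrate.quad(rayleigh_sf, x, np.inf, args=(s,))
    return num / rayleigh_sf(x, s)

def lb_pdf(x, s=1.0):
    """Length-biased Rayleigh density x f(x) / E[X], with E[X] = s * sqrt(pi/2)."""
    return x * rayleigh_pdf(x, s) / (s * np.sqrt(np.pi / 2))

def mrl_weighted_pdf(x, s=1.0):
    """Density proportional to m(x) f(x); the constant E[m(X)] is computed
    by quadrature rather than in closed form."""
    c, _ = integrate.quad(lambda t: mrl(t, s) * rayleigh_pdf(t, s), 0, np.inf)
    return mrl(x, s) * rayleigh_pdf(x, s) / c

print(lb_pdf(1.0), mrl_weighted_pdf(1.0))
```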

11.
Lin, Tsung I., Lee, Jack C., and Ni, Huey F. Statistics and Computing 2004, 14(2):119-130
A finite mixture model using the multivariate t distribution has been shown to be a robust extension of normal mixtures. In this paper, we present a Bayesian approach to inference about the parameters of t-mixture models. The prior distributions are specified to be weakly informative so as to avoid nonintegrable posterior distributions. We present two efficient EM-type algorithms for computing the joint posterior mode with the observed data and an incomplete future vector as the sample. Markov chain Monte Carlo sampling schemes are also developed to obtain the target posterior distribution of the parameters. The advantages of the Bayesian approach over the maximum likelihood method are demonstrated via a set of real data.

12.
A random effects model for analyzing mixed longitudinal count and ordinal data is presented, in which the count response is inflated at two points (k and l) and a (k,l)-inflated power series distribution is used as its distribution. A full likelihood-based approach is used to obtain maximum likelihood estimates of the model parameters. For data with non-ignorable missing values, a probit model is used for the missing-data mechanism. The dependence between the longitudinal sequences of responses and the inflation parameters is investigated using a random effects approach. Also, to capture the correlation between the mixed ordinal and count responses of each individual at each time, a shared random effect is used. To assess the performance of the model, a simulation study is performed for the case in which the count response has a (k,l)-inflated binomial distribution. Performance comparisons of the count-ordinal random effects model, the zero-inflated ordinal random effects model, and the (k,l)-inflated ordinal random effects model are also given. The model is applied to a real social data set from the first two waves of the National Longitudinal Study of Adolescent to Adult Health (Add Health study). In this data set, the joint responses are the number of days in a month that each individual smoked (the count response) and the general health condition of each individual (the ordinal response). For the count response, there is an excess of the values 0 and 30.

13.
Recently, Domma et al. [An extension of Azzalini's method, J. Comput. Appl. Math. 278 (2015), pp. 37–47] proposed an extension of Azzalini's method. This method is attractive due to its flexibility and ease of applicability. Most of the weighted Weibull models that have been introduced have monotonic hazard rate functions, a fact that limits their applicability. Our aim, therefore, is to build a new weighted Weibull distribution with both monotonic and non-monotonic hazard rate functions. A new weighted Weibull distribution, the so-called generalized weighted Weibull (GWW) distribution, is introduced by the method of Domma et al. [13]. The GWW distribution possesses decreasing, increasing, upside-down bathtub, N-shaped, and M-shaped hazard rates. Also, it is very easy to derive statistical properties of the GWW distribution. Finally, we consider an application of the GWW model to a real data set and provide a simulation study as well.

14.
The star-shaped Λ-coalescent and the corresponding Λ-Fleming–Viot process, where the Λ measure has a single atom at unity, are studied in this article. The transition functions and stationary distribution of the Λ-Fleming–Viot process are derived in a two-type model with mutation. The distribution of the number of non-mutant lines back in time in the star-shaped Λ-coalescent is found. Extensions are made to a model with d types, either with parent-independent mutation or general Markov mutation, and to an infinitely-many-types model, when d → ∞. An eigenfunction expansion for the transition functions is found, which has polynomial right eigenfunctions and left eigenfunctions described by hyperfunctions. A further star-shaped model with general frequency-dependent change is considered and the stationary distribution of the Fleming–Viot process derived. This model includes a star-shaped Λ-Fleming–Viot process with mutation and selection. In a general Λ-coalescent, explicit formulae for the transition functions and the stationary distribution with mutation are unknown; in this article, however, explicit formulae are derived for the star-shaped coalescent.

15.
Emrah Altun. Statistics 2019, 53(2):364-386
In this paper, we introduce a new distribution, called the generalized Gudermannian (GG) distribution, and its skew extension for GARCH models in modelling daily Value-at-Risk (VaR). Basic structural properties of the proposed distribution are obtained, including the probability density and cumulative distribution functions, moments, and a stochastic representation. The maximum likelihood method is used to estimate the unknown parameters of the proposed model, and the finite-sample performance of the maximum likelihood estimates is evaluated by means of a Monte Carlo simulation study. A real-data application to the Nikkei 225 index is given to demonstrate the performance of the GARCH model specified under the skew extension of the GG innovation distribution against the normal, Student's t, skew normal, generalized error, and skew generalized error distributions in terms of the accuracy of VaR forecasts. The empirical results show that the GARCH model with GG innovation distribution produces the most accurate VaR forecasts at all confidence levels.
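The VaR-forecasting step itself is simple once the innovation quantile is available: filter the conditional variance with a GARCH(1,1) recursion and scale the innovation quantile by the forecast volatility. The sketch below takes the GARCH parameters and the 1% quantile q as given hypothetical values; in the paper they come from maximum likelihood estimation and the fitted GG (or competing) innovation distribution.

```python
import numpy as np

def garch11_var(returns, omega, a, b, q):
    """One-step-ahead VaR from a GARCH(1,1) volatility filter:
        sigma2_{t+1} = omega + a * r_t**2 + b * sigma2_t
        VaR_{t+1}    = sigma_{t+1} * q
    where q is the alpha-quantile of the standardized innovation
    distribution (any innovation quantile can be plugged in)."""
    r = np.asarray(returns, float)
    s2 = np.var(r)                          # initialize at the sample variance
    for x in r:
        s2 = omega + a * x**2 + b * s2      # variance recursion through the sample
    return np.sqrt(s2) * q

rng = np.random.default_rng(5)
r = rng.standard_t(df=5, size=1000) * 0.01  # toy heavy-tailed return series
# hypothetical fitted parameters and a 1% innovation quantile
print(garch11_var(r, omega=1e-6, a=0.08, b=0.9, q=-2.6))
```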

16.
Consider the exchangeable Bayesian hierarchical model where the observations yi are independently distributed from sampling densities with unknown means, the means µi are a random sample from a distribution g, and the parameters of g are assigned a known distribution h. A simple algorithm is presented for summarizing the posterior distribution based on Gibbs sampling and the Metropolis algorithm. The software program Matlab is used to implement the algorithm and provide a graphical output analysis. A binomial example is used to illustrate the modeling flexibility possible with this algorithm. Methods of model checking and extensions to hierarchical regression modeling are discussed.
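A minimal Python version of such an algorithm for a beta-binomial instance of this hierarchy: the group rates p_i are updated by conjugate Gibbs draws, and the hyperparameters (µ, K) by a random-walk Metropolis step on transformed coordinates. The data, the vague hyperprior, and the proposal scale are all hypothetical, and the original implementation is in Matlab.

```python
import numpy as np
from scipy.stats import beta as beta_dist

rng = np.random.default_rng(6)
# toy data: y successes out of n trials per group
y = np.array([5, 8, 2, 12, 7])
n = np.array([20, 25, 15, 30, 22])

def log_cond_hyper(mu, K, p):
    """log p(mu, K | p): beta likelihood of the group rates plus a vague
    hyperprior, uniform on mu and proportional to 1/(1 + K)^2 on K."""
    if not (0 < mu < 1) or K <= 0:
        return -np.inf
    return -2 * np.log1p(K) + beta_dist.logpdf(p, K * mu, K * (1 - mu)).sum()

mu, K = 0.3, 10.0
draws = []
for it in range(5000):
    # Gibbs step: p_i | mu, K, y_i is conjugate beta
    p = rng.beta(K * mu + y, K * (1 - mu) + n - y)
    # Metropolis step on (logit mu, log K) with a random-walk proposal
    lmu_p = np.log(mu / (1 - mu)) + 0.3 * rng.normal()
    lK_p = np.log(K) + 0.3 * rng.normal()
    mu_p, K_p = 1 / (1 + np.exp(-lmu_p)), np.exp(lK_p)
    # log-Jacobian of the transformation enters the acceptance ratio
    jac = np.log(mu_p * (1 - mu_p) * K_p) - np.log(mu * (1 - mu) * K)
    if np.log(rng.uniform()) < log_cond_hyper(mu_p, K_p, p) - log_cond_hyper(mu, K, p) + jac:
        mu, K = mu_p, K_p
    draws.append([mu, K])

print(np.mean(draws, axis=0))   # posterior means of (mu, K)
```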

17.
The Birnbaum-Saunders (BS) distribution is an asymmetric probability model that is receiving considerable attention. In this article, we propose a methodology based on a new class of BS models generated from the Student-t distribution. We obtain a recurrence relationship for a BS distribution based on a nonlinear skew-t distribution. Estimators of the model parameters are obtained by the maximum likelihood method and evaluated by Monte Carlo simulations. We illustrate the results by analyzing two real data sets. These data analyses allow the adequacy of the proposed model to be shown and discussed using model selection tools.

18.
A structured model is essentially a family of random vectors Xθ defined on a probability space with values in a sample space. If, for a given sample value x and for each ω in the probability space, there is at most one parameter value θ for which Xθ(ω) is equal to x, then the model is called additive at x. When a certain conditional distribution exists, a frequency interpretation specific to additive structured models holds, and is summarized in a unique structured distribution for the parameter. Many of the techniques used by Fisher in deriving and handling his fiducial probability distribution are shown to be valid when dealing with a structured distribution.

19.
Length-biased sampling data are often encountered in studies of economics, industrial reliability, epidemiology, genetics, and cancer screening. The complication with this type of data is that the observed lifetimes suffer from left truncation and right censoring, where the left truncation variable has a uniform distribution. For the Cox proportional hazards model, Huang & Qin (Journal of the American Statistical Association, 107, 2012, p. 107) proposed a composite partial likelihood method which not only has the simplicity of the popular partial likelihood estimator but can also be easily performed with standard statistical software. The accelerated failure time model has become a useful alternative to the Cox proportional hazards model. In this paper, using the composite partial likelihood technique, we study this model with length-biased sampling data. The proposed method has a very simple form and is robust when the assumption that the censoring time is independent of the covariate is violated. To ease the difficulty of solving the non-smooth estimating equation, we use a kernel-smoothed estimation method (Heller, Journal of the American Statistical Association, 102, 2007, p. 552). Large-sample results and a resampling method for variance estimation are discussed. Simulation studies are conducted to compare the performance of the proposed method with existing methods. A real data set is used for illustration.

20.
We present an algorithm for multivariate robust Bayesian linear regression with missing data. The iterative algorithm computes an approximate posterior for the model parameters based on the variational Bayes (VB) method. Compared to the EM algorithm, the VB method has the advantage that the variance of the model parameters is also computed directly by the algorithm. We consider three families of Gaussian scale mixture models for the measurements, which include as special cases the multivariate t distribution, the multivariate Laplace distribution, and the contaminated normal model. The observations can contain missing values, assuming that the missing-data mechanism can be ignored. A Matlab/Octave implementation of the algorithm is presented and applied to three reference examples from the literature.
