首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The performance of Anderson's classification statistic based on a post-stratified random sample is examined. It is assumed that the training sample is a random sample from a stratified population consisting of two strata with unknown stratum weights. The sample is first segregated into the two strata by post-stratification. The unknown parameters for each of the two populations are then estimated and used in the construction of the plug-in discriminant. Under this procedure, it is shown that additional estimation of the stratum weight will not seriously affect the performance of Anderson's classification statistic. Furthermore, our discriminant enjoys a much higher efficiency than the procedure based on an unclassified sample from a mixture of normals investigated by Ganesalingam and McLachlan (1978).  相似文献   

2.
Linear maps of a single unclassified observation are used to estimate the mixing proportion in a mixture of two populations with homogeneous variances in the presence of covariates. with complete knowledge of the parameters of the individual populations, the linear map for which the estimator is unbiased and has minimum variance amongst all similar estimators can be determined. Plug-in estimator based on independent training samples from the component populations can be constructed and is asymptotically equivalent to Cochran's classification statistic V* for covariate classification; see Memon and Okamoto (1970). Under normality assumptions, asymptotic expansion of the distribution of the plug-in estimator is available. In the absence of covariates, our estimator reduces to that suggested by Walker (1980) who has investigated the problem based on information on large unclassified samples from a mixture of two populations with heterogeneous variances. In contrast, distribution of Walker's estimator seems intractable in moderate sample sizes even with normality assumption.  相似文献   

3.
In this paper we consider the problem of testing the means of k multivariate normal populations with additional data from an unknown subset of the k populations. The purpose of this research is to offer test procedures utilizing all the available data for the multivariate analysis of variance problem because the additional data may contain valuable information about the parameters of the k populations. The standard procedure uses only the data from identified populations. We provide a test using all available data based upon Hotelling' s generalized T2statistic. The power of this test is computed using Betz's approximation of Hotelling' s generalized T2statistic by an F-distribution. A comparison of the power of the test and the standard test procedure is also given.  相似文献   

4.
In many financial applications, Poisson mixture regression models are commonly used to analyze heterogeneous count data. When fitting these models, the observed counts are supposed to come from two or more subpopulations and parameter estimation is typically performed by means of maximum likelihood via the Expectation–Maximization algorithm. In this study, we discuss briefly the procedure for fitting Poisson mixture regression models by means of maximum likelihood, the model selection and goodness-of-fit tests. These models are applied to a real data set for credit-scoring purposes. We aim to reveal the impact of demographic and financial variables in creating different groups of clients and to predict the group to which each client belongs, as well as his expected number of defaulted payments. The model's conclusions are very interesting, revealing that the population consists of three groups, contrasting with the traditional good versus bad categorization approach of the credit-scoring systems.  相似文献   

5.
A procedure for estimating Phillips's function by direct least squares is given and asymptotic standard errors obtained. The method is applied to two sets of data, including Phillips's. The limitations of the method as revealed by the practical examples are considered.  相似文献   

6.
Hahn (1977) suggested a procedure for constructing prediction intervals for the difference between the means of two future samples from normal populations having equal variance, based on past samples selected from both populations. In this paper, we extend Hahn's work by constructing simultaneous prediction intervals for all pairwise differences among the means of k ≥ 2 future samples from normal populations with equal variances, using past samples taken from each of the k populations. For K = 2, this generalization reduces to Hahn's special case. These prediction intervals may be used when one has sampled the performance of several products and wishes to simultaneously as- sess the differences in future sample mean performance of these products with a predetermined overall coverage probability. The use of the new procedure is demonstrated with a numerical example.  相似文献   

7.
Various methods for clustering mixed-mode data are compared. It is found that a method based on a finite mixture model in which the observed categorical variables are generated from underlying continuous variables out-performs more conventional methods when applied to artificially generated data. This method also performs best when applied to Fisher's iris data in which two of the variables are categorized by applying thresholds.  相似文献   

8.
This study proposes a simple way to perform a power analysis of Mantel's test applied to squared Euclidean distance matrices. The general statistical aspects of the simple Mantel's test are reviewed. The Monte Carlo method is used to generate bivariate Gaussian variables in order to create squared Euclidean distance matrices. The power of the parametric correlation t-test applied to raw data is also evaluated and compared with that of Mantel's test. The standard procedure for calculating punctual power levels is used for validation. The proposed procedure allows one to draw the power curve by running the test only once, dispensing with the time demanding standard procedure of Monte Carlo simulations. Unlike the standard procedure, it does not depend on a knowledge of the distribution of the raw data. The simulated power function has all the properties of the power analysis theory and is in agreement with the results of the standard procedure.  相似文献   

9.
The inverse Gaussian family of non negative, skewed random variables is analytically simple, and its inference theory is well known to be analogous to the normal theory in numerous ways. Hence, it is widely used for modeling non negative positively skewed data. In this note, we consider the problem of testing homogeneity of order restricted means of several inverse Gaussian populations with a common unknown scale parameter using an approach based on the classical methods, such as Fisher's, for combining independent tests. Unlike the likelihood approach which can only be readily applied to a limited number of restrictions and the settings of equal sample sizes, this approach is applicable to problems involving a broad variety of order restrictions and arbitrary sample size settings, and most importantly, no new null distributions are needed. An empirical power study shows that, in case of the simple order, the test based on Fisher's combination method compares reasonably with the corresponding likelihood ratio procedure.  相似文献   

10.
A consistent test for difference in locations between two bivariate populations is proposed, The test is similar as the Mann-Whitney test and depends on the exceedances of slopes of the two samples where slope for each sample observation is computed by taking the ratios of the observed values. In terms of the slopes, it reduces to a univariate problem, The power of the test has been compared with those of various existing tests by simulation. The proposed test statistic is compared with Mardia's(1967) test statistics, Peters-Randies(1991) test statistic, Wilcoxon's rank sum test. statistic and Hotelling' T2 test statistic using Monte Carlo technique. It performs better than other statistics compared for small differences in locations between two populations when underlying population is population 7(light tailed population) and sample size 15 and 18 respectively. When underlying population is population 6(heavy tailed population) and sample sizes are 15 and 18 it performas better than other statistic compared except Wilcoxon's rank sum test statistics for small differences in location between two populations. It performs better than Mardia's(1967) test statistic for large differences in location between two population when underlying population is bivariate normal mixture with probability p=0.5, population 6, Pearson type II population and Pearson type VII population for sample size 15 and 18 .Under bivariate normal population it performs as good as Mardia' (1967) test statistic for small differences in locations between two populations and sample sizes 15 and 18. For sample sizes 25 and 28 respectively it performs better than Mardia's (1967) test statistic when underlying population is population 6, Pearson type II population and Pearson type VII population  相似文献   

11.
This paper presents a two‐stage procedure for estimating the conditional support curve of a random variable X, given the information of a random vector X. Quantile estimation is followed by an extremal analysis on the residuals for problems which can be written as regression models. The technique is applied to data from the National Bureau of Economic Research and US Census Bureau's Center for Economic Studies which contain all four‐digit manufacturing industries. Simulation results show that in linear regression models the proposed estimation procedure is more efficient than the extreme linear regression quantile.  相似文献   

12.
This paper presents a selection procedure that combines Bechhofer's indifference zone selection and Gupta's subset selection approaches, by using a preference threshold. For normal populations with common known variance, a subset is selected of all populations that have sample sums within the distance of this threshold from the largest sample sum. We derive the minimal necessary sample size and the value for the preference threshold, in order to satisfy two probability requirements for correct selection, one related to indifference zone selection, the other to subset selection. Simulation studies are used to illustrate the method.  相似文献   

13.
This paper is intended to make a contribution to the ongoing debate about declining social mobility in Great Britain by analyzing mobility tables based on data from the 1991 British Household Panel Survey and the 2005 General Household Survey. The models proposed here generalize Hauser's levels models and allow for a semi-parametric analysis of change in social mobility. The cell frequencies are assumed to be equal to the product of three effects: the effect of the father's position for the given year, the effect of the son's position for the given year, and the mobility effect related to the difference between the father's and the son's positions. A generalization of the iterative proportional fitting procedure is proposed and applied to computing the maximum likelihood estimates of the cell frequencies. The standard errors of the estimated parameters are computed under the product-multinomial sampling assumption. The results indicate opposing trends of mobility between the two timepoints. Fewer steps up or down in the society became less likely, while more steps became somewhat more likely.  相似文献   

14.
The article presents the Bayesian inference for the parameters of randomly censored Burr-type XII distribution with proportional hazards. The joint conjugate prior of the proposed model parameters does not exist; we consider two different systems of priors for Bayesian estimation. The explicit forms of the Bayes estimators are not possible; we use Lindley's method to obtain the Bayes estimates. However, it is not possible to obtain the Bayesian credible intervals with Lindley's method; we suggest the Gibbs sampling procedure for this purpose. Numerical experiments are performed to check the properties of the different estimators. The proposed methodology is applied to a real-life data for illustrative purposes. The Bayes estimators are compared with the Maximum likelihood estimators via numerical experiments and real data analysis. The model is validated using posterior predictive simulation in order to ascertain its appropriateness.  相似文献   

15.
A model-based classification technique is developed, based on mixtures of multivariate t-factor analyzers. Specifically, two related mixture models are developed and their classification efficacy studied. An AECM algorithm is used for parameter estimation, and convergence of these algorithms is determined using Aitken's acceleration. Two different techniques are proposed for model selection: the BIC and the ICL. Our classification technique is applied to data on red wine samples from Italy and to fatty acid measurements on Italian olive oils. These results are discussed and compared to more established classification techniques; under this comparison, our mixture models give excellent classification performance.  相似文献   

16.
Tong ‘1978’ proposed an adaptive approach as an alternative to the classical indifference-zone formulation of the problems of ranking and selection. With a fixed pre-selected y*‘1/k < y* < 1’ his procedure calls for the termination of vector-at-a-time sampling when the estimated probability of a correct selection exceeds Y* for the first time. The purpose of this note is to show that for the case of two normal populations with common known variance, the expected number of vector-observations required by Tong's procedure to terminate sampling approaches infinity as the two population means approach equality for Y* ≥ 0.8413.It is conjectured that this phenomenon also persists if the two largest of K ≥3 population means approach equality. Since in the typical ranking and selection setting it usually is assumed that the experimenter has no knowledge concerning the differences between the population means, the experimenter who uses Tong's procedure clearly does so at his own risk.  相似文献   

17.
Amemiya's generalized least squares method for the estimation of simultaneous equation modeis with qualitative or limited dependent variables is known to be efficient relative to many popular two stage estimators. This note points out that test statistics for overidentification restrictions can be obtained as by-products of Amerniya's generalized least squares procedure. Amemiya's procedure is shown to be a minimum chisquare method. The Amemiya procedure is valuable both for efficient estimation and for model evaluation of such models.  相似文献   

18.
We compare the selection procedure of Levin and Robbins [1981. Selecting the highest probability in binomial or multinomial trials. Proc. Nat. Acad. Sci. USA 78, 4663–4666.] with the procedure of Paulson [1994. Sequential procedures for selecting the best one of k Koopman–Darmois populations. Sequential Analysis 13, 207–220.] to identify the best of several binomial populations with sequential elimination of unlikely candidates. We point out situations in which the Levin–Robbins procedure dominates the Paulson procedure in terms of the duration of the experiment, the expected total number of observations, and the expected number of failures. Because the Levin–Robbins procedure is also easier to implement than Paulson's procedure and gives a tighter guarantee for the probability of correct selection, we conclude that it holds a competitive edge over Paulson's procedure.  相似文献   

19.
A subset selection procedure is developed for selecting a subset containing the multinomial population that has the highest value of a certain linear combination of the multinomial cell probabilities; such population is called the ‘best’. The multivariate normal large sample approximation to the multinomial distribution is used to derive expressions for the probability of a correct selection, and for the threshold constant involved in the procedure. The procedure guarantees that the probability of a correct selection is at least at a pre-assigned level. The proposed procedure is an extension of Gupta and Sobel's [14] selection procedure for binomials and of Bakir's [2] restrictive selection procedure for multinomials. One illustration of the procedure concerns population income mobility in four countries: Peru, Russia, South Africa and the USA. Analysis indicates that Russia and Peru fall in the selected subset containing the best population with respect to income mobility from poverty to a higher-income status. The procedure is also applied to data concerning grade distribution for students in a certain freshman class.  相似文献   

20.
Sequential analyses in clinical trials have ethical and economic advantages over fixed sample size methods. The sequential probability ratio test (SPRT) is a hypothesis testing procedure which evaluates data as it is collected. The original SPRT was developed by Wald for one-parameter families of distributions and later extended by Bartlett to handle the case of nuisance parameters. However, Bartlett's SPRT requires independent and identically distributed observations. In this paper we show that Bartlett's SPRT can be applied to generalized linear model (GLM) contexts. Then we propose an SPRT analysis methodology for a Poisson generalized linear mixed model (GLMM) that is suitable for our application to the design of a multicenter randomized clinical trial that compares two preventive treatments for surgical site infections. We validate the methodology with a simulation study that includes a comparison to Neyman–Pearson and Bayesian fixed sample size test designs and the Wald SPRT.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号