Similar articles
 Found 20 similar articles; search took 31 ms
1.
The maximum of $k$ functions defined on $\mathbb{R}^n$, $n \ge 1$, by $f_{\max}(x) = \max\{f_1(x), \ldots, f_k(x)\}$, $\forall x \in \mathbb{R}^n$, can play important roles in Statistics, particularly in Classification. Through its relation with the Bayes error, which is the reference error in classification, it can serve to compute numerical bounds for errors in other classification schemes. It can also serve to define the joint $L^1$-distance between more than two densities, which, in turn, will serve as a useful tool in Classification and Cluster Analyses. It also has vast potential for application in digital image processing. Finally, its versatile role can be seen in several numerical examples related to the analysis of Fisher's classical iris data in multidimensional spaces.
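
The link between $f_{\max}$ and the Bayes error mentioned above can be illustrated with a small sketch. It relies on the standard identity that, for prior-weighted class densities $g_i = \pi_i f_i$, the Bayes error equals $1 - \int \max_i g_i(x)\,dx$. The two univariate normal classes, the equal priors, the integration range, and all function names below are illustrative assumptions, not taken from the article.

```python
import math

def normal_pdf(x, mu, sigma):
    """Density of a univariate normal distribution."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def make_weighted_density(prior, mu, sigma):
    """Return g(x) = prior * f(x) for one class (hypothetical helper)."""
    return lambda x: prior * normal_pdf(x, mu, sigma)

def integral_of_max(functions, lo, hi, steps=20000):
    """Trapezoidal approximation of the integral of max_i g_i(x) over [lo, hi]."""
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        x = lo + i * h
        value = max(g(x) for g in functions)
        total += value * (0.5 if i in (0, steps) else 1.0)
    return total * h

# Assumed example: two classes N(0, 1) and N(2, 1) with equal priors.
weighted = [
    make_weighted_density(0.5, 0.0, 1.0),
    make_weighted_density(0.5, 2.0, 1.0),
]
bayes_error = 1.0 - integral_of_max(weighted, -10.0, 12.0)
print(f"Bayes error estimate: {bayes_error:.6f}")
```

For this particular pair of classes the decision boundary is at $x = 1$, so the result can be checked against the closed form $\Phi(-1)$.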

2.
A recent article in this journal presented a variety of expressions for the coefficient of determination ($R^2$) and demonstrated that these expressions were generally not equivalent. The article discussed potential pitfalls in interpreting the $R^2$ statistic in ordinary least-squares regression analysis. The current article extends this discussion to the case in which regression models are fit by weighted least squares and points out an additional pitfall that awaits the unwary data analyst. We show that unthinking reliance on the $R^2$ statistic can lead to an overly optimistic interpretation of the proportion of variance accounted for in the regression. We propose a modification of the estimator and demonstrate its utility by example.

3.
Statistics that usually accompany the regression model do not provide insight into the quality of the data or the potential influence of the individual observations on the estimates. In this study, the $Q^2$ statistic is used as a criterion for detecting influential observations or outliers. The statistic is derived from the jackknifed residuals, the sum of squares of which is generally known as the prediction sum of squares, or PRESS. This article compares $R^2$ with $Q^2$ and suggests that the latter be used as part of the data-quality check. It is shown, for two separate data sets obtained from regional cost of living and U.S. food industry studies, that in the presence of outliers the $Q^2$ statistic can be negative, because it is sensitive to the choice of regressors and the inclusion of influential observations. Once the outliers are dropped from the sample, the discrepancy between $Q^2$ and $R^2$ values is negligible.
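
A minimal sketch of the comparison described above, assuming the common definition $Q^2 = 1 - \mathrm{PRESS}/\mathrm{SST}$ (the abstract does not spell out its formula) and a simple one-regressor model. PRESS is computed by brute-force leave-one-out refits; the data and function names are made up for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    xbar = sum(xs) / n
    ybar = sum(ys) / n
    sxx = sum((x - xbar) ** 2 for x in xs)
    sxy = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return ybar - slope * xbar, slope

def r2_and_q2(xs, ys):
    """Return (R^2, Q^2), with Q^2 = 1 - PRESS / SST (assumed definition)."""
    n = len(xs)
    intercept, slope = fit_line(xs, ys)
    ybar = sum(ys) / n
    sst = sum((y - ybar) ** 2 for y in ys)
    rss = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    press = 0.0
    for i in range(n):
        # Refit without observation i, then predict the held-out point.
        a_i, b_i = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        press += (ys[i] - (a_i + b_i * xs[i])) ** 2
    return 1.0 - rss / sst, 1.0 - press / sst

xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys_clean = [1.1, 1.9, 3.2, 3.9, 5.1, 6.0]
ys_contaminated = ys_clean[:-1] + [0.0]  # last response replaced by an outlier

r2_clean, q2_clean = r2_and_q2(xs, ys_clean)
r2_bad, q2_bad = r2_and_q2(xs, ys_contaminated)
print(f"clean data:        R^2 = {r2_clean:.4f}, Q^2 = {q2_clean:.4f}")
print(f"contaminated data: R^2 = {r2_bad:.4f}, Q^2 = {q2_bad:.4f}")
```

Because each leave-one-out residual is at least as large in magnitude as the corresponding ordinary residual, $Q^2$ never exceeds $R^2$ under this definition; the gap widens when an influential point is present.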

4.
It is common to monitor several correlated quality characteristics using Hotelling's $T^2$ statistic. However, $T^2$ confounds a location shift with a scale shift, and consequently it is often difficult to determine the factors responsible for an out-of-control signal in terms of the process mean vector and/or process covariance matrix. In this paper, we propose a diagnostic procedure called the 'D-technique' to detect the nature of the shift. For this purpose, two sets of regression equations, each consisting of the regression of a variable on the remaining variables, are used to characterize the 'structure' of the 'in control' process and that of the 'current' process. To determine the sources responsible for an out-of-control state, it is shown that it is enough to compare these two structures using the dummy variable multiple regression equation. The proposed method is operationally simpler than existing diagnostic tools and computationally advantageous over them. The technique is illustrated with various examples.

5.
Let $K_n(a)$ be the number of observations in the interval $(M_n - a, M_n)$, where $M_n$ is the maximum value in a sequence of size $n$. We study the asymptotic properties of $K_n(a)$ under the $F^{\alpha}$-scheme and discuss the influence of the associated sequence $\alpha_n$ on the limit behaviour of this random variable.

6.
This short article presents a unified approach to representing and computing the cumulative distribution function for the noncentral $t$, $F$, and $\chi^2$ distributions. Unlike the existing algorithms, which involve different expansions and/or recurrences, the new approach consistently represents all three noncentral cumulative distribution functions as the integral of the normal cumulative distribution function and the $\chi^2$ density function.

7.
In statistical process control applications, the multivariate $T^2$ control chart based on Hotelling's $T^2$ statistic is useful for detecting the presence of special causes of variation. In particular, use of the $T^2$ statistic based on the successive differences covariance matrix estimator has been shown to be very effective in detecting the presence of a sustained step or ramp shift in the mean vector. However, the exact distribution of this statistic is unknown. In this article, we derive the maximum value of the $T^2$ statistic based on the successive differences covariance matrix estimator. This distributional property is crucial for calculating an approximate upper control limit of a $T^2$ control chart based on successive differences, as described in Williams et al. (2006) [Williams, J. D., Woodall, W. H., Birch, J. B., Sullivan, J. H. (2006). On the distribution of $T^2$ statistics based on successive differences. J. Qual. Technol. 38: 217-229].

8.
This article studies the minima stable property of the general multivariate Pareto distributions $MP^{(k)}$(I), $MP^{(k)}$(II), $MP^{(k)}$(III), and $MP^{(k)}$(IV), which can be applied to characterize the $MP^{(k)}$ distribution via its weighted ordered coordinates minima and marginal distribution. Also, the multivariate semi-Pareto distribution (denoted by MSP) is identified in the class of geometric minima infinite divisible and geometric minima stable distributions. If the exponent measure satisfies some functional equation, then the geometric minima stable property can be used to characterize the MSP distribution. Finally, the finite sample minima infinite divisible property of the $MP^{(k)}$(I), (II), and (IV) distributions is also discussed.

9.
Zuo (2004) investigated the simplified replacement finite sample breakdown point of weighted $L^p$-depth and $L^p$-median for some appropriate weight functions. This article first studies the addition breakdown point of weighted $L^p$-depth functions. In addition, for some weight functions different from those in Zuo (2004) [Zuo, Y. (2004). Robustness of weighted $L^p$-depth and $L^p$-median. Allgemeines Statistisches Archiv 88: 215-234], we establish lower bounds for these two types of breakdown point of the weighted $L^2$-median.

10.
When a process is monitored with a $T^2$ control chart in a Phase II setting, the MYT decomposition is a valuable diagnostic tool for interpreting signals in terms of the process variables. The decomposition splits a signaling $T^2$ statistic into independent components that can be associated with either individual variables or groups of variables. Since these components are $T^2$ statistics with known distributions, they can be used to determine which of the process variable(s) contribute to the signal. However, this procedure cannot be applied directly to Phase I since the distributions of the individual components are unknown. In this article, we develop the MYT decomposition procedure for a Phase I operation, when monitoring a random sample of individual observations and identifying outliers. We use a relationship between the $T^2$ statistic in Phase I and the corresponding $T^2$ statistic that results when an observation is omitted from this sample to derive the distributions of these components and demonstrate the Phase I application of the MYT decomposition.

11.
Three new test statistics are introduced for correlated categorical data in stratified R×C tables. They are similar in form to the standard generalized Cochran-Mantel-Haenszel statistics but modified to handle correlated outcomes. Two of these statistics are asymptotically valid in both many-strata (sparse data) and large-strata limiting models. The third one is designed specifically for the many-strata case but is valid even with a small number of strata. This latter statistic is also appropriate when strata are assumed to be random.

12.
13.
This paper investigates alternatives to MVU estimators in noncentral $\chi^2$ and $F$ distributions. Two directions are pursued. In the first, a general approach for uniformly improving on MVU estimators is described and illustrated. In the second, Bayesian procedures are characterized and illustrated as well. This effort extends earlier work of Perlman and Rasmussen and of Neff and Strawderman.

14.
In this paper we consider the issue of constructing retrospective $T^2$ control chart limits so as to control the overall probability of a false alarm at a specified value. We describe an exact method for constructing the control limits for retrospective examination. Since the exact limit is computationally cumbersome to find, we then consider Bonferroni adjustments to Alt's control limit and to the standard $\chi^2$ control limit as alternatives. We present the results of some simulation experiments that are carried out to compare the performance of these control limits. The results indicate that the Bonferroni-adjusted Alt's control limit performs better than the Bonferroni-adjusted $\chi^2$ control limit. Furthermore, it appears that the Bonferroni-adjusted Alt's control limit is more than adequate for controlling the overall false alarm probability at a specified value.
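
As a rough illustration of the Bonferroni-adjusted $\chi^2$ limit discussed above, the sketch below treats the special case of $p = 2$ quality characteristics, where the $\chi^2_2$ upper quantile has the closed form $-2\ln(\alpha)$. The choice of $p = 2$, the number of retrospective observations `m`, the overall rate `alpha`, and the function name are assumptions made for the example; Alt's limit and the exact limit from the paper are not reproduced here.

```python
import math

def bonferroni_chi2_ucl_p2(alpha, m):
    """Bonferroni-adjusted chi-square upper control limit for p = 2 variables.

    Each of the m retrospective T^2 values is tested at level alpha / m.
    For 2 degrees of freedom, P(X > x) = exp(-x / 2), so the upper
    (alpha / m) quantile is -2 * ln(alpha / m).
    """
    if not (0.0 < alpha < 1.0) or m < 1:
        raise ValueError("alpha must be in (0, 1) and m must be at least 1")
    return -2.0 * math.log(alpha / m)

# Assumed example: overall false-alarm probability 0.05 across 50 observations.
unadjusted_limit = bonferroni_chi2_ucl_p2(0.05, 1)
adjusted_limit = bonferroni_chi2_ucl_p2(0.05, 50)
print(f"unadjusted limit: {unadjusted_limit:.4f}")
print(f"Bonferroni-adjusted limit: {adjusted_limit:.4f}")
```

The adjusted limit is higher than the unadjusted one, which is what keeps the overall false-alarm probability at or below the nominal value when all `m` points are examined together.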

15.
The paper uses Hsu's model to exemplify a general pattern for deriving results on variance component estimation from well-known results on mean estimation, as far as linear model theory is concerned. This 'dispersion-mean correspondence' provides new and short proofs for various theorems from the literature concerning unbiased invariant quadratic estimators with minimum Bayes risk or minimum variance. For pure variance component models, unbiased non-negative quadratic estimability is characterized in terms of the design matrices.

16.
This article examines several goodness-of-fit measures in the binary probit regression model. Existing pseudo-$R^2$ measures are reviewed, and two modified measures and one new pseudo-$R^2$ measure are proposed. For the probit regression model, empirical comparisons are made between different goodness-of-fit measures and the squared sample correlation coefficient of the observed response and the predicted probabilities. As an illustration, the goodness-of-fit measures are applied to a "paid labor force" data set.

17.
We provide a simple result on the H-decomposition of a U-statistic that allows for easy determination of its magnitude when the statistic's kernel depends on the sample size $n$. The result provides a direct and convenient method to characterize the asymptotic magnitude of semiparametric and nonparametric estimators or test statistics involving high dimensional sums. We illustrate the use of our result in previously studied estimators/test statistics and in a novel nonparametric $R^2$ test for overall significance of a nonparametric regression model.

18.
It is well known that Yates' algorithm can be used to estimate the effects in a factorial design. We develop a modification of this algorithm and call it the modified Yates' algorithm and its inverse. We show that the intermediate steps in our algorithm have a direct interpretation as estimated level-specific mean values and effects. We also show how Yates' algorithm or our modified algorithm can be used to construct the blocks in a $2^k$ factorial design and to generate the layout sheet of a $2^{k-p}$ fractional factorial design and the confounding pattern in such a design. In a final example we put together all these methods by generating and analysing a $2^{6-2}$ design with 2 blocks.
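
For reference, the classical (unmodified) Yates' algorithm can be sketched as follows; the modified algorithm and its inverse proposed in the article are not reproduced. Responses are assumed to be in standard order, (1), a, b, ab, ..., and the function name and the example data are illustrative.

```python
def yates(responses):
    """Classical Yates' algorithm for a full 2^k factorial design.

    `responses` must be in standard (Yates) order: (1), a, b, ab, c, ...
    Returns the final column: the grand total followed by the contrasts
    for A, B, AB, C, ... in the same order.
    """
    n = len(responses)
    k = n.bit_length() - 1
    if n < 2 or 2 ** k != n:
        raise ValueError("number of responses must be a power of two (>= 2)")
    column = list(responses)
    for _ in range(k):
        pairs = [(column[i], column[i + 1]) for i in range(0, n, 2)]
        # First half: sums of adjacent pairs; second half: second minus first.
        column = [a + b for a, b in pairs] + [b - a for a, b in pairs]
    return column

# Assumed 2^2 example with responses for (1), a, b, ab.
final_column = yates([1, 3, 5, 11])
k = 2
grand_mean = final_column[0] / 2 ** k
effects = [contrast / 2 ** (k - 1) for contrast in final_column[1:]]
print("total and contrasts:", final_column)
print("grand mean:", grand_mean, "effects (A, B, AB):", effects)
```

Dividing the total by $2^k$ gives the grand mean, and dividing each contrast by $2^{k-1}$ gives the corresponding effect estimate.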

19.
Properties of Hotelling's (1931) $T^2$ are studied under model misspecification in the model for a multivariate experiment. Stochastic bounds on $T^2$ and further properties of the $T^2$ test are studied under misspecified location and scale. The bounds are evaluated numerically in selected cases.

20.
In this article, we consider the problem of testing the mean vector in the multivariate normal distribution, where the dimension $p$ is greater than the sample size $N$. We propose a new test, TBlock, and obtain its asymptotic distribution. We also compare the proposed test with two other tests. The simulation results suggest that the performance of the new test is comparable to that of the two existing tests, and under some circumstances it may have higher power. Therefore, the new statistic can be employed in practice as an alternative choice.
