首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 977 毫秒
1.
We provide a simple result on the H-decomposition of a U-statistics that allows for easy determination of its magnitude when the statistic’s kernel depends on the sample size n. The result provides a direct and convenient method to characterize the asymptotic magnitude of semiparametric and nonparametric estimators or test statistics involving high dimensional sums. We illustrate the use of our result in previously studied estimators/test statistics and in a novel nonparametric R2 test for overall significance of a nonparametric regression model.  相似文献   

2.
ABSTRACT

The one-sample Wilcoxon signed rank test was originally designed to test for a specified median, under the assumption that the distribution is symmetric, but it can also serve as a test for symmetry if the median is known. In this article we derive the Wilcoxon statistic as the first component of Pearson's X 2 statistic for independence in a particularly constructed contingency table. The second and third components are new test statistics for symmetry. In the second part of the article, the Wilcoxon test is extended so that symmetry around the median and symmetry in the tails can be examined seperately. A trimming proportion is used to split the observations in the tails from those around the median. We further extend the method so that no arbitrary choice for the trimming proportion has to be made. Finally, the new tests are compared to other tests for symmetry in a simulation study. It is concluded that our tests often have substantially greater powers than most other tests.  相似文献   

3.
This article considers K pairs of incomplete correlated 2 × 2 tables in which the interesting measurement is the risk difference between marginal and conditional probabilities. A Wald-type statistic and a score-type statistic are presented to test the homogeneity hypothesis about risk differences across strata. Powers and sample size formulae based on the above two statistics are deduced. Figures about sample size against risk difference (or marginal probability) are given. A real example is used to illustrate the proposed methods.  相似文献   

4.
When a process is monitored with a T 2 control chart in a Phase II setting, the MYT decomposition is a valuable diagnostic tool for interpreting signals in terms of the process variables. The decomposition splits a signaling T 2 statistic into independent components that can be associated with either individual variables or groups of variables. Since these components are T 2 statistics with known distributions, they can be used to determine which of the process variable(s) contribute to the signal. However, this procedure cannot be applied directly to Phase I since the distributions of the individual components are unknown. In this article, we develop the MYT decomposition procedure for a Phase I operation, when monitoring a random sample of individual observations and identifying outliers. We use a relationship between the T 2 statistic in Phase I with the corresponding T 2 statistic resulting when an observation is omitted from this sample to derive the distributions of these components and demonstrate the Phase I application of the MYT decomposition.  相似文献   

5.
The identity of the Rao score and PearsonX 2 statistics is well known in the areas where the latter was first introduced: goodness-of-fit in contingency tables and binary responses. We show in this paper that the same identity holds when the two statistics are used for testing goodness-of-fit of Generalized Linear Models. We also highlight the connections that exist between the two statistics when they are used for the comparison of nested models. Finally, we discuss some merits of these unifying results. Work financially supported by cofin. MIUR grants 2000 and 2002.  相似文献   

6.
Abstract

It is common to monitor several correlated quality characteristics using the Hotelling's T 2 statistic. However, T 2 confounds the location shift with scale shift and consequently it is often difficult to determine the factors responsible for out of control signal in terms of the process mean vector and/or process covariance matrix. In this paper, we propose a diagnostic procedure called ‘D-technique’ to detect the nature of shift. For this purpose, two sets of regression equations, each consisting of regression of a variable on the remaining variables, are used to characterize the ‘structure’ of the ‘in control’ process and that of ‘current’ process. To determine the sources responsible for an out of control state, it is shown that it is enough to compare these two structures using the dummy variable multiple regression equation. The proposed method is operationally simpler and computationally advantageous over existing diagnostic tools. The technique is illustrated with various examples.  相似文献   

7.
A power study suggests that a good test of fit analysis for the binomial distribution is provided by a data-dependent Chernoff–Lehmann X 2 test with class expectations greater than unity, and its components. These data-dependent statistics involve arithmetically simple parameter estimation, convenient approximate distributions and provide a comprehensive assessment of how well the data agree with a binomial distribution. We suggest that a well-performed single test of fit statistic is the Anderson–Darling statistic.  相似文献   

8.
Data in many experiments arises as curves and therefore it is natural to use a curve as a basic unit in the analysis, which is in terms of functional data analysis (FDA). Functional curves are encountered when units are observed over time. Although the whole function curve itself is not observed, a sufficiently large number of evaluations, as is common with modern recording equipment, is assumed to be available. In this article, we consider the statistical inference for the mean functions in the two samples problem drawn from functional data sets, in which we assume that functional curves are observed, that is, we consider the test if these two groups of curves have the same mean functional curve when the two groups of curves without noise are observed. The L 2-norm based and bootstrap-based test statistics are proposed. It is shown that the proposed methodology is flexible. Simulation study and real-data examples are used to illustrate our techniques.  相似文献   

9.
10.
In statistical process control applications, the multivariate T 2 control chart based on Hotelling's T 2 statistic is useful for detecting the presence of special causes of variation. In particular, use of the T 2 statistic based on the successive differences covariance matrix estimator has been shown to be very effective in detecting the presence of a sustained step or ramp shift in the mean vector. However, the exact distribution of this statistic is unknown. In this article, we derive the maximum value of the T 2 statistic based on the successive differences covariance matrix estimator. This distributional property is crucial for calculating an approximate upper control limit of a T 2 control chart based on successive differences, as described in Williams et al. (2006 Williams , J. D. , Woodall , W. H. , Birch , J. B. , Sullivan , J. H. ( 2006 ). On the distribution of T 2 statistics based on successive differences . J. Qual. Technol. 38 : 217229 .[Taylor & Francis Online], [Web of Science ®] [Google Scholar]).  相似文献   

11.
Large sample tests for the standard To bit model versus the p -Tobit model by Deaton and Irish (1984) are studied. The normalized one-tailed score test by Deaton and Irish (1984) is shown to be a version of Neyman's C(α) test that is valid for the non-standard problem of the null hypothesis lying on the boundary of the parameter space. Then, this paper reports the results of Monte Carlo experiments designed to study the small sample performance of large sample tests for the standard Tobit specification versus the p -Tobit specification.  相似文献   

12.
We propose new multivariate control charts that can effectively deal with massive amounts of complex data through their integration with classification algorithms. We call the proposed control chart the ‘Probability of Class (PoC) chart’ because the values of PoC, obtained from classification algorithms, are used as monitoring statistics. The control limits of PoC charts are established and adjusted by the bootstrap method. Experimental results with simulated and real data showed that PoC charts outperform Hotelling's T 2 control charts. Further, a simulation study revealed that a small proportion of out-of-control observations are sufficient for PoC charts to achieve the desired performance.  相似文献   

13.
We consider here a generalization of the skew-normal distribution, GSN(λ1,λ2,ρ), defined through a standard bivariate normal distribution with correlation ρ, which is a special case of the unified multivariate skew-normal distribution studied recently by Arellano-Valle and Azzalini [2006. On the unification of families of skew-normal distributions. Scand. J. Statist. 33, 561–574]. We then present some simple and useful properties of this distribution and also derive its moment generating function in an explicit form. Next, we show that distributions of order statistics from the trivariate normal distribution are mixtures of these generalized skew-normal distributions; thence, using the established properties of the generalized skew-normal distribution, we derive the moment generating functions of order statistics, and also present expressions for means and variances of these order statistics.Next, we introduce a generalized skew-tν distribution, which is a special case of the unified multivariate skew-elliptical distribution presented by Arellano-Valle and Azzalini [2006. On the unification of families of skew-normal distributions. Scand. J. Statist. 33, 561–574] and is in fact a three-parameter generalization of Azzalini and Capitanio's [2003. Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t distribution. J. Roy. Statist. Soc. Ser. B 65, 367–389] univariate skew-tν form. We then use the relationship between the generalized skew-normal and skew-tν distributions to discuss some properties of generalized skew-tν as well as distributions of order statistics from bivariate and trivariate tν distributions. We show that these distributions of order statistics are indeed mixtures of generalized skew-tν distributions, and then use this property to derive explicit expressions for means and variances of these order statistics.  相似文献   

14.
Let T2 i=z′iS?1zi, i==,…k be correlated Hotelling's T2 statistics under normality. where z=(z′i,…,z′k)′ and nS are independently distributed as Nkp((O,ρ?∑) and Wishart distribution Wp(∑, n), respectively. The purpose of this paper is to study the distribution function F(x1,…,xk) of (T2 i,…,T2 k) when n is large. First we derive an asymptotic expansion of the characteristic function of (T2 i,…,T2 k) up to the order n?2. Next we give asymptotic expansions for (T2 i,…,T2 k) in two cases (i)ρ=Ik and (ii) k=2 by inverting the expanded characteristic function up to the orders n?2 and n?1, respectively. Our results can be applied to the distribution function of max (T2 i,…,T2 k) as a special case.  相似文献   

15.
A general class of rank statistics based on the characteristic function is introduced for testing goodness‐of‐fit hypotheses about the copula of a continuous random vector. These statistics are defined as L 2 weighted functional distances between a nonparametric estimator and a semi‐parametric estimator of the characteristic function associated with a copula. It is shown that these statistics behave asymptotically as degenerate V ‐statistics of order four and that the limit distributions have representations in terms of weighted sums of independent chi‐square variables. The consistency of the tests against general alternatives is established and an asymptotically valid parametric bootstrap is suggested for the computation of the critical values of the tests. The behaviour of the new tests in small and moderate sample sizes is investigated with the help of simulations and compared with a competing test based on the empirical copula. Finally, the methodology is illustrated on a five‐dimensional data set.  相似文献   

16.
Statistics that usually accompany the regression model do not provide insight into the quality of the data or the potential influence of the individual observations on the estimates. In this study, the Q2 statistic is used as a criterion for detecting influential observations or outliers. The statistic is derived from the jackknifed residuals, the squared sum of which is generally known as the prediction sum of squares or PRESS. This article compares R 2 with Q2 and suggests that the latter be used as part of the data-quality check. It is shown, for two separate data sets obtained from regional cost of living and U.S. food industry studies, that in the presence of outliers the Q2 statistic can be negative, because it is sensitive to the choice of regressors and the inclusion of influential observations. Once the outliers are dropped from the sample, the discrepancy between Q2 and R 2 values is negligible.  相似文献   

17.
Multivariate exponential weighted moving average and cumulative sum charts are the most common memory type multivariate control charts. They make use of the present and past information to detect small shifts in the process parameter(s). In this article, we propose two new multivariate control charts using a mixed version of their design setups. The plotting statistics of the proposed charts are based on the cumulative sum of the multivariate exponentially weighted moving averages. The performances of these schemes are evaluated in terms of average run length. The proposals are compared with their existing counterparts, including HotellingT2, MCUSUM, MEWMA, and MC1 charts. An application example is also presented for practical considerations using a real dataset.  相似文献   

18.
A recent article in this journal presented a variety of expressions for the coefficient of determination (R 2) and demonstrated that these expressions were generally not equivalent. The article discussed potential pitfalls in interpreting the R 2 statistic in ordinary least-squares regression analysis. The current article extends this discussion to the case in which regression models are fit by weighted least squares and points out an additional pitfall that awaits the unwary data analyst. We show that unthinking reliance on the R 2 statistic can lead to an overly optimistic interpretation of the proportion of variance accounted for in the regression. We propose a modification of the estimator and demonstrate its utility by example.  相似文献   

19.
Abstract

A new non linear estimator, W, for the number of valid, unique signatures on a petition has been shown better, for the cases enumerated and with certain restrictions, than a popular Goodman-type statistic, G. This article extends those results with relaxed conditions by developing the exact probability mass function and mean of W and a close approximation of the variance (Var(W)). If the proportion of valid signatures among unique and duplicated signatures is the same, then Var(W) is approximately a function of the means and variances of the two sample statistics. Using the delta method, we estimate Var(W), with the resulting approximation shown to be good, even when the condition of equal proportions does not hold. We compare W to G and establish which estimator is preferred for different intervals of the design parameters. Data from a Washington State petition illustrate the findings.  相似文献   

20.
We develop a ‘robust’ statistic T2 R, based on Tiku's (1967, 1980) MML (modified maximum likelihood) estimators of location and scale parameters, for testing an assumed meam vector of a symmetric multivariate distribution. We show that T2 R is one the whole considerably more powerful than the prominenet Hotelling T2 statistics. We also develop a robust statistic T2 D for testing that two multivariate distributions (skew or symmetric) are identical; T2 D seems to be usually more powerful than nonparametric statistics. The only assumption we make is that the marginal distributions are of the type (1/σk)f((x-μk)/σk) and the means and variances of these marginal distributions exist.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号