首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
A notion of data depth is used to measure centrality or outlyingness of a data point in a given data cloud. In the context of data depth, the point (or points) having maximum depth is called as deepest point (or points). In the present work, we propose three multi-sample tests for testing equality of location parameters of multivariate populations by using the deepest point (or points). These tests can be considered as extensions of two-sample tests based on the deepest point (or points). The proposed tests are implemented through the idea of Fisher's permutation test. Performance of earlier tests is studied by simulation. Illustration with two real datasets is also provided.  相似文献   

2.
The present paper deals with the problem of testing equality of locations of two multivariate distributions using a notion of data depth. A notion of data depth has been used to measure centrality/outlyingness of a given point in a given data cloud. The paper proposes two nonparametric tests for testing equality of locations of two multivariate populations which are developed by observing the behavior of the depth versus depth plot. Simulation study reveals that the proposed tests are superior to the existing tests based on the data depth with regard to power. Illustrations with real data are provided.  相似文献   

3.
Based on data depth, three types of nonparametric goodness-of-fit tests for multivariate distribution are proposed in this paper. They are Pearson’s chi-square test, tests based on EDF and tests based on spacings, respectively. The Anderson–Darling (AD) test and the Greenwood test for bivariate normal distribution and uniform distribution are simulated. The results of simulation show that these two tests have low type I error rates and become more efficient with the increase in sample size. The AD-type test performs more powerfully than the Greenwood type test.  相似文献   

4.
Li and Liu [New nonparametric tests of multivariate locations and scales. Statist Sci. 2004;19(4):686–696] introduced two tests for a difference in locations of two multivariate distributions based on the concept of data depth. Using the simplicial depth [Liu RY. On a notion of data depth based on random simplices. Ann Stat. 1990;18(1):405–414], they studied the performance of these tests for symmetric distributions, namely, the normal and the Cauchy, in a simulation study. However, to the best of our knowledge, the performance of these tests for skewed distributions has not been studied in the current literature. This paper is a contribution in that direction and examines the performance of these depth-based tests in an extensive simulation study involving ten distributions belonging to five well-known families of multivariate skewed distributions. The study includes a comparison of the performance of these tests for four popular affine-invariant depth functions. Conclusions and recommendations are offered.  相似文献   

5.
Bhattacharyya and Kioiz (1966) propose two multivariate nonparametric tests for monotone trend, one involving coordinate-wise Mann statistics and the other, coordinate-wise Spearman statistics. Dietz and Killeen (1981) propose a different test statistic based on coordinate-wise Mann statistics. The Pitman asymptotic relative efficiency of all three tests with respect to a normal theory competitor equals the cube root of the efficiency of a multivariate signed rank test with respect to Hotelling's T2. In this article, the small sample power of the nonparametric tests, the normal theory test, and a Bonferroni approach involving coordinate-wise univariate Mann or Spearman tests is examined in a simulation study. The Mann statistic of Dietz and Killeen and the Spearman statistic of Bhattacharyya and Klotz are found to perform well under both null and alternative hypotheses  相似文献   

6.
In this paper, we consider a nonparametric test procedure for multivariate data with grouped components under the two sample problem setting. For the construction of the test statistic, we use linear rank statistics which were derived by applying the likelihood ratio principle for each component. For the null distribution of the test statistic, we apply the permutation principle for small or moderate sample sizes and derive the limiting distribution for the large sample case. Also we illustrate our test procedure with an example and compare with other procedures through simulation study. Finally, we discuss some additional interesting features as concluding remarks.  相似文献   

7.
Summary: This paper studies the DDMA–chart, a data depth based moving–average control chart for monitoring multivariate data. This chart is nonparametric and it can detect simultaneously location and scale changes in the process. It improves upon the existing r– and Q–chart in the efficiency of detecting location changes. Both theoretical justifications and simulation studies are provided. Comparisons with some existing multivariate control charts via simulation results are also provided. Some applications of the DDMA–chart to the analysis of airline performance data (collected by the FAA) are demonstrated. The results indicate that the DDMA–chart is an effective nonparametric multivariate control chart.*Research supported in part by grants from the National Science Foundation, the National Security Agency, and the Federal Aviation Administration. The discussion on aviation safety in this paper reects the views of the authors, who are solely responsible for the accuracy of the analysis results presented herein, and does not necessarily reect the official view or policy of the FAA. The dataset used in this paper has been partially masked in order to protect confidentiality.  相似文献   

8.
Modern methods for detecting changes in the scale or covariance of multivariate distributions rely primarily on testing for the constancy of the covariance matrix. These depend on higher-order moment conditions, and also do not work well when the dimension of the data is large or even moderate relative to the sample size. In this paper, we propose a nonparametric change point test for multivariate data using rankings obtained from data depth measures. As the data depth of an observation measures its centrality relative to the sample, changes in data depth may signify a change of scale of the underlying distribution, and the proposed test is particularly responsive to detecting such changes. We provide a full asymptotic theory for the proposed test statistic under the null hypothesis that the observations are stable, and natural conditions under which the test is consistent. The finite sample properties are investigated by means of a Monte Carlo simulation, and these along with the theoretical results confirm that the test is robust to heavy tails, skewness and high dimensionality. The proposed methods are demonstrated with an application to structural break detection in the rate of change of pollutants linked to acid rain measured in Turkey lake, a lake in central Ontario, Canada. Our test suggests a change in the rate of acid rain in the late 1980s/early 1990s, which coincides with clean air legislation in Canada and the US. The Canadian Journal of Statistics 48: 417–446; 2020 © 2020 Statistical Society of Canada  相似文献   

9.
This paper considers two-sample nonparametric comparison of survival function when data are subject to left truncation and interval censoring. We propose a class of rank-based tests, which are generalization of weighted log-rank tests for right-censored data. Simulation studies indicate that the proposed tests are appropriate for practical use.  相似文献   

10.
Liu and Singh (1993, 2006) introduced a depth‐based d‐variate extension of the nonparametric two sample scale test of Siegel and Tukey (1960). Liu and Singh (2006) generalized this depth‐based test for scale homogeneity of k ≥ 2 multivariate populations. Motivated by the work of Gastwirth (1965), we propose k sample percentile modifications of Liu and Singh's proposals. The test statistic is shown to be asymptotically normal when k = 2, and compares favorably with Liu and Singh (2006) if the underlying distributions are either symmetric with light tails or asymmetric. In the case of skewed distributions considered in this paper the power of the proposed tests can attain twice the power of the Liu‐Singh test for d ≥ 1. Finally, in the k‐sample case, it is shown that the asymptotic distribution of the proposed percentile modified Kruskal‐Wallis type test is χ2 with k ? 1 degrees of freedom. Power properties of this k‐sample test are similar to those for the proposed two sample one. The Canadian Journal of Statistics 39: 356–369; 2011 © 2011 Statistical Society of Canada  相似文献   

11.
In first-level analyses of functional magnetic resonance imaging data, adjustments for temporal correlation as a Satterthwaite approximation or a prewhitening method are usually implemented in the univariate model to keep the nominal test level. In doing so, the temporal correlation structure of the data is estimated, assuming an autoregressive process of order one.We show that this is applicable in multivariate approaches too - more precisely in the so-called stabilized multivariate test statistics. Furthermore, we propose a block-wise permutation method including a random shift that renders an approximation of the temporal correlation structure unnecessary but also approximately keeps the nominal test level in spite of the dependence of sample elements.Although the intentions are different, a comparison of the multivariate methods with the multiple ones shows that the global approach may achieve advantages if applied to suitable regions of interest. This is illustrated using an example from fMRI studies.  相似文献   

12.
We propose a new nonparametric estimator for the density function of multivariate bounded data. As frequently observed in practice, the variables may be partially bounded (e.g. nonnegative) or completely bounded (e.g. in the unit interval). In addition, the variables may have a point mass. We reduce the conditions on the underlying density to a minimum by proposing a nonparametric approach. By using a gamma, a beta, or a local linear kernel (also called boundary kernels), in a product kernel, the suggested estimator becomes simple in implementation and robust to the well known boundary bias problem. We investigate the mean integrated squared error properties, including the rate of convergence, uniform strong consistency and asymptotic normality. We establish consistency of the least squares cross-validation method to select optimal bandwidth parameters. A detailed simulation study investigates the performance of the estimators. Applications using lottery and corporate finance data are provided.  相似文献   

13.
Mardia's multivariate kurtosis and the generalized distance have desirable properties as multivariate outlier tests. However, extensive critical values have not been published heretofore. A published approximation formula for critical values of the kurtosis is shown to inadequately control the type I error rate, with observed error rates often differing from their intended values by a factor of two or more. Critical values derived from simulations for both tests for up to 25 dimensions and 500 observations are presented. The power curves of both tests are discussed. The generalized distance is the more powerful test when exactly one outlier is present and the contaminant is substantially mean-shifted. However, as the number of outliers increases, the kurtosis becomes the more powerful test. The two tests are compared with respect to power and vulnerability to masking. Recommendations for the use of these tests and interpretation of results are given.  相似文献   

14.
In the last few years, two adaptive tests for paired data have been proposed. One test proposed by Freidlin et al. [On the use of the Shapiro–Wilk test in two-stage adaptive inference for paired data from moderate to very heavy tailed distributions, Biom. J. 45 (2003), pp. 887–900] is a two-stage procedure that uses a selection statistic to determine which of three rank scores to use in the computation of the test statistic. Another statistic, proposed by O'Gorman [Applied Adaptive Statistical Methods: Tests of Significance and Confidence Intervals, Society for Industrial and Applied Mathematics, Philadelphia, 2004], uses a weighted t-test with the weights determined by the data. These two methods, and an earlier rank-based adaptive test proposed by Randles and Hogg [Adaptive Distribution-free Tests, Commun. Stat. 2 (1973), pp. 337–356], are compared with the t-test and to Wilcoxon's signed-rank test. For sample sizes between 15 and 50, the results show that the adaptive test proposed by Freidlin et al. and the adaptive test proposed by O'Gorman have higher power than the other tests over a range of moderate to long-tailed symmetric distributions. The results also show that the test proposed by O'Gorman has greater power than the other tests for short-tailed distributions. For sample sizes greater than 50 and for small sample sizes the adaptive test proposed by O'Gorman has the highest power for most distributions.  相似文献   

15.
In this paper it is shown that data depth does not only provide consistent and robust estimators but also consistent and robust tests. Thereby, consistency of a test means that the Type I (αα) error and the Type II (ββ) error converge to zero with growing sample size in the interior of the nullhypothesis and the alternative, respectively. Robustness is measured by the breakdown point which depends here on a so-called concentration parameter. The consistency and robustness properties are shown for cases where the parameter of maximum depth is a biased estimator and has to be corrected. This bias is a disadvantage for estimation but an advantage for testing. It causes that the corresponding simplicial depth is not a degenerated U-statistic so that tests can be derived easily. However, the straightforward tests have a very poor power although they are asymptotic α-levelα-level tests. To improve the power, a new method is presented to modify these tests so that even consistency of the modified tests is achieved. Examples of two-dimensional copulas and the Weibull distribution show the applicability of the new method.  相似文献   

16.
17.
Four generic means of conducting randomization tests in the context of multiple regression are analysed. Based on their performance in traditional repeated samples, three of these are shown to be inappropriate or applicable only in special circumstances; their shortcomings are illustrated via Monte Carlo studies  相似文献   

18.
In this paper, we propose a new measure of fit which can be used in the case of quantile–quantile plots. This measure, when applied to Small's and Srivastava's graphical methods provides two new tests for assessing multivariate normality. For different sample sizes and numbers of variables, the critical values of these tests were evaluated via simulations. The power of the new tests and its comparison with some other tests for multivariate normality are presented herein.  相似文献   

19.
It is well known that if a multivariate outlier has one or more missing component values, then multiple imputation (MI) methods tend to impute nonextreme values and make the outlier become less extreme and less likely to be detected. In this paper, nonparametric depth-based multivariate outlier identifiers are used as criteria in a numerical study comparing several established methods of MI as well as a new proposed one, nine in all, in a setting of several actual clinical laboratory data sets of different dimensions. Two criteria, an ‘outlier recovery probability’ and a ‘relative accuracy measure’, are developed, based on depth functions. Three outlier identifiers, based on Mahalanobis distance, robust Mahalanobis distance, and generalized principle component analysis are also included in the study. Consequently, not only the comparison of imputation methods but also the comparison of outlier detection methods is accomplished in this study. Our findings show that the performance of an MI method depends on the choice of depth-based outlier detection criterion, as well as the size and dimension of the data and the fraction of missing components. By taking these features into account, an MI method for a given data set can be selected more optimally.  相似文献   

20.
Two different two-sample tests for dispersion differences based on placement statistics are proposed. The means and variances of the test statistics are derived, and asymptotic normality is established for both. Variants of the proposed tests based on reversing the X and Y labels in the test statistic calculations are shown to have different small-sample properties; for both pairs of tests, one member of the pair will be resolving, the other nonresolving. The proposed tests are similar in spirit to the dispersion tests of both Mood and Hollander; comparative simulation results for these four tests are given. For small sample sizes, the powers of the proposed tests are approximately equal to the powers of the tests of both Mood and Hollander for samples from the normal, Cauchy and exponential distributions. The one-sample limiting distributions are also provided, yielding useful approximations to the exact tests when one sample is much larger than the other. A bootstrap test may alternatively be performed. The proposed test statistics may be used with lightly censored data by substituting Kaplan-Meier estimates for the empirical distribution functions.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号