1.

An Asymptotic Characterization of Finite Degree U-statistics With Sample Size-Dependent Kernels: Applications to Nonparametric Estimators and Test Statistics

Feng Yao 《Communications in Statistics - Theory and Methods》2013,42(15):3251-3265

We provide a simple result on the H-decomposition of a U-statistic that allows for easy determination of its magnitude when the statistic’s kernel depends on the sample size n. The result provides a direct and convenient method to characterize the asymptotic magnitude of semiparametric and nonparametric estimators or test statistics involving high dimensional sums. We illustrate the use of our result in previously studied estimators/test statistics and in a novel nonparametric R² test for overall significance of a nonparametric regression model.

2.

Tests for Symmetry Based on the One-Sample Wilcoxon Signed Rank Statistic

O. Thas, J. C. W. Rayner, D. J. Best 《Communications in Statistics - Simulation and Computation》2013,42(4):957-973

The one-sample Wilcoxon signed rank test was originally designed to test for a specified median, under the assumption that the distribution is symmetric, but it can also serve as a test for symmetry if the median is known. In this article we derive the Wilcoxon statistic as the first component of Pearson's X² statistic for independence in a particularly constructed contingency table. The second and third components are new test statistics for symmetry. In the second part of the article, the Wilcoxon test is extended so that symmetry around the median and symmetry in the tails can be examined separately. A trimming proportion is used to split the observations in the tails from those around the median. We further extend the method so that no arbitrary choice for the trimming proportion has to be made. Finally, the new tests are compared to other tests for symmetry in a simulation study. It is concluded that our tests often have substantially greater powers than most other tests.
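As a rough illustration of the statistic this abstract starts from, the sketch below computes the one-sample Wilcoxon signed rank statistic W+ about a known median, together with its usual large-sample standardization. It is not the authors' component-based procedure. The function name `signed_rank` and the parameter `m0` are hypothetical, zero differences are dropped, tied absolute differences receive midranks, and no tie correction is applied to the variance.

```python
import math


def signed_rank(xs, m0=0.0):
    """Return (W+, z) for the one-sample Wilcoxon signed rank statistic.

    m0 is the hypothesized (known) median. Differences equal to zero are
    dropped; tied absolute differences share their midrank. The z value
    uses the null mean n(n+1)/4 and variance n(n+1)(2n+1)/24, with no
    correction for ties (an assumption made for brevity).
    """
    d = [x - m0 for x in xs if x != m0]
    n = len(d)
    # Indices of the differences, ordered by absolute value.
    order = sorted(range(n), key=lambda i: abs(d[i]))
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over a run of tied absolute differences.
        while j + 1 < n and abs(d[order[j + 1]]) == abs(d[order[i]]):
            j += 1
        midrank = (i + j) / 2 + 1  # average of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = midrank
        i = j + 1
    w_plus = sum(r for r, di in zip(ranks, d) if di > 0)
    mean = n * (n + 1) / 4
    var = n * (n + 1) * (2 * n + 1) / 24
    return w_plus, (w_plus - mean) / math.sqrt(var)


w, z = signed_rank([1.5, -0.5, 2.0, -1.0, 3.0], m0=0.0)
print(w, round(z, 4))
```

Under symmetry about m0, W+ should sit near n(n+1)/4, so a large |z| is evidence against either the stated median or symmetry, which is the dual use the abstract describes.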

3.

Homogeneity Test of Risk Differences of Marginal and Conditional Probabilities in Several Incomplete Correlated 2 × 2 Tables

Shun-Fang Wang, Xue-Ren Wang 《Communications in Statistics - Theory and Methods》2013,42(16):2877-2890

This article considers K pairs of incomplete correlated 2 × 2 tables in which the interesting measurement is the risk difference between marginal and conditional probabilities. A Wald-type statistic and a score-type statistic are presented to test the homogeneity hypothesis about risk differences across strata. Powers and sample size formulae based on the above two statistics are deduced. Figures about sample size against risk difference (or marginal probability) are given. A real example is used to illustrate the proposed methods.

4.

Identifying Variables Contributing to Outliers in Phase I

Robert L. Mason, Youn-Min Chou, John C. Young 《Communications in Statistics - Theory and Methods》2013,42(7):1103-1118

When a process is monitored with a T² control chart in a Phase II setting, the MYT decomposition is a valuable diagnostic tool for interpreting signals in terms of the process variables. The decomposition splits a signaling T² statistic into independent components that can be associated with either individual variables or groups of variables. Since these components are T² statistics with known distributions, they can be used to determine which of the process variable(s) contribute to the signal. However, this procedure cannot be applied directly to Phase I since the distributions of the individual components are unknown. In this article, we develop the MYT decomposition procedure for a Phase I operation, when monitoring a random sample of individual observations and identifying outliers. We use a relationship between the T² statistic in Phase I with the corresponding T² statistic resulting when an observation is omitted from this sample to derive the distributions of these components and demonstrate the Phase I application of the MYT decomposition.

5.

On Rao score and Pearson X² statistics in generalized linear models

Gianfranco Lovison 《Statistical Papers》2005,46(4):555-574

The identity of the Rao score and Pearson X² statistics is well known in the areas where the latter was first introduced: goodness-of-fit in contingency tables and binary responses. We show in this paper that the same identity holds when the two statistics are used for testing goodness-of-fit of Generalized Linear Models. We also highlight the connections that exist between the two statistics when they are used for the comparison of nested models. Finally, we discuss some merits of these unifying results. Work financially supported by cofin. MIUR grants 2000 and 2002.

6.

Diagnosis of Multivariate Control Chart Signal Based on Dummy Variable Regression Technique

《Communications in Statistics - Theory and Methods》2013,42(8):1665-1684

It is common to monitor several correlated quality characteristics using Hotelling's T² statistic. However, T² confounds the location shift with the scale shift, and consequently it is often difficult to determine the factors responsible for an out-of-control signal in terms of the process mean vector and/or process covariance matrix. In this paper, we propose a diagnostic procedure called ‘D-technique’ to detect the nature of the shift. For this purpose, two sets of regression equations, each consisting of the regression of a variable on the remaining variables, are used to characterize the ‘structure’ of the ‘in control’ process and that of the ‘current’ process. To determine the sources responsible for an out-of-control state, it is shown that it is enough to compare these two structures using the dummy variable multiple regression equation. The proposed method is operationally simpler than existing diagnostic tools and computationally advantageous over them. The technique is illustrated with various examples.

7.

Improved testing for the binomial distribution using chi-squared components with data-dependent cells

《Journal of Statistical Computation and Simulation》2012,82(1):75-81

A power study suggests that a good test of fit analysis for the binomial distribution is provided by a data-dependent Chernoff–Lehmann X² test with class expectations greater than unity, and its components. These data-dependent statistics involve arithmetically simple parameter estimation and convenient approximate distributions, and they provide a comprehensive assessment of how well the data agree with a binomial distribution. We suggest that a well-performing single test of fit statistic is the Anderson–Darling statistic.

8.

Two Samples Tests for Functional Data

Chongqi Zhang, Heng Peng, Jin-Ting Zhang 《Communications in Statistics - Theory and Methods》2013,42(4):559-578

Data in many experiments arise as curves, and it is therefore natural to use a curve as the basic unit in the analysis, which is the viewpoint of functional data analysis (FDA). Functional curves are encountered when units are observed over time. Although the whole function curve itself is not observed, a sufficiently large number of evaluations, as is common with modern recording equipment, is assumed to be available. In this article, we consider statistical inference for the mean functions in the two-sample problem for functional data sets, in which we assume that functional curves are observed; that is, we test whether the two groups of curves have the same mean functional curve when the curves are observed without noise. The L²-norm-based and bootstrap-based test statistics are proposed. It is shown that the proposed methodology is flexible. A simulation study and real-data examples are used to illustrate our techniques.

9.

Quantifying R² bias in the presence of measurement error

Karl D. Majeske, Terri Lynch-Caris, Janet Brelin-Fornari 《Journal of Applied Statistics》2010,37(4):667-677

10.

Maximum Value of Hotelling's T² Statistics Based on the Successive Differences Covariance Matrix Estimator

James D. Williams, Joe H. Sullivan, Jeffrey B. Birch 《Communications in Statistics - Theory and Methods》2013,42(4):471-483

In statistical process control applications, the multivariate T² control chart based on Hotelling's T² statistic is useful for detecting the presence of special causes of variation. In particular, use of the T² statistic based on the successive differences covariance matrix estimator has been shown to be very effective in detecting the presence of a sustained step or ramp shift in the mean vector. However, the exact distribution of this statistic is unknown. In this article, we derive the maximum value of the T² statistic based on the successive differences covariance matrix estimator. This distributional property is crucial for calculating an approximate upper control limit of a T² control chart based on successive differences, as described in Williams et al. [2006. On the distribution of T² statistics based on successive differences. J. Qual. Technol. 38, 217–229].
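For orientation, the sketch below computes T² values using a successive differences covariance estimator in the form commonly given in the control chart literature, S_D = (1 / (2(m − 1))) Σ v_i v_i′ with v_i = x_{i+1} − x_i. That form is assumed here, since the abstract does not spell it out. The code is restricted to two variables so the matrix inverse can be written by hand, and the function names are hypothetical. It does not reproduce the maximum value or the control limit derived in the article.

```python
def successive_difference_cov(xs):
    """Bivariate successive differences covariance estimate.

    xs is a list of (x1, x2) observations in time order. The estimate
    is sum(v_i v_i') / (2 (m - 1)), where v_i = x_{i+1} - x_i (assumed
    form; see the lead-in).
    """
    m = len(xs)
    s = [[0.0, 0.0], [0.0, 0.0]]
    for (a0, a1), (b0, b1) in zip(xs, xs[1:]):
        v = (b0 - a0, b1 - a1)
        for r in range(2):
            for c in range(2):
                s[r][c] += v[r] * v[c] / (2.0 * (m - 1))
    return s


def t2_values(xs):
    """T2_i = (x_i - mean)' S_D^{-1} (x_i - mean) for each observation."""
    m = len(xs)
    mean = (sum(x[0] for x in xs) / m, sum(x[1] for x in xs) / m)
    s = successive_difference_cov(xs)
    det = s[0][0] * s[1][1] - s[0][1] * s[1][0]
    # Closed-form inverse of a 2 x 2 matrix.
    inv = [[s[1][1] / det, -s[0][1] / det],
           [-s[1][0] / det, s[0][0] / det]]
    out = []
    for x in xs:
        d = (x[0] - mean[0], x[1] - mean[1])
        out.append(d[0] * (inv[0][0] * d[0] + inv[0][1] * d[1])
                   + d[1] * (inv[1][0] * d[0] + inv[1][1] * d[1]))
    return out


data = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
print(t2_values(data))
```

Because S_D is built from differences of neighbouring observations, a sustained step or ramp in the mean inflates it far less than it inflates the pooled sample covariance, which is one way to read the sensitivity claim in the abstract.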

11.

Small sample performance of large sample tests for tobit versus p-tobit

《Journal of Statistical Computation and Simulation》2012,82(1-2):89-97

Large sample tests for the standard Tobit model versus the p-Tobit model by Deaton and Irish (1984) are studied. The normalized one-tailed score test by Deaton and Irish (1984) is shown to be a version of Neyman's C(α) test that is valid for the non-standard problem of the null hypothesis lying on the boundary of the parameter space. Then, this paper reports the results of Monte Carlo experiments designed to study the small sample performance of large sample tests for the standard Tobit specification versus the p-Tobit specification.

12.

Integration of classification algorithms and control chart techniques for monitoring multivariate processes

《Journal of Statistical Computation and Simulation》2012,82(12):1897-1911

We propose new multivariate control charts that can effectively deal with massive amounts of complex data through their integration with classification algorithms. We call the proposed control chart the ‘Probability of Class (PoC) chart’ because the values of PoC, obtained from classification algorithms, are used as monitoring statistics. The control limits of PoC charts are established and adjusted by the bootstrap method. Experimental results with simulated and real data showed that PoC charts outperform Hotelling's T² control charts. Further, a simulation study revealed that a small proportion of out-of-control observations is sufficient for PoC charts to achieve the desired performance.

13.

Order statistics from trivariate normal and t_ν-distributions in terms of generalized skew-normal and skew-t_ν distributions (total citations: 1; self-citations: 0; citations by others: 1)

A. Jamalizadeh, N. Balakrishnan 《Journal of Statistical Planning and Inference》2009,139(11):3799

We consider here a generalization of the skew-normal distribution, GSN(λ₁,λ₂,ρ), defined through a standard bivariate normal distribution with correlation ρ, which is a special case of the unified multivariate skew-normal distribution studied recently by Arellano-Valle and Azzalini [2006. On the unification of families of skew-normal distributions. Scand. J. Statist. 33, 561–574]. We then present some simple and useful properties of this distribution and also derive its moment generating function in an explicit form. Next, we show that distributions of order statistics from the trivariate normal distribution are mixtures of these generalized skew-normal distributions; thence, using the established properties of the generalized skew-normal distribution, we derive the moment generating functions of order statistics, and also present expressions for means and variances of these order statistics. Next, we introduce a generalized skew-t_ν distribution, which is a special case of the unified multivariate skew-elliptical distribution presented by Arellano-Valle and Azzalini [2006. On the unification of families of skew-normal distributions. Scand. J. Statist. 33, 561–574] and is in fact a three-parameter generalization of Azzalini and Capitanio's [2003. Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t distribution. J. Roy. Statist. Soc. Ser. B 65, 367–389] univariate skew-t_ν form. We then use the relationship between the generalized skew-normal and skew-t_ν distributions to discuss some properties of generalized skew-t_ν as well as distributions of order statistics from bivariate and trivariate t_ν distributions. We show that these distributions of order statistics are indeed mixtures of generalized skew-t_ν distributions, and then use this property to derive explicit expressions for means and variances of these order statistics.

14.

Asymptotic expansions for the joint distribution of correlated Hotelling's T² statistics under normality

Yasunori Fujikoshi, Takashi Seo 《Communications in Statistics - Theory and Methods》2013,42(3-4):773-788

Let T²_i = z′_i S⁻¹ z_i, i = 1, …, k, be correlated Hotelling's T² statistics under normality, where z = (z′_1, …, z′_k)′ and nS are independently distributed as N_kp(0, ρ⊗Σ) and the Wishart distribution W_p(Σ, n), respectively. The purpose of this paper is to study the distribution function F(x₁, …, x_k) of (T²_1, …, T²_k) when n is large. First we derive an asymptotic expansion of the characteristic function of (T²_1, …, T²_k) up to the order n⁻². Next we give asymptotic expansions for (T²_1, …, T²_k) in two cases, (i) ρ = I_k and (ii) k = 2, by inverting the expanded characteristic function up to the orders n⁻² and n⁻¹, respectively. Our results can be applied to the distribution function of max(T²_1, …, T²_k) as a special case.

15.

A Family of Goodness‐of‐Fit Tests for Copulas Based on Characteristic Functions

《Scandinavian Journal of Statistics》2018,45(2):301-323

A general class of rank statistics based on the characteristic function is introduced for testing goodness‐of‐fit hypotheses about the copula of a continuous random vector. These statistics are defined as L₂ weighted functional distances between a nonparametric estimator and a semi‐parametric estimator of the characteristic function associated with a copula. It is shown that these statistics behave asymptotically as degenerate V‐statistics of order four and that the limit distributions have representations in terms of weighted sums of independent chi‐square variables. The consistency of the tests against general alternatives is established and an asymptotically valid parametric bootstrap is suggested for the computation of the critical values of the tests. The behaviour of the new tests in small and moderate sample sizes is investigated with the help of simulations and compared with a competing test based on the empirical copula. Finally, the methodology is illustrated on a five‐dimensional data set.

16.

The Prediction Sum of Squares as a General Measure for Regression Diagnostics

Nguyen T. Quan 《Journal of Business & Economic Statistics》2013,31(4):501-504

Statistics that usually accompany the regression model do not provide insight into the quality of the data or the potential influence of the individual observations on the estimates. In this study, the Q² statistic is used as a criterion for detecting influential observations or outliers. The statistic is derived from the jackknifed residuals, the squared sum of which is generally known as the prediction sum of squares or PRESS. This article compares R² with Q² and suggests that the latter be used as part of the data-quality check. It is shown, for two separate data sets obtained from regional cost of living and U.S. food industry studies, that in the presence of outliers the Q² statistic can be negative, because it is sensitive to the choice of regressors and the inclusion of influential observations. Once the outliers are dropped from the sample, the discrepancy between Q² and R² values is negligible.
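The comparison in this abstract can be sketched for a simple linear regression by refitting the line with each observation left out. The definition Q² = 1 − PRESS/SST is the commonly used one and is an assumption here, as the abstract does not state the formula. The function names and the toy data are hypothetical and are not the cost of living or food industry data from the article.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = a + b * x; returns (a, b)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def r2_and_q2(xs, ys):
    """Return (R2, Q2), with Q2 = 1 - PRESS / SST (assumed definition)."""
    n = len(xs)
    my = sum(ys) / n
    sst = sum((y - my) ** 2 for y in ys)
    a, b = fit_line(xs, ys)
    sse = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    press = 0.0
    for i in range(n):
        # Refit without observation i, then predict it (jackknifed residual).
        a_i, b_i = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        press += (ys[i] - (a_i + b_i * xs[i])) ** 2
    return 1 - sse / sst, 1 - press / sst


# Five roughly linear points plus one high-leverage outlier at x = 20.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 20.0]
ys = [1.1, 1.9, 3.2, 3.9, 5.1, 0.0]
r2, q2 = r2_and_q2(xs, ys)
print(r2, q2)
```

In this toy data the outlier alone contributes a squared prediction error larger than the total sum of squares, so Q² is negative while R² stays between 0 and 1; dropping the last point brings the two values close together.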

17.

Mixed multivariate EWMA-CUSUM control charts for an improved process monitoring

Jimoh Olawale Ajadi 《Communications in Statistics - Theory and Methods》2017,46(14):6980-6993

Multivariate exponentially weighted moving average and cumulative sum charts are the most common memory-type multivariate control charts. They make use of the present and past information to detect small shifts in the process parameter(s). In this article, we propose two new multivariate control charts using a mixed version of their design setups. The plotting statistics of the proposed charts are based on the cumulative sum of the multivariate exponentially weighted moving averages. The performances of these schemes are evaluated in terms of average run length. The proposals are compared with their existing counterparts, including Hotelling T², MCUSUM, MEWMA, and MC1 charts. An application example is also presented for practical considerations using a real dataset.

18.

Another Cautionary Note about R²: Its Use in Weighted Least-Squares Regression Analysis

John B. Willett, Judith D. Singer 《The American Statistician》2013,67(3):236-238

A recent article in this journal presented a variety of expressions for the coefficient of determination (R²) and demonstrated that these expressions were generally not equivalent. The article discussed potential pitfalls in interpreting the R² statistic in ordinary least-squares regression analysis. The current article extends this discussion to the case in which regression models are fit by weighted least squares and points out an additional pitfall that awaits the unwary data analyst. We show that unthinking reliance on the R² statistic can lead to an overly optimistic interpretation of the proportion of variance accounted for in the regression. We propose a modification of the estimator and demonstrate its utility by example.

19.

The sampling distribution of the W estimator of the number of valid signatures on a petition

Mark E. Eakin, Mary M. Whiteside 《Communications in Statistics - Theory and Methods》2013,42(5):1224-1240

A new nonlinear estimator, W, for the number of valid, unique signatures on a petition has been shown to be better, for the cases enumerated and with certain restrictions, than a popular Goodman-type statistic, G. This article extends those results with relaxed conditions by developing the exact probability mass function and mean of W and a close approximation of the variance (Var(W)). If the proportion of valid signatures among unique and duplicated signatures is the same, then Var(W) is approximately a function of the means and variances of the two sample statistics. Using the delta method, we estimate Var(W), with the resulting approximation shown to be good, even when the condition of equal proportions does not hold. We compare W to G and establish which estimator is preferred for different intervals of the design parameters. Data from a Washington State petition illustrate the findings.

20.

Robust statistics for testing mean vectors of multivariate distributions

M.L. Tiku, M. Singh 《Communications in Statistics - Theory and Methods》2013,42(9):985-1001

We develop a ‘robust’ statistic T²_R, based on Tiku's (1967, 1980) MML (modified maximum likelihood) estimators of location and scale parameters, for testing an assumed mean vector of a symmetric multivariate distribution. We show that T²_R is on the whole considerably more powerful than the prominent Hotelling T² statistics. We also develop a robust statistic T²_D for testing that two multivariate distributions (skew or symmetric) are identical; T²_D seems to be usually more powerful than nonparametric statistics. The only assumption we make is that the marginal distributions are of the type (1/σ_k)f((x-μ_k)/σ_k) and the means and variances of these marginal distributions exist.