期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Thomas W. O'GORMAN 《Revue canadienne de statistique》1997,25(2):269-279

An adaptive test is proposed for the one-way layout. This test procedure uses the order statistics of the combined data to obtain estimates of percentiles, which are used to select an appropriate set of rank scores for the one-way test statistic. This test is designed to have reasonably high power over a range of distributions. The adaptive procedure proposed for a one-way layout is a generalization of an existing two-sample adaptive test procedure. In this Monte Carlo study, the power and significance level of the F-test, the Kruskal-Wallis test, the normal scores test, and the adaptive test were evaluated for the one-way layout. All tests maintained their significance level for data sets having at least 24 observations. The simulation results show that the adaptive test is more powerful than the other tests for skewed distributions if the total number of observations equals or exceeds 24. For data sets having at least 60 observations the adaptive test is also more powerful than the F-test for some symmetric distributions. 相似文献

2.

Moran's I statistic-based nonparametric test with spatio-temporal observations

Y. Xiong D. Bingham W. J. Braun 《Journal of nonparametric statistics》2019,31(1):244-267

Moran's I statistic [Moran, (1950), ‘Notes on Continuous Stochastic Phenomena’, Biometrika, 37, 17–23] has been widely used to evaluate spatial autocorrelation. This paper is concerned with Moran's I-induced testing procedure in residual analysis. We begin with exploring the Moran's I statistic in both its original and extended forms analytically and numerically. We demonstrate that the magnitude of the statistic in general depends not only on the underlying correlation but also on certain heterogeneity in the individual observations. One should exercise caution when interpreting the outcome on correlation by the Moran's I-induced procedure. On the other hand, the effect on the Moran's I due to heterogeneity in the observations enables a regression model checking procedure with the residuals. This novel application of Moran's I is justified by simulation and illustrated by an analysis of wildfire records from Alberta, Canada. 相似文献

3.

A general algorithm for computing simultaneous prediction intervals for the (log)-location-scale family of distributions

Yimeng Xie Luis A. Escobar William Q. Meeker 《Journal of Statistical Computation and Simulation》2017,87(8):1559-1576

Making predictions of future realized values of random variables based on currently available data is a frequent task in statistical applications. In some applications, the interest is to obtain a two-sided simultaneous prediction interval (SPI) to contain at least k out of m future observations with a certain confidence level based on n previous observations from the same distribution. A closely related problem is to obtain a one-sided upper (or lower) simultaneous prediction bound (SPB) to exceed (or be exceeded) by at least k out of m future observations. In this paper, we provide a general approach for computing SPIs and SPBs based on data from a particular member of the (log)-location-scale family of distributions with complete or right censored data. The proposed simulation-based procedure can provide exact coverage probability for complete and Type II censored data. For Type I censored data, our simulation results show that our procedure provides satisfactory results in small samples. We use three applications to illustrate the proposed simultaneous prediction intervals and bounds. 相似文献

4.

On robustness of the Shiryaev–Roberts change-point detection procedure under parameter misspecification in the post-change distribution

Wenyu Du Grigory Sokolov 《统计学通讯:模拟与计算》2017,46(3):2185-2206

The gist of the quickest change-point detection problem is to detect the presence of a change in the statistical behavior of a series of sequentially made observations, and do so in an optimal detection-speed-versus-“false-positive”-risk manner. When optimality is understood either in the generalized Bayesian sense or as defined in Shiryaev's multi-cyclic setup, the so-called Shiryaev–Roberts (SR) detection procedure is known to be the “best one can do”, provided, however, that the observations’ pre- and post-change distributions are both fully specified. We consider a more realistic setup, viz. one where the post-change distribution is assumed known only up to a parameter, so that the latter may be misspecified. The question of interest is the sensitivity (or robustness) of the otherwise “best” SR procedure with respect to a possible misspecification of the post-change distribution parameter. To answer this question, we provide a case study where, in a specific Gaussian scenario, we allow the SR procedure to be “out of tune” in the way of the post-change distribution parameter, and numerically assess the effect of the “mistuning” on Shiryaev's (multi-cyclic) Stationary Average Detection Delay delivered by the SR procedure. The comprehensive quantitative robustness characterization of the SR procedure obtained in the study can be used to develop the respective theory as well as to provide a rational for practical design of the SR procedure. The overall qualitative conclusion of the study is an expected one: the SR procedure is less (more) robust for less (more) contrast changes and for lower (higher) levels of the false alarm risk. 相似文献

5.

Consistency of the Subsample Bootstrap empirical process

Markus Pauly 《Statistics》2013,47(5):621-626

In the classical Bootstrap approach the number of distinct observation in the resample is random. To overcome this hitch Rao et al. [Bootstrap by sequential resampling, J. Statist. Plan. Inference 64 (1997), pp. 257–281] have proposed a modified resampling procedure – the so-called Sequential Bootstrap or 0.632-Bootstrap – in which each resample has exactly the same number m ～eq ?0.632 n? of distinct observations. Motivated by this idea we introduce an akin procedure, the Subsample Bootstrap, where additionally even the size of each resample is equal. It will turn out that the Subsample Bootstrap empirical process is consistent for a wide class of Donsker classes. 相似文献

6.

Monitoring Change in Persistence Against the Null of Difference-Stationarity in Infinite Variance Observations

Zhanshou Chen 《统计学通讯:模拟与计算》2015,44(1):71-87

In this article, we propose a moving kernel-weighted variance ratio statistic to monitor persistence change in infinite variance observations. We focus on I(1) to I(0) persistence change for sequences in the domain of attraction of a stable law and local-to-finite variance sequences. The null distribution of the monitoring statistic and its consistency are proved. In particular, a bootstrap procedure is proposed to determine the critical values for the derived asymptotic distribution depends on unknown tail index. The small sample performances of proposed monitoring procedure are illustrated by both simulation and application to a high frequency financial data. 相似文献

7.

Improving the detection of unusual observations in high‐dimensional settings

下载免费PDF全文

Insha Ullah Matthew D.M. Pawley Adam N.H. Smith Beatrix Jones 《Australian & New Zealand Journal of Statistics》2017,59(4):449-462

Multivariate control charts are used to monitor stochastic processes for changes and unusual observations. Hotelling's T² statistic is calculated for each new observation and an out‐of‐control signal is issued if it goes beyond the control limits. However, this classical approach becomes unreliable as the number of variables p approaches the number of observations n, and impossible when p exceeds n. In this paper, we devise an improvement to the monitoring procedure in high‐dimensional settings. We regularise the covariance matrix to estimate the baseline parameter and incorporate a leave‐one‐out re‐sampling approach to estimate the empirical distribution of future observations. An extensive simulation study demonstrates that the new method outperforms the classical Hotelling T² approach in power, and maintains appropriate false positive rates. We demonstrate the utility of the method using a set of quality control samples collected to monitor a gas chromatography–mass spectrometry apparatus over a period of 67 days. 相似文献

8.

NONPARAMETRIC RELIABILITY ESTIMATION BASED ON A FEW ORDERED OBSERVATIONS

H. Arsham 《Australian & New Zealand Journal of Statistics》1988,30(1):67-77

The usual one-sided Kolmogorov-Smirnov distance is generalized to obtain an improved lower confidence region for the extreme left tail of the reliability function based on k observations in a “k out of n censored” plan. Finite sample and asymptotic critical values necessary for implementation are given. The two numerical comparisons with existing parametric procedures for the case of complete or censored samples demonstrate the applicability of the proposed nonparametric procedure. 相似文献

9.

On the bootstrap quantile-treatment-effect test

Man-Lai Tang Maozai Tian Ping-Shing Chan 《Journal of applied statistics》2008,35(3):335-350

Let {X ₁, …, X _n} and {Y ₁, …, Y _m} be two samples of independent and identically distributed observations with common continuous cumulative distribution functions F(x)=P(X≤x) and G(y)=P(Y≤y), respectively. In this article, we would like to test the no quantile treatment effect hypothesis H ₀: F=G. We develop a bootstrap quantile-treatment-effect test procedure for testing H ₀ under the location-scale shift model. Our test procedure avoids the calculation of the check function (which is non-differentiable at the origin and makes solving the quantile effects difficult in typical quantile regression analysis). The limiting null distribution of the test procedure is derived and the procedure is shown to be consistent against a broad family of alternatives. Simulation studies show that our proposed test procedure attains its type I error rate close to the pre-chosen significance level even for small sample sizes. Our test procedure is illustrated with two real data sets on the lifetimes of guinea pigs from a treatment-control experiment. 相似文献

10.

Bayes and Empirical Bayes Two-Stage Procedures for Sampling Inspection

Re-Bin Rau 《统计学通讯:理论与方法》2013,42(6):871-887

A batch of M items is inspected for defectives. Suppose there are d defective items in the batch. Let d ₀ be a given standard used to evaluate the quality of the population where 0 < d ₀ < M. The problem of testing H ₀: d < d ₀ versus H ₁: d ≥ d ₀ is considered. It is assumed that past observations are available when the current testing problem is considered. Accordingly, the empirical Bayes approach is employed. By using information obtained from the past data, an empirical Bayes two-stage testing procedure is developed. The associated asymptotic optimality is investigated. It is proved that the rate of convergence of the empirical Bayes two-stage testing procedure is of order O (exp(? c? n)), for some constant c? > 0, where n is the number of past observations at hand. 相似文献

11.

ON A MULTIPLE THREE-DECISION PROCEDURE FOR COMPARING SEVERAL TREATMENTS WITH A CONTROL

W. Liu 《Australian & New Zealand Journal of Statistics》1997,39(1):79-92

For the two-sided comparisons of several treatments with a control, a common statistical problem is to decide which treatments are better than the control and which are worse than the control. This paper studies a multiple three-decision procedure for this purpose, proposed by Bohrer (1979) and Bohrer et al. (1981), and provides tables of critical points to facilitate the application of the procedure. The paper defines a power function of the procedure, and tabulates sample sizes necessary to guarantee a given power level. It addresses the problem of optimal sampling allocation in order to maximize the power for a given total sample size, and considers generalization to the situation where the treatments might have unequal numbers of observations. 相似文献

12.

Impact of Contamination on Training and Test Error Rates in Statistical Clustering

C. Ruwet G. Haesbroeck 《统计学通讯:模拟与计算》2013,42(3):394-411

The k-means algorithm is one of the most common non hierarchical methods of clustering. It aims to construct clusters in order to minimize the within cluster sum of squared distances. However, as most estimators defined in terms of objective functions depending on global sums of squares, the k-means procedure is not robust with respect to atypical observations in the data. Alternative techniques have thus been introduced in the literature, e.g., the k-medoids method. The k-means and k-medoids methodologies are particular cases of the generalized k-means procedure. In this article, focus is on the error rate these clustering procedures achieve when one expects the data to be distributed according to a mixture distribution. Two different definitions of the error rate are under consideration, depending on the data at hand. It is shown that contamination may make one of these two error rates decrease even under optimal models. The consequence of this will be emphasized with the comparison of influence functions and breakdown points of these error rates. 相似文献

13.

Goodness-of-fit test for Gaussian regression with block correlated errors

S. Huet 《Statistics》2015,49(2):239-266

We propose a procedure to test that the expectation of a Gaussian vector is linear against a nonparametric alternative. We consider the case where the covariance matrix of the observations has a block diagonal structure. This framework encompasses regression models with autocorrelated errors, heteroscedastic regression models, mixed-effects models and growth curves. Our procedure does not depend on any prior information about the alternative. We prove that the test is asymptotically of the nominal level and consistent. We characterize the set of vectors on which the test is powerful and prove the classical √log log (n)/n convergence rate over directional alternatives. We propose a bootstrap version of the test as an alternative to the initial one and provide a simulation study in order to evaluate both procedures for small sample sizes when the purpose is to test goodness of fit in a Gaussian mixed-effects model. Finally, we illustrate the procedures using a real data set. 相似文献

14.

The Concept of Majorization in Experimental Design

《统计学通讯:理论与方法》2012,41(16-17):2836-2850

Row and column designs are examined and the concept of majorization is applied to find optimal designs, when the observations are either independent or dependent. The dependence follows a first-order autoregression with parameter α. The case of two treatments is examined and the universally optimal or Φ-optimal designs are given, for different values of a, when the number of experimental units is even or odd. A filtering procedure is followed to reduce the number of competing designs. 相似文献

15.

SELECTING "DEMONSTRABLY BEST" OR "DEMONSTRABLY WORST" EXPONENTIAL POPULATIONS

Eve Bofinger 《Australian & New Zealand Journal of Statistics》1991,33(2):183-190

Consider k independent observations Y_i (i= 1,., k) from two-parameter exponential populations _i with location parameters μ and the same scale parameter If the μ_i are ranked as consider population as the “worst” population and II_p(k) as the “best” population (with some tagging so that p{) and p(k) are well defined in the case of equalities). If the Y_i are ranked as we consider the procedure, “Select provided Y_R(k) Y_r(k) is sufficiently large so that is demonstrably better than the other populations.” A similar procedure is studied for selecting the “demonstrably worst” population. 相似文献

16.

The conditional cumulative distribution function in single functional index model

Ahmed Ait Saidi Mecheri Kheira 《统计学通讯:理论与方法》2013,42(16):4896-4911

ABSTRACT

We consider the estimation of the conditional cumulative distribution function of a scalar response variable Y given a Hilbertian random variable X when the observations are linked via a single-index structure. We establish the pointwise and the uniform almost complete convergence (with the rate) of the kernel estimate of this model. As an application, we show how our result can be applied in the prediction problem via the conditional median estimate. Also, the choice of the functional index via the cross-validation procedure is also discussed but not attacked. 相似文献

17.

APPROXIMATIONS TO SOME TEST STATISTICS FOR PERMUTATION TESTS IN A COMPLETELY RANDOMIZED DESIGN1

J. Robinson 《Australian & New Zealand Journal of Statistics》1983,25(2):358-369

Under a randomization model for a completely randomized design permutation tests are considered based on the usual F statistic and on a multi-response permutation procedure statistic. For the first statistic the first two moments are obtained so a comparision with the distribution under the normal theory model can be made. The second statistic is shown to converge in distribution to an infinite weighted sum of chi-squared variates, the weights being the limits of the eigenvalues of a matrix depending on the distance measure used and the order statistics of the observations. 相似文献

18.

Validity and efficiency in analyzing ordinal responses with missing observations

Xichen She Changbao Wu 《Revue canadienne de statistique》2020,48(2):138-151

This article addresses issues in creating public-use data files in the presence of missing ordinal responses and subsequent statistical analyses of the dataset by users. The authors propose a fully efficient fractional imputation (FI) procedure for ordinal responses with missing observations. The proposed imputation strategy retrieves the missing values through the full conditional distribution of the response given the covariates and results in a single imputed data file that can be analyzed by different data users with different scientific objectives. Two most critical aspects of statistical analyses based on the imputed data set, validity and efficiency, are examined through regression analysis involving the ordinal response and a selected set of covariates. It is shown through both theoretical development and simulation studies that, when the ordinal responses are missing at random, the proposed FI procedure leads to valid and highly efficient inferences as compared to existing methods. Variance estimation using the fractionally imputed data set is also discussed. The Canadian Journal of Statistics 48: 138–151; 2020 © 2019 Statistical Society of Canada 相似文献

19.

Influence diagnostics for censored regression models with autoregressive errors

下载免费PDF全文

Fernanda L. Schumacher Victor H. Lachos Filidor E. Vilca‐Labra Luis M. Castro 《Australian & New Zealand Journal of Statistics》2018,60(2):209-229

Observations collected over time are often autocorrelated rather than independent, and sometimes include observations below or above detection limits (i.e. censored values reported as less or more than a level of detection) and/or missing data. Practitioners commonly disregard censored data cases or replace these observations with some function of the limit of detection, which often results in biased estimates. Moreover, parameter estimation can be greatly affected by the presence of influential observations in the data. In this paper we derive local influence diagnostic measures for censored regression models with autoregressive errors of order p (hereafter, AR(p)‐CR models) on the basis of the Q‐function under three useful perturbation schemes. In order to account for censoring in a likelihood‐based estimation procedure for AR(p)‐CR models, we used a stochastic approximation version of the expectation‐maximisation algorithm. The accuracy of the local influence diagnostic measure in detecting influential observations is explored through the analysis of empirical studies. The proposed methods are illustrated using data, from a study of total phosphorus concentration, that contain left‐censored observations. These methods are implemented in the R package ARCensReg. 相似文献

20.

Identifying Variables Contributing to Outliers in Phase I

Robert L. Mason Youn-Min Chou John C. Young 《统计学通讯:理论与方法》2013,42(7):1103-1118

When a process is monitored with a T ² control chart in a Phase II setting, the MYT decomposition is a valuable diagnostic tool for interpreting signals in terms of the process variables. The decomposition splits a signaling T ² statistic into independent components that can be associated with either individual variables or groups of variables. Since these components are T ² statistics with known distributions, they can be used to determine which of the process variable(s) contribute to the signal. However, this procedure cannot be applied directly to Phase I since the distributions of the individual components are unknown. In this article, we develop the MYT decomposition procedure for a Phase I operation, when monitoring a random sample of individual observations and identifying outliers. We use a relationship between the T ² statistic in Phase I with the corresponding T ² statistic resulting when an observation is omitted from this sample to derive the distributions of these components and demonstrate the Phase I application of the MYT decomposition. 相似文献