首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
ABSTRACT

This article studies the outlier detection problem in mixed regressive-spatial autoregressive model. The formulae for testing outliers and their approximate distributions are derived under the mean-shift model and the variance-weight model, respectively. The simulation studies are conducted for examining the power and size of the test, as well as for the detection of outliers when a simulated data contains several outliers. A real data is analyzed to illustrate the proposed method, and modified models based on mean-shift and variance-weight models in which detected outliers are taken into account are suggested to deal with the outliers and confirm theconclusions.  相似文献   

2.
The growth curve model introduced by Potthoff and Roy (1964) is a general statistical model which includes as special cases regression models and both univariate and multivariate analysis of variance models. In this paper, we discuss procedures for detection of outliers in growth curve models for mean-slippage and dispersion-slippage outlier model. The distributions of the test statistics are discussed and the values of significant probabilities are given using Bonferronl's bounds. Some simulation results are also presented.  相似文献   

3.
空间回归模型选择的反思   总被引:1,自引:0,他引:1  
空间计量经济学存在两种最基本的模型:空间滞后模型和空间误差模型,这里旨在重新思考和探讨这两种空间回归模型的选择,结论为:Moran’s I指数可以用来判断回归模型后的残差是否存在空间依赖性;在实证分析中,采用拉格朗日乘子检验判断两种模型优劣是最常见的做法。然而,该检验仅仅是基于统计推断而忽略了理论基础,因此,可能导致选择错误的模型;在实证分析中,空间误差模型经常被选择性遗忘,而该模型的适用性较空间滞后模型更为广泛;实证分析大多缺乏空间回归模型设定的探讨,Anselin提出三个统计量,并且,如果模型设定正确,应该遵从Wald统计量>Log likelihood统计量>LM统计量的排列顺序。  相似文献   

4.
In the present paper, we use the already defined alpha-divergence and gamma-divergence for constructing some goodness of fit tests for exponentiality. These divergence measures are very robust with respect to outliers. Since the existence of outliers among statistical data can be lead to misleading results, therefore utilizing these divergence measures can be of importance. In order to construct test statistics, two estimators are used for alpha-divergence and gamma-divergence. In the first one, we consider the alpha-divergence and gamma-divergence of the equilibrium distribution function, which is well defined on the empirical distribution function (EDF) and is proposed as an EDF-based goodness of fit test statistic. The second one is an estimator in manner of Vasicek entropy estimator. Simulation results indicate that in comparison with the other tests statistics, our mentioned test statistics almost in most of the cases have higher power. Finally, two examples containing outliers illustrate the importance and use of the proposed tests.  相似文献   

5.
The Forward Search is a powerful general method, incorporating flexible data-driven trimming, for the detection of outliers and unsuspected structure in data and so for building robust models. Starting from small subsets of data, observations that are close to the fitted model are added to the observations used in parameter estimation. As this subset grows we monitor parameter estimates, test statistics and measures of fit such as residuals. The paper surveys theoretical development in work on the Forward Search over the last decade. The main illustration is a regression example with 330 observations and 9 potential explanatory variables. Mention is also made of procedures for multivariate data, including clustering, time series analysis and fraud detection.  相似文献   

6.
This paper studies outlier detection for multilevel models. Approximate formulae for outlier detection in estimating both fixed and random parameters under the mean-shift outlier model are derived, and a test for multiple outliers is proposed. These results can be used to detect outlier units at any levels. Detection of outlier units related to random parts is also studied. Analysis of an example shows that the proposed method is effective in identifying outliers in multilevel models.  相似文献   

7.
The aim of this paper is to develop a general, unified approach, based on some partial estimation functions which we call “Z-process”, to some change point problems in mathematical statistics. The method proposed can be applied not only to ergodic models but also to some models where the Fisher information matrix is random. Applications to some concrete models, including a parametric model for volatilities of diffusion processes are presented. Simulations for randomly time-transformed Brownian bridge process appearing as the limit of the proposed test statistics are performed with computer intensive use.  相似文献   

8.
In this paper, we introduce two new statistics for detecting outliers in the Pareto distribution. These new statistics are the extension of the statistics for detecting outliers in exponential and gamma distributions. In fact, we compare the power of our test statistics with the other statistics and select the best test statistic for detecting outliers in the Pareto distribution. Finally, numerical examples of different insurance claims are used to see the performance of the test.  相似文献   

9.
Most applications in spatial statistics involve modeling of complex spatial–temporal dependency structures, and many of the problems of space and time modeling can be overcome by using separable processes. This subclass of spatial–temporal processes has several advantages, including rapid fitting and simple extensions of many techniques developed and successfully used in time series and classical geostatistics. In particular, a major advantage of these processes is that the covariance matrix for a realization can be expressed as the Kronecker product of two smaller matrices that arise separately from the temporal and purely spatial processes, and hence its determinant and inverse are easily determinable. However, these separable models are not always realistic, and there are no formal tests for separability of general spatial–temporal processes. We present here a formal method to test for separability. Our approach can be also used to test for lack of stationarity of the process. The beauty of our approach is that by using spectral methods the mechanics of the test can be reduced to a simple two-factor analysis of variance (ANOVA) procedure. The approach we propose is based on only one realization of the spatial–temporal process.We apply the statistical methods proposed here to test for separability and stationarity of spatial–temporal ozone fields using data provided by the US Environmental Protection Agency (EPA).  相似文献   

10.
ABSTRACT

Advances in statistical computing software have led to a substantial increase in the use of ordinary least squares (OLS) regression models in the engineering and applied statistics communities. Empirical evidence suggests that data sets can routinely have 10% or more outliers in many processes. Unfortunately, these outliers typically will render the OLS parameter estimates useless. The OLS diagnostic quantities and graphical plots can reliably identify a few outliers; however, they significantly lose power with increasing dimension and number of outliers. Although there have been recent advances in the methods that detect multiple outliers, improvements are needed in regression estimators that can fit well in the presence of outliers. We introduce a robust regression estimator that performs well regardless of outlier quantity and configuration. Our studies show that the best available estimators are vulnerable when the outliers are extreme in the regressor space (high leverage). Our proposed compound estimator modifies recently published methods with an improved initial estimate and measure of leverage. Extensive performance evaluations indicate that the proposed estimator performs the best and consistently fits the bulk of the data when outliers are present. The estimator, implemented in standard software, provides researchers and practitioners a tool for the model-building process to protect against the severe impact from multiple outliers.  相似文献   

11.
韩本三  徐凤  黎实 《统计研究》2011,28(12):83-88
 相关系数的绝对值形式可以很好的避免Pesaran(2004)的CD统计量中异向相关性相互抵消的情况,相应得到一个新的检验面板数据模型扰动项截面相关的统计量。蒙特卡洛模拟显示,无论是在因子模型下还是在空间移动平均模型下,新提出的统计量水平扭曲(size distortion)检验和功效(power)检验表现较好。通过模拟还发现当存在序列相关的扰动项时,先将扰动项进行去序列相关处理可以有效地避免序列相关导致的水平扭曲,并且不会降低统计量的功效。  相似文献   

12.
The nonparametric version of the classical mixed model is considered and the common hypotheses of (parametric) main effects and interactions are reformulated in a nonparametric setup. To test these nonparametric hypotheses, the asymptotic distributions of quadratic forms of rank statistics are derived in a general framework which enables the derivation of the statistics for the nonparametric hypotheses of the fixed treatment effects and interactions in an arbitrary mixed model. The procedures given here are not restricted to semiparametric models or models with additive effects. Moreover, they are robust to outliers since only the ranks of the observations are needed. They are also applicable to pure ordinal data and since no continuity of the distribution functions is assumed, they can also be applied to data with ties. Some approximations for small sample sizes are suggested and analyzed in a simulation study. The application of the statistics and the interpretation of the results is demonstrated in several worked-out examples where some data sets given in the literature are re-analyzed.  相似文献   

13.
The union-intersection approach to multivariate test construction is used to develop an alternative to Wilks' likelihood ratio test statistic for testing for two or more outliers in multivariate normal data. It is shown that critical values of both statistics are poorly approximated by Bonferroni bounds. Simulated critical values are presented for both statistics for significance levels 1% and 5%, for sample sizes 10(5)30, 40, 50, 75 and 100 for 2, 3, 4 and 5 dimensions. A power comparison of the two tests in the slippage of the mean model for generating outliers indicates that the union-intersection test is the more powerful when the slippages are close to collinear. Although Wilks' test remains the preference for general use, the union-intersection test could be valuable when such special structure in the data is suspected.  相似文献   

14.
序列相关的检验与诊断   总被引:4,自引:0,他引:4       下载免费PDF全文
赵进文 《统计研究》1997,14(2):63-65
序列相关的检验与诊断赵进文ABSTRACTIngeneralopinion,themodelswebuilthavepowerfulpracticalforecastabilitysolongastheyareacceptedbysignifican...  相似文献   

15.
Outliers can occur as readily in samples from the finite populations (e.g. in sample surveys) as in samples from infinite populations. However, in the vast literature on outliers there is almost no mention of outlier tests for data from sample surveys. We examine the behaviour of some standard outlier test statistics for infinite populations when these are applied to finite populations, examining their properties by extensive simulation studies. Some anomalous results are obtained Nsuggesting a fundamental difficulty in testing outliers for the finite population case.  相似文献   

16.
刘洪  黄燕 《统计研究》2007,24(8):17-21
 本文采用组合模型的形式对时间序列数据的变化特点建模,在模型通过各种检验、具有良好统计预测功能的基础上,从检验异常值的角度来分析预测值与实际值之间差异的程度,找出离群数据,利用数理统计中检验实验观测数据异常值的方法,对离群数据的误差进行统计上的显著检验,从而评估统计数据的质量。文章以我国国内生产总值(GDP)为研究对象,选取我国1978-2003年间的GDP作为样本,运用趋势模拟评估法来评估我国2004年国内生产总值的准确性。对我国经济指标的时间序列数据进行了实证分析。  相似文献   

17.
Exact tests for the equality of several linear models are developed using permutation techniques. Two cases of the linear model, characterized by either stochastic or nonstochastic predictors, are considered: the linear regression model (LRM) and the general linear model (GLM). A general class of test statistics using the volume of simplexes as the basic unit of analysis is proposed for this problem. The resulting class of statistics is shown to be a natural generalization of the multi-response permutation procedure (MRPP) test statistics which have been shown to comprise many of the statistics used in both parametric and nonparametric analysis of the standard g—sample problem. In the LRM case, exact moments of all orders are derived for the permutation distribution of any test statistic in the general class. Moment-based approximation of significance levels is shown to be computationally feasible in the simple LRM.  相似文献   

18.
In the literature related to the study of lifelengths of experimental units, little attention has been paid to the models where shocks to the units generate outliers. In the present article, we consider a situation where n experimental units under investigation receive shocks at several time points. The parameter values of the lifelength distribution may change due to each shock, resulting in the generation of outliers. We derive the likelihood ratio test statistic to investigate if the shocks have significantly altered the parameter values. We also derive a likelihood ratio test under the labelled slippage alternative with multiple contaminations. Monte Carlo studies have been carried out to investigate the power of the proposed test statistics.  相似文献   

19.
Statistics for spatial functional data is an emerging field in statistics which combines methods of spatial statistics and functional data analysis to model spatially correlated functional data. Checking for spatial autocorrelation is an important step in the statistical analysis of spatial data. Several statistics to achieve this goal have been proposed. The test based on the Mantel statistic is widely known and used in this context. This paper proposes an application of this test to the case of spatial functional data. Although we focus particularly on geostatistical functional data, that is functional data observed in a region with spatial continuity, the test proposed can also be applied with functional data which can be measured on a discrete set of areas of a region (areal functional data) by defining properly the distance between the areas. Based on two simulation studies, we show that the proposed test has a good performance. We illustrate the methodology by applying it to an agronomic data set.  相似文献   

20.
Null as well as alternative distributions of two types of statistics used to test for multiple outliers in exponential samples are obtained. Of these two types one is based on the ratio of sum of the observations suspected to be outliers to sum of the sample observations and the other is Dixon's type. Powers of the tests based on these statistics are compared.

  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号