首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
In the paper we derive new types of multivariate exponentially weighted moving average (EWMA) control charts which are based on the Euclidean distance and on the distance defined by using the inverse of the diagonal matrix consisting of the variances. The design of the proposed control schemes does not involve the computation of the inverse covariance matrix and, thus, it can be used in the high-dimensional setting. The distributional properties of the control statistics are obtained and are used in the determination of the new control procedures. Within an extensive simulation study, the new approaches are compared with the multivariate EWMA control charts which are based on the Mahalanobis distance.  相似文献   

2.
The purpose of this paper is to jointly monitor the mean vector and the covariance matrix of multivariate nonlinear times series. The underlying target process is assumed to be a constant conditional correlation process Bollerslev (Rev Econ Stat 72:498–505, 1990) or a dynamic conditional correlation model Engle (J Bus Econ Stat 20:339–350, 2002). We introduce several EWMA and CUSUM control charts. These control schemes are based on univariate EWMA statistics, multivariate EWMA recursions, and different types of cumulative sums. The recursions are applied to local measures for means and covariances, e.g. the present observations and the conditional covariances. Further, they are applied to means and covariances of residuals. The control statistics are obtained by computing the Mahalanobis distance between the EWMA or CUSUM statistics and their expectations if no change occurs. Via Monte Carlo simulation the performance of the proposed charts is compared. Our empirical study illustrates an application of these control procedures to bivariate logarithmic returns of the European indices FTSE100 and DAX. In order to assess the performance of the introduced schemes we apply the average run length and the maximum conditional expected delay.  相似文献   

3.
I consider the problem of estimating the Mahalanobis distance between multivariate normal populations when the population covariance matrix satisfies a graphical model. In addition to providing a clear understanding of the dependencies in a multivariate data set, the use of graphical models can reduce the variability of the estimated distances and improve inferences. I derive the asymptotic distribution of the estimated Mahalanobis distance under a general covariance model, which includes graphical models as a special case. Two examples are discussed.  相似文献   

4.
Modified cumulative sum (CUSUM) control charts and CUSUM schemes for residuals are suggested to detect changes in the covariance matrix of multivariate time series. Several properties of these schemes are derived when the in-control process is a stationary Gaussian process. A Monte Carlo study reveals that the proposed approaches show similar or even better performance than the schemes based on the multivariate exponentially weighted moving average (MEWMA) recursion. We illustrate how the control procedures can be applied to monitor the covariance structure of developed stock market indices.  相似文献   

5.
The Mahalanobis distance between pairs of multivariate observations is used as a measure of similarity between the observations. The theoretical distribution is derived, and the result is used for judging on the degree of isolation of an observation. In case of spatially dependent data where spatial coordinates are available, different exploratory tools are introduced for studying the degree of isolation of an observation from a fraction of its neighbors, and thus to identify local multivariate outliers.  相似文献   

6.
In this article, we study exponentially weighted moving average (EWMA) control schemes to monitor the multivariate Poisson distribution with a general covariance structure, so that the practitioner can simultaneously monitor multiple correlated attribute processes more effectively. The statistical performance of the charts is assessed in terms of the run length properties and compared against other mainstream attribute control schemes. The application of the proposed methods to real-life and simulated datasets is demonstrated.  相似文献   

7.
It is well known that if a multivariate outlier has one or more missing component values, then multiple imputation (MI) methods tend to impute nonextreme values and make the outlier become less extreme and less likely to be detected. In this paper, nonparametric depth-based multivariate outlier identifiers are used as criteria in a numerical study comparing several established methods of MI as well as a new proposed one, nine in all, in a setting of several actual clinical laboratory data sets of different dimensions. Two criteria, an ‘outlier recovery probability’ and a ‘relative accuracy measure’, are developed, based on depth functions. Three outlier identifiers, based on Mahalanobis distance, robust Mahalanobis distance, and generalized principle component analysis are also included in the study. Consequently, not only the comparison of imputation methods but also the comparison of outlier detection methods is accomplished in this study. Our findings show that the performance of an MI method depends on the choice of depth-based outlier detection criterion, as well as the size and dimension of the data and the fraction of missing components. By taking these features into account, an MI method for a given data set can be selected more optimally.  相似文献   

8.
A note on the Cook''s distance   总被引:1,自引:0,他引:1  
A modification of the classical Cook's distance is proposed, providing us with a generalized Mahalanobis distance in the context of multivariate elliptical linear regression models. We establish the exact distribution of a pivotal type statistics based on this generalized Mahalanobis distance, which provides critical points for the identification of outlier data points. Based on the equivalence between the modified Cook's distance and what is called the mean-shift multivariate outlier elliptical model, twelve new modifications are proposed for the Cook's distance. We also describe the explicit relationship between the Cook's distance and the likelihood displacement with the modified Cook's distance. We illustrate the procedure with some examples, in the context of multiple and multivariate linear regression.  相似文献   

9.
Birnbaum–Saunders (BS) models are receiving considerable attention in the literature. Multivariate regression models are a useful tool of the multivariate analysis, which takes into account the correlation between variables. Diagnostic analysis is an important aspect to be considered in the statistical modeling. In this paper, we formulate multivariate generalized BS regression models and carry out a diagnostic analysis for these models. We consider the Mahalanobis distance as a global influence measure to detect multivariate outliers and use it for evaluating the adequacy of the distributional assumption. We also consider the local influence approach and study how a perturbation may impact on the estimation of model parameters. We implement the obtained results in the R software, which are illustrated with real-world multivariate data to show their potential applications.  相似文献   

10.
Multivariate control charts are powerful and simple visual tools for monitoring the quality of a process. This multivariate monitoring is carried out by considering simultaneously several correlated quality characteristics and by determining whether these characteristics are in control or out of control. In this paper, we propose a robust methodology using multivariate quality control charts for subgroups based on generalized Birnbaum–Saunders distributions and an adapted Hotelling statistic. This methodology is constructed for Phases I and II of control charts. We estimate the corresponding parameters with the maximum likelihood method and use parametric bootstrapping to obtain the distribution of the adapted Hotelling statistic. In addition, we consider the Mahalanobis distance to detect multivariate outliers and use it to assess the adequacy of the distributional assumption. A Monte Carlo simulation study is conducted to evaluate the proposed methodology and to compare it with a standard methodology. This study reports the good performance of our methodology. An illustration with real-world air quality data of Santiago, Chile, is provided. This illustration shows that the methodology is useful for alerting early episodes of extreme air pollution, thus preventing adverse effects on human health.  相似文献   

11.
In this paper, we discuss the derivation of the first and second moments for the proposed small area estimators under a multivariate linear model for repeated measures data. The aim is to use these moments to estimate the mean-squared errors (MSE) for the predicted small area means as a measure of precision. At the first stage, we derive the MSE when the covariance matrices are known. At the second stage, a method based on parametric bootstrap is proposed for bias correction and for prediction error that reflects the uncertainty when the unknown covariance is replaced by its suitable estimator.  相似文献   

12.
We apply statistical selection theory to multiple target detection problems by analyzing the Mahalanobis distances between multivariate normal populations and a desired standard (a known characteristic of a target). We want to achieve the goal of selecting a subset that contains no non target (negative) sites, which entails screening out all non targets. Correct selection (CS) is defined according to this goal. We consider two cases: (1) that all covariance matrices are known; and (2) that all covariance matrices are unknown, including both heteroscedastic and homoscedastic cases. Optimal selection procedures are proposed in order to reach the selection goal. The least favorable configurations (LFC) are found. Tables and figures are presented to illustrate the properties of our proposed procedures. Simulation examples are given to show that our procedures work well. The log-concavity results of the operating characteristic functions are also given.  相似文献   

13.
In extending univariate outlier detection methods to higher dimension, various issues arise: limited visualization methods, inadequacy of marginal methods, lack of a natural order, limited parametric modeling, and, when using Mahalanobis distance, restriction to ellipsoidal contours. To address and overcome such limitations, we introduce nonparametric multivariate outlier identifiers based on multivariate depth functions, which can generate contours following the shape of the data set. Also, we study masking robustness, that is, robustness against misidentification of outliers as nonoutliers. In particular, we define a masking breakdown point (MBP), adapting to our setting certain ideas of Davies and Gather [1993. The identification of multiple outliers (with discussion). Journal of the American Statistical Association 88, 782–801] and Becker and Gather [1999. The masking breakdown point of multivariate outlier identification rules. Journal of the American Statistical Association 94, 947–955] based on the Mahalanobis distance outlyingness. We then compare four affine invariant outlier detection procedures, based on Mahalanobis distance, halfspace or Tukey depth, projection depth, and “Mahalanobis spatial” depth. For the goal of threshold type outlier detection, it is found that the Mahalanobis distance and projection procedures are distinctly superior in performance, each with very high MBP, while the halfspace approach is quite inferior. When a moderate MBP suffices, the Mahalanobis spatial procedure is competitive in view of its contours not constrained to be elliptical and its computational burden relatively mild. A small sampling experiment yields findings completely in accordance with the theoretical comparisons. While these four depth procedures are relatively comparable for the purpose of robust affine equivariant location estimation, the halfspace depth is not competitive with the others for the quite different goal of robust setting of an outlyingness threshold.  相似文献   

14.
ABSTRACT

Matrix-valued covariance functions are crucial to geostatistical modelling of multivariate spatial data. The classical assumption of symmetry of a multivariate covariance function is overly restrictive and has been considered as unrealistic for most of the real data applications. Despite of that, the literature on asymmetric covariance functions has been very sparse. In particular, there is some work related to asymmetric covariances on Euclidean spaces, depending on the Euclidean distance. However, for data collected over large portions of planet Earth, the most natural spatial domain is a sphere, with the corresponding geodesic distance being the natural metric. In this work, we propose a strategy based on spatial rotations to generate asymmetric covariances for multivariate random fields on the d-dimensional unit sphere. We illustrate through simulations as well as real data analysis that our proposal allows to achieve improvements in the predictive performance in comparison to the symmetric counterpart.  相似文献   

15.
In this paper, we study the effect of estimating the vector of means and the variance–covariance matrix on the performance of two of the most widely used multivariate cumulative sum (CUSUM) control charts, the MCUSUM chart proposed by Crosier [Multivariate generalizations of cumulative sum quality-control schemes, Technometrics 30 (1988), pp. 291–303] and the MC1 chart proposed by Pignatiello and Runger [Comparisons of multivariate CUSUM charts, J. Qual. Technol. 22 (1990), pp. 173–186]. Using simulation, we investigate and compare the in-control and out-of-control performances of the competing charts in terms of the average run length measure. The in-control and out-of-control performances of the competing charts deteriorate significantly if the estimated parameters are used with control limits intended for known parameters, especially when only a few Phase I samples are used to estimate the parameters. We recommend the use of the MC1 chart over that of the MCUSUM chart if the parameters are estimated from a small number of Phase I samples.  相似文献   

16.
It is well known that the traditional Pearson correlation in many cases fails to capture non-linear dependence structures in bivariate data. Other scalar measures capable of capturing non-linear dependence exist. A common disadvantage of such measures, however, is that they cannot distinguish between negative and positive dependence, and typically the alternative hypothesis of the accompanying test of independence is simply “dependence”. This paper discusses how a newly developed local dependence measure, the local Gaussian correlation, can be used to construct local and global tests of independence. A global measure of dependence is constructed by aggregating local Gaussian correlation on subsets of \(\mathbb{R}^{2}\) , and an accompanying test of independence is proposed. Choice of bandwidth is based on likelihood cross-validation. Properties of this measure and asymptotics of the corresponding estimate are discussed. A bootstrap version of the test is implemented and tried out on both real and simulated data. The performance of the proposed test is compared to the Brownian distance covariance test. Finally, when the hypothesis of independence is rejected, local independence tests are used to investigate the cause of the rejection.  相似文献   

17.
Patriota and Lemonte [24] introduced a quite general multivariate normal regression model. This model considers that the mean vector and the covariance matrix share the same vector of parameters. In this paper we present some influence assessment for this model, such as the local influence, total local influence of an individual and generalized leverage which are discussed. Additionally, the normal curvatures for local influence studies are derived under some perturbation schemes.  相似文献   

18.
In this study, a new per-field classification method is proposed for supervised classification of remotely sensed multispectral image data of an agricultural area using Gaussian mixture discriminant analysis (MDA). For the proposed per-field classification method, multivariate Gaussian mixture models constructed for control and test fields can have fixed or different number of components and each component can have different or common covariance matrix structure. The discrimination function and the decision rule of this method are established according to the average Bhattacharyya distance and the minimum values of the average Bhattacharyya distances, respectively. The proposed per-field classification method is analyzed for different structures of a covariance matrix with fixed and different number of components. Also, we classify the remotely sensed multispectral image data using the per-pixel classification method based on Gaussian MDA.  相似文献   

19.
In this paper control charts for the mean of a multivariate Gaussian process are considered. Using the generalized likelihood ratio approach and the sequential probability ratio test under an additional constraint on the magnitude of the change various types of CUSUM control charts are derived. It is analyzed under which conditions these schemes are directionally invariant. These charts are compared with several other control schemes proposed in literature. The performance of the charts is studied based on the maximum average delay.  相似文献   

20.
Multivariate Poisson regression with covariance structure   总被引:1,自引:0,他引:1  
In recent years the applications of multivariate Poisson models have increased, mainly because of the gradual increase in computer performance. The multivariate Poisson model used in practice is based on a common covariance term for all the pairs of variables. This is rather restrictive and does not allow for modelling the covariance structure of the data in a flexible way. In this paper we propose inference for a multivariate Poisson model with larger structure, i.e. different covariance for each pair of variables. Maximum likelihood estimation, as well as Bayesian estimation methods are proposed. Both are based on a data augmentation scheme that reflects the multivariate reduction derivation of the joint probability function. In order to enlarge the applicability of the model we allow for covariates in the specification of both the mean and the covariance parameters. Extension to models with complete structure with many multi-way covariance terms is discussed. The method is demonstrated by analyzing a real life data set.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号