期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Indices of Dependence Between Types in Multivariate Point Patterns 总被引：2，自引：0，他引：2

M. N. M. Van Lieshout & A. J. Baddeley 《Scandinavian Journal of Statistics》1999,26(4):511-532

We propose new summary statistics quantifying several forms of dependence between points of different types in a multi-type spatial point pattern. These statistics are the multivariate counterparts of the J -function for point processes of a single type, introduced by Lieshout & Baddeley (1996). They are based on comparing distances from a type i point to either the nearest type j point or to the nearest point in the pattern regardless of type to these distances seen from an arbitrary point in space. Information about the range of interaction can also be inferred. Our statistics can be computed explicitly for a range of well-known multivariate point process models. Some applications to bivariate and trivariate data sets are presented as well. 相似文献

2.

Shape bias of robust covariance estimators: an empirical study

M. Hubert P. Rousseeuw K. Vakili 《Statistical Papers》2014,55(1):15-28

Detecting outliers in a multivariate point cloud is not trivial, especially when dealing with a sizable fraction of contamination. Over time, it has increasingly been recognized that the safest and most feasible approach to exposing outliers starts by computing a highly robust estimator of location and scatter that can withstand a large proportion of contamination. Many such estimators have been proposed in recent years. We will compare the worst-case bias of several prominent robust multivariate estimators by means of simulation. We also propose a new tool to compare robust estimators on real data sets, and illustrate it. 相似文献

3.

Robust estimation of the mean vector for high-dimensional data set using robust clustering

Hamid Shahriari 《Journal of applied statistics》2015,42(6):1183-1205

The first step in statistical analysis is the parameter estimation. In multivariate analysis, one of the parameters of interest to be estimated is the mean vector. In multivariate statistical analysis, it is usually assumed that the data come from a multivariate normal distribution. In this situation, the maximum likelihood estimator (MLE), that is, the sample mean vector, is the best estimator. However, when outliers exist in the data, the use of sample mean vector will result in poor estimation. So, other estimators which are robust to the existence of outliers should be used. The most popular robust multivariate estimator for estimating the mean vector is S-estimator with desirable properties. However, computing this estimator requires the use of a robust estimate of mean vector as a starting point. Usually minimum volume ellipsoid (MVE) is used as a starting point in computing S-estimator. For high-dimensional data computing, the MVE takes too much time. In some cases, this time is so large that the existing computers cannot perform the computation. In addition to the computation time, for high-dimensional data set the MVE method is not precise. In this paper, a robust starting point for S-estimator based on robust clustering is proposed which could be used for estimating the mean vector of the high-dimensional data. The performance of the proposed estimator in the presence of outliers is studied and the results indicate that the proposed estimator performs precisely and much better than some of the existing robust estimators for high-dimensional data. 相似文献

4.

Fast computation of spatially adaptive kernel estimates

Tilman M. Davies Adrian Baddeley 《Statistics and Computing》2018,28(4):937-956

Kernel smoothing of spatial point data can often be improved using an adaptive, spatially varying bandwidth instead of a fixed bandwidth. However, computation with a varying bandwidth is much more demanding, especially when edge correction and bandwidth selection are involved. This paper proposes several new computational methods for adaptive kernel estimation from spatial point pattern data. A key idea is that a variable-bandwidth kernel estimator for d-dimensional spatial data can be represented as a slice of a fixed-bandwidth kernel estimator in \((d+1)\)-dimensional scale space, enabling fast computation using Fourier transforms. Edge correction factors have a similar representation. Different values of global bandwidth correspond to different slices of the scale space, so that bandwidth selection is greatly accelerated. Potential applications include estimation of multivariate probability density and spatial or spatiotemporal point process intensity, relative risk, and regression functions. The new methods perform well in simulations and in two real applications concerning the spatial epidemiology of primary biliary cirrhosis and the alarm calls of capuchin monkeys. 相似文献

5.

Detecting outliers and influential points: an indirect classical Mahalanobis distance-based method

Xuqing Liu Feng Gao Yandong Wu Zhiguo Zhao 《Journal of Statistical Computation and Simulation》2018,88(11):2013-2033

相似文献

6.

Multivariate geometric anisotropic Cox processes

James S. Martin David J. Murrell Sofia C. Olhede 《Scandinavian Journal of Statistics》2023,50(3):1420-1465

This paper introduces a new modeling and inference framework for multivariate and anisotropic point processes. Building on recent innovations in multivariate spatial statistics, we propose a new family of multivariate anisotropic random fields, and from them a family of anisotropic point processes. We give conditions that make the proposed models valid. We also propose a Palm likelihood-based inference method for this type of point process, circumventing issues of likelihood tractability. Finally we illustrate the utility of the proposed modeling framework by analyzing spatial ecological observations of plants and trees in the Barro Colorado Island data. 相似文献

7.

Multivariate Change Point Control Chart Based on Data Depth for Phase I Analysis

Zhonghua Li Yi Dai Zhaojun Wang 《统计学通讯:模拟与计算》2013,42(6):1490-1507

A multivariate change point control chart based on data depth (CPDP) is considered for detecting shifts in either the mean vector, the covariance matrix, or both of the processes for Phase I. The proposed chart is preferable from a robustness point of view, has attractive detection performance, and can be especially useful in Phase I analysis setting, where there is limited information about the underlying process. Comparison results and an illustrative example show that our CPDP chart has great potential for Phase I analysis of multivariate individual observations. The application of CPDP chart is illustrated in a real data example. 相似文献

8.

Multivariate multi-sample tests for location based on data depth

《Journal of Statistical Computation and Simulation》2012,82(18):3377-3390

A notion of data depth is used to measure centrality or outlyingness of a data point in a given data cloud. In the context of data depth, the point (or points) having maximum depth is called as deepest point (or points). In the present work, we propose three multi-sample tests for testing equality of location parameters of multivariate populations by using the deepest point (or points). These tests can be considered as extensions of two-sample tests based on the deepest point (or points). The proposed tests are implemented through the idea of Fisher's permutation test. Performance of earlier tests is studied by simulation. Illustration with two real datasets is also provided. 相似文献

9.

Multivariate trimmed means based on the Tukey depth

Jean-Claude Massé 《Journal of statistical planning and inference》2009

In univariate statistics, the trimmed mean has long been regarded as a robust and efficient alternative to the sample mean. A multivariate analogue calls for a notion of trimmed region around the center of the sample. Using Tukey's depth to achieve this goal, this paper investigates two types of multivariate trimmed means obtained by averaging over the trimmed region in two different ways. For both trimmed means, conditions ensuring asymptotic normality are obtained; in this respect, one of the main features of the paper is the systematic use of Hadamard derivatives and empirical processes methods to derive the central limit theorems. Asymptotic efficiency relative to the sample mean as well as breakdown point are also studied. The results provide convincing evidence that these location estimators have nice asymptotic behavior and possess highly desirable finite-sample robustness properties; furthermore, relative to the sample mean, both of them can in some situations be highly efficient for dimensions between 2 and 10. 相似文献

10.

Methods for repeated measures data analysis with missing values

《Journal of statistical planning and inference》1999,77(2):221-236

There are various techniques for dealing with incomplete data; some are computationally highly intensive and others are not as computationally intensive, while all may be comparable in their efficiencies. In spite of these developments, analysis using only the complete data subset is performed when using popular statistical software. In an attempt to demonstrate the efficiencies and advantages of using all available data, we compared several approaches that are relatively simple but efficient alternatives to those using the complete data subset for analyzing repeated measures data with missing values, under the assumption of a multivariate normal distribution of the data. We also assumed that the missing values occur in a monotonic pattern and completely at random. The incomplete data procedure is demonstrated to be more powerful than the procedure of using the complete data subset, generally when the within-subject correlation gets large. One other principal finding is that even with small sample data, for which various covariance models may be indistinguishable, the empirical size and power are shown to be sensitive to misspecified assumptions about the covariance structure. Overall, the testing procedures that do not assume any particular covariance structure are shown to be more robust in keeping the empirical size at the nominal level than those assuming a special structure. 相似文献

11.

Detection of multiple change-points in multivariate data

Edgard M. Maboudou-Tchao Douglas M. Hawkins 《Journal of applied statistics》2013,40(9):1979-1995

The statistical analysis of change-point detection and estimation has received much attention recently. A time point such that observations follow a certain statistical distribution up to that point and a different distribution – commonly of the same functional form but different parameters after that point – is called a change-point. Multiple change-point problems arise when we have more than one change-point. This paper develops a method for multivariate normally distributed data to detect change-points and estimate within-segment parameters using maximum likelihood estimation. 相似文献

12.

Nonparametric tests for multivariate locations based on data depth

Somanath D. Pawar Digambar T. Shirke 《统计学通讯:模拟与计算》2019,48(3):753-776

The present paper deals with the problem of testing equality of locations of two multivariate distributions using a notion of data depth. A notion of data depth has been used to measure centrality/outlyingness of a given point in a given data cloud. The paper proposes two nonparametric tests for testing equality of locations of two multivariate populations which are developed by observing the behavior of the depth versus depth plot. Simulation study reveals that the proposed tests are superior to the existing tests based on the data depth with regard to power. Illustrations with real data are provided. 相似文献

13.

A flexible marginal modelling strategy for non-monotone missing data

Ivy Jansen Geert Molenberghs 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》2008,171(2):347-373

Summary. Much research has been devoted to modelling strategies for longitudinal data with missingness, recently especially within the missingness not at random context. In this paper, the relatively unexplored but practically highly relevant domain of non-monotone missingness with multivariate ordinal responses is broached. For this, a dedicated version of the multivariate Dale model is formulated. Furthermore, we also assess the sensitivity of these models to their assumptions, by using the technique of global influence. 相似文献

14.

A probabilistic nearest neighbour method for statistical pattern recognition

C. C. Holmes N. M. Adams 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2002,64(2):295-306

Summary. Nearest neighbour algorithms are among the most popular methods used in statistical pattern recognition. The models are conceptually simple and empirical studies have shown that their performance is highly competitive against other techniques. However, the lack of a formal framework for choosing the size of the neighbourhood k is problematic. Furthermore, the method can only make discrete predictions by reporting the relative frequency of the classes in the neighbourhood of the prediction point. We present a probabilistic framework for the k -nearest-neighbour method that largely overcomes these difficulties. Uncertainty is accommodated via a prior distribution on k as well as in the strength of the interaction between neighbours. These prior distributions propagate uncertainty through to proper probabilistic predictions that have continuous support on (0, 1). The method makes no assumptions about the distribution of the predictor variables. The method is also fully automatic with no user-set parameters and empirically it proves to be highly accurate on many bench-mark data sets. 相似文献

15.

Identifying the time of a step change with multivariate single control charts

《Journal of Statistical Computation and Simulation》2012,82(8):1529-1543

Change point estimation procedures simplify the efforts to search for and identify special causes in multivariate statistical process monitoring. After a signal is generated by the simultaneously used control charts or a single control chart, add-on change point procedure estimates the time of the change. In this study, multivariate joint change point estimation performance for simultaneous monitoring of both location and dispersion is compared under the assumption that various single charts are used to monitor the process. The change detection performance for several structural changes for the mean vector and covariance matrix is also discussed. It is concluded that choice of the control chart to obtain a signal may affect the change point detection performance. 相似文献

16.

Bayesian mixture modeling for spatial Poisson process intensities,with applications to extreme value analysis

Athanasios Kottas Bruno Sansó 《Journal of statistical planning and inference》2007

We propose a method for the analysis of a spatial point pattern, which is assumed to arise as a set of observations from a spatial nonhomogeneous Poisson process. The spatial point pattern is observed in a bounded region, which, for most applications, is taken to be a rectangle in the space where the process is defined. The method is based on modeling a density function, defined on this bounded region, that is directly related with the intensity function of the Poisson process. We develop a flexible nonparametric mixture model for this density using a bivariate Beta distribution for the mixture kernel and a Dirichlet process prior for the mixing distribution. Using posterior simulation methods, we obtain full inference for the intensity function and any other functional of the process that might be of interest. We discuss applications to problems where inference for clustering in the spatial point pattern is of interest. Moreover, we consider applications of the methodology to extreme value analysis problems. We illustrate the modeling approach with three previously published data sets. Two of the data sets are from forestry and consist of locations of trees. The third data set consists of extremes from the Dow Jones index over a period of 1303 days. 相似文献

17.

Analysis of distance for structured multivariate data and extensions to multivariate analysis of variance 总被引：3，自引：0，他引：3

J. C. Gower & W. J. Krzanowski 《Journal of the Royal Statistical Society. Series C, Applied statistics》1999,48(4):505-519

相似文献

18.

A pattern-mixture odds ratio model for incomplete categorical data

Bart Michiels Geert Molenberghs Stuart R. Lipsitz 《统计学通讯:理论与方法》2013,42(12):2843-2869

Most models for incomplete data are formulated within the selection model framework. Pattern-mixture models are increasingly seen as a viable alternative, both from an interpretational as well as from a computational point of view (Little 1993, Hogan and Laird 1997, Ekholm and Skinner 1998). Whereas most applications are either for continuous normally distributed data or for simplified categorical settings such as contingency tables, we show how a multivariate odds ratio model (Molenberghs and Lesaffre 1994, 1998) can be used to fit pattern-mixture models to repeated binary outcomes with continuous covariates. Apart from point estimation, useful methods for interval estimation are presented and data from a clinical study are analyzed to illustrate the methods. 相似文献

19.

A nested frailty model for survival data,with an application to the study of child survival in northeast Brazil

Sastry N 《Journal of the American Statistical Association》1997,92(438):426-435

"This article presents a multivariate hazard model for survival data that are clustered at two hierarchical levels.... We apply the model to an analysis of the covariates of child survival using survey data from northeast Brazil collected via a hierarchically clustered sampling scheme. We find that family and community frailty effects are fairly small in magnitude but are of importance because they alter the results in a systematic pattern." 相似文献

20.

Nonparametric tests for multivariate multi-sample locations based on data depth

Somanath D. Pawar Digambar T. Shirke 《Journal of Statistical Computation and Simulation》2019,89(9):1574-1591

Several nonparametric tests for multivariate multi-sample location problem are proposed in this paper. These tests are based on the notion of data depth, which is used to measure the centrality/outlyingness of a given point with respect to a given distribution or a data cloud. Proposed tests are completely nonparametric and implemented through the idea of permutation tests. Performance of the proposed tests is compared with existing parametric test and nonparametric test based on data depth. An extensive simulation study reveals that proposed tests are superior to the existing tests based on data depth with regard to power. Illustrations with real data are provided. 相似文献