期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Goodness‐of‐fit testing based on a weighted bootstrap: A fast large‐sample alternative to the parametric bootstrap

Ivan Kojadinovic Jun Yan 《Revue canadienne de statistique》2012,40(3):480-500

The process comparing the empirical cumulative distribution function of the sample with a parametric estimate of the cumulative distribution function is known as the empirical process with estimated parameters and has been extensively employed in the literature for goodness‐of‐fit testing. The simplest way to carry out such goodness‐of‐fit tests, especially in a multivariate setting, is to use a parametric bootstrap. Although very easy to implement, the parametric bootstrap can become very computationally expensive as the sample size, the number of parameters, or the dimension of the data increase. An alternative resampling technique based on a fast weighted bootstrap is proposed in this paper, and is studied both theoretically and empirically. The outcome of this work is a generic and computationally efficient multiplier goodness‐of‐fit procedure that can be used as a large‐sample alternative to the parametric bootstrap. In order to approximately determine how large the sample size needs to be for the parametric and weighted bootstraps to have roughly equivalent powers, extensive Monte Carlo experiments are carried out in dimension one, two and three, and for models containing up to nine parameters. The computational gains resulting from the use of the proposed multiplier goodness‐of‐fit procedure are illustrated on trivariate financial data. A by‐product of this work is a fast large‐sample goodness‐of‐fit procedure for the bivariate and trivariate t distribution whose degrees of freedom are fixed. The Canadian Journal of Statistics 40: 480–500; 2012 © 2012 Statistical Society of Canada 相似文献

2.

Testing parametric models in linear‐directional regression

Eduardo GarcÍa‐Portugués Ingrid Van Keilegom Rosa M. Crujeiras and Wenceslao González‐Manteiga 《Scandinavian Journal of Statistics》2016,43(4):1178-1191

This paper presents a goodness‐of‐fit test for parametric regression models with scalar response and directional predictor, that is, a vector on a sphere of arbitrary dimension. The testing procedure is based on the weighted squared distance between a smooth and a parametric regression estimator, where the smooth regression estimator is obtained by a projected local approach. Asymptotic behaviour of the test statistic under the null hypothesis and local alternatives is provided, jointly with a consistent bootstrap algorithm for application in practice. A simulation study illustrates the performance of the test in finite samples. The procedure is applied to test a linear model in text mining. 相似文献

3.

Model Checks in Inverse Regression Models with Convolution‐Type Operators

NICOLAI BISSANTZ HOLGER DETTE KATHARINA PROKSCH 《Scandinavian Journal of Statistics》2012,39(2):305-322

Abstract. We consider the problem of testing parametric assumptions in an inverse regression model with a convolution‐type operator. An L ₂‐type goodness‐of‐fit test is proposed which compares the distance between a parametric and a non‐parametric estimate of the regression function. Asymptotic normality of the corresponding test statistic is shown under the null hypothesis and under a general non‐parametric alternative with different rates of convergence in both cases. The feasibility of the proposed test is demonstrated by means of a small simulation study. In particular, the power of the test against certain types of alternative is investigated. Finally, an empirical example is provided, in which the proposed methods are applied to the determination of the shape of the luminosity profile of the elliptical galaxy NGC 5017. 相似文献

4.

BOOTSTRAP TESTS FOR THE ERROR DISTRIBUTION IN LINEAR AND NONPARAMETRIC REGRESSION MODELS

Natalie Neumeyer Holger Dette Eva-Renate Nagel 《Australian & New Zealand Journal of Statistics》2006,48(2):129-156

In this paper we investigate several tests for the hypothesis of a parametric form of the error distribution in the common linear and non‐parametric regression model, which are based on empirical processes of residuals. It is well known that tests in this context are not asymptotically distribution‐free and the parametric bootstrap is applied to deal with this problem. The performance of the resulting bootstrap test is investigated from an asymptotic point of view and by means of a simulation study. The results demonstrate that even for moderate sample sizes the parametric bootstrap provides a reliable and easy accessible solution to the problem of goodness‐of‐fit testing of assumptions regarding the error distribution in linear and non‐parametric regression models. 相似文献

5.

A Family of Goodness‐of‐Fit Tests for Copulas Based on Characteristic Functions

《Scandinavian Journal of Statistics》2018,45(2):301-323

A general class of rank statistics based on the characteristic function is introduced for testing goodness‐of‐fit hypotheses about the copula of a continuous random vector. These statistics are defined as L ₂ weighted functional distances between a nonparametric estimator and a semi‐parametric estimator of the characteristic function associated with a copula. It is shown that these statistics behave asymptotically as degenerate V ‐statistics of order four and that the limit distributions have representations in terms of weighted sums of independent chi‐square variables. The consistency of the tests against general alternatives is established and an asymptotically valid parametric bootstrap is suggested for the computation of the critical values of the tests. The behaviour of the new tests in small and moderate sample sizes is investigated with the help of simulations and compared with a competing test based on the empirical copula. Finally, the methodology is illustrated on a five‐dimensional data set. 相似文献

6.

On a new goodness‐of‐fit process for families of copulas

Mhamed Mesfioui Jean‐François Quessy Marie‐Hélène Toupin 《Revue canadienne de statistique》2009,37(1):80-101

A goodness‐of‐fit procedure is proposed for parametric families of copulas. The new test statistics are functionals of an empirical process based on the theoretical and sample versions of Spearman's dependence function. Conditions under which this empirical process converges weakly are seen to hold for many families including the Gaussian, Frank, and generalized Farlie–Gumbel–Morgenstern systems of distributions, as well as the models with singular components described by Durante [Durante ( 2007 ) Comptes Rendus Mathématique. Académie des Sciences. Paris, 344, 195–198]. Thanks to a parametric bootstrap method that allows to compute valid P‐values, it is shown empirically that tests based on Cramér–von Mises distances keep their size under the null hypothesis. Simulations attesting the power of the newly proposed tests, comparisons with competing procedures and complete analyses of real hydrological and financial data sets are presented. The Canadian Journal of Statistics 37: 80‐101; 2009 © 2009 Statistical Society of Canada 相似文献

7.

Exact Goodness‐of‐Fit Testing for the Ising Model

下载免费PDF全文

Abraham Martín del Campo Sarah Cepeda Caroline Uhler 《Scandinavian Journal of Statistics》2017,44(2):285-306

The Ising model is one of the simplest and most famous models of interacting systems. It was originally proposed to model ferromagnetic interactions in statistical physics and is now widely used to model spatial processes in many areas such as ecology, sociology, and genetics, usually without testing its goodness of fit. Here, we propose various test statistics and an exact goodness‐of‐fit test for the finite‐lattice Ising model. The theory of Markov bases has been developed in algebraic statistics for exact goodness‐of‐fit testing using a Monte Carlo approach. However, finding a Markov basis is often computationally intractable. Thus, we develop a Monte Carlo method for exact goodness‐of‐fit testing for the Ising model that avoids computing a Markov basis and also leads to a better connectivity of the Markov chain and hence to a faster convergence. We show how this method can be applied to analyze the spatial organization of receptors on the cell membrane. 相似文献

8.

A likelihood ratio test for goodness‐of‐fit of recessive and dominant models for case–control studies

Meng Qian Yongzhao Shao 《Revue canadienne de statistique》2013,41(2):341-352

Testing goodness‐of‐fit of commonly used genetic models is of critical importance in many applications including association studies and testing for departure from Hardy–Weinberg equilibrium. Case–control design has become widely used in population genetics and genetic epidemiology, thus it is of interest to develop powerful goodness‐of‐fit tests for genetic models using case–control data. This paper develops a likelihood ratio test (LRT) for testing recessive and dominant models for case–control studies. The LRT statistic has a closed‐form formula with a simple $\chi^{2}(1)$ null asymptotic distribution, thus its implementation is easy even for genome‐wide association studies. Moreover, it has the same power and optimality as when the disease prevalence is known in the population. The Canadian Journal of Statistics 41: 341–352; 2013 © 2013 Statistical Society of Canada 相似文献

9.

Non‐parametric Copula Estimation Under Bivariate Censoring

下载免费PDF全文

Svetlana Gribkova Olivier Lopez 《Scandinavian Journal of Statistics》2015,42(4):925-946

In this paper, we consider non‐parametric copula inference under bivariate censoring. Based on an estimator of the joint cumulative distribution function, we define a discrete and two smooth estimators of the copula. The construction that we propose is valid for a large range of estimators of the distribution function and therefore for a large range of bivariate censoring frameworks. Under some conditions on the tails of the distributions, the weak convergence of the corresponding copula processes is obtained in l^∞([0,1]²). We derive the uniform convergence rates of the copula density estimators deduced from our smooth copula estimators. Investigation of the practical behaviour of these estimators is performed through a simulation study and two real data applications, corresponding to different censoring settings. We use our non‐parametric estimators to define a goodness‐of‐fit procedure for parametric copula models. A new bootstrap scheme is proposed to compute the critical values. 相似文献

10.

A Case Study for Modelling Cancer Incidence Using Bayesian Spatio‐Temporal Models

下载免费PDF全文

Su Yun Kang James McGree Peter Baade Kerrie Mengersen 《Australian & New Zealand Journal of Statistics》2015,57(3):325-345

Researchers familiar with spatial models are aware of the challenge of choosing the level of spatial aggregation. Few studies have been published on the investigation of temporal aggregation and its impact on inferences regarding disease outcome in space–time analyses. We perform a case study for modelling individual disease outcomes using several Bayesian hierarchical spatio‐temporal models, while taking into account the possible impact of spatial and temporal aggregation. Using longitudinal breast cancer data from South East Queensland, Australia, we consider both parametric and non‐parametric formulations for temporal effects at various levels of aggregation. Two temporal smoothness priors are considered separately; each is modelled with fixed effects for the covariates and an intrinsic conditional autoregressive prior for the spatial random effects. Our case study reveals that different model formulations produce considerably different model performances. For this particular dataset, a classical parametric formulation that assumes a linear time trend produces the best fit among the five models considered. Different aggregation levels of temporal random effects were found to have little impact on model goodness‐of‐fit and estimation of fixed effects. 相似文献

11.

Goodness‐of‐fit tests for linear regression models with missing response data

Wenceslao Gonzlez‐Manteiga Ana Perz‐Gonzlez 《Revue canadienne de statistique》2006,34(1):149-170

The authors show how to test the goodness‐of‐fit of a linear regression model when there are missing data in the response variable. Their statistics are based on the L₂ distance between nonparametric estimators of the regression function and a ‐consistent estimator of the same function under the parametric model. They obtain the limit distribution of the statistics and check the validity of their bootstrap version. Finally, a simulation study allows them to examine the behaviour of their tests, whether the samples are complete or not. 相似文献

12.

Testing goodness of fit via nonparametric function estimation techniques

R.L. Eubank J.D. Hart V.N. LaRiccia 《统计学通讯:理论与方法》2013,42(12):3327-3354

An overview is given of methodology for testing goodness of fit of parametric models using nonparametric function estimation techniques. The ideas are illustrated in two settings: the classical one-sample goodness-of-fit scenario and testing the goodness of fit of a polynomial regression model. 相似文献

13.

Likelihood Ratio Tests for Dependent Data with Applications to Longitudinal and Functional Data Analysis

Ana‐Maria Staicu Yingxing Li Ciprian M. Crainiceanu David Ruppert 《Scandinavian Journal of Statistics》2014,41(4):932-949

This paper introduces a general framework for testing hypotheses about the structure of the mean function of complex functional processes. Important particular cases of the proposed framework are as follows: (1) testing the null hypothesis that the mean of a functional process is parametric against a general alternative modelled by penalized splines; and (2) testing the null hypothesis that the means of two possibly correlated functional processes are equal or differ by only a simple parametric function. A global pseudo‐likelihood ratio test is proposed, and its asymptotic distribution is derived. The size and power properties of the test are confirmed in realistic simulation scenarios. Finite‐sample power results indicate that the proposed test is much more powerful than competing alternatives. Methods are applied to testing the equality between the means of normalized δ‐power of sleep electroencephalograms of subjects with sleep‐disordered breathing and matched controls. 相似文献

14.

Analysis of band‐recovery data in a multistate capture‐recapture framework

Gilles Gauthier Jean‐Dominique Lebreton 《Revue canadienne de statistique》2008,36(1):59-73

Dead recoveries of marked animals are commonly used to estimate survival probabilities. Band‐recovery models can be parameterized either by r (the probability of recovering a band conditional on death of the animal) or by f (the probability that an animal will be killed, retrieved, and have its band reported). The T parametrization can be implemented in a capture‐recapture framework with two states (alive and newly dead), mortality being the transition probability between the two states. The authors show here that the f parametrization can also be implemented in a multistate framework by imposing simple constraints on some parameters. They illustrate it using data on the mallard and the snow goose. However, they mention that because it does not entirely separate the individual survival and encounter processes, the f parametrization must be used with care on reduced models, or in the presence of estimates at the boundary of the parameter space. As they show, a multistate framework allows the use of powerful software for model fitting or testing the goodness‐of‐fit of models; it also affords the implementation of complex models such as those based on mixture of information or uncertain states 相似文献

15.

Empirical Likelihood Intervals for Conditional Value‐at‐Risk in Heteroscedastic Regression Models

ZHOUPING LI YUN GONG LIANG PENG 《Scandinavian Journal of Statistics》2011,38(4):781-787

Abstract. Non‐parametric regression models have been studied well including estimating the conditional mean function, the conditional variance function and the distribution function of errors. In addition, empirical likelihood methods have been proposed to construct confidence intervals for the conditional mean and variance. Motivated by applications in risk management, we propose an empirical likelihood method for constructing a confidence interval for the pth conditional value‐at‐risk based on the non‐parametric regression model. A simulation study shows the advantages of the proposed method. 相似文献

16.

On inference for a semiparametric partially linear regression model with serially correlated errors

Jinhong You Gemai Chen 《Revue canadienne de statistique》2007,35(4):515-531

The authors consider a semiparametric partially linear regression model with serially correlated errors. They propose a new way of estimating the error structure which has the advantage that it does not involve any nonparametric estimation. This allows them to develop an inference procedure consisting of a bandwidth selection method, an efficient semiparametric generalized least squares estimator of the parametric component, a goodness‐of‐fit test based on the bootstrap, and a technique for selecting significant covariates in the parametric component. They assess their approach through simulation studies and illustrate it with a concrete application. 相似文献

17.

An Optimal Semiparametric Method for Two‐group Classification

《Scandinavian Journal of Statistics》2018,45(3):806-846

In the classical discriminant analysis, when two multivariate normal distributions with equal variance–covariance matrices are assumed for two groups, the classical linear discriminant function is optimal with respect to maximizing the standardized difference between the means of two groups. However, for a typical case‐control study, the distributional assumption for the case group often needs to be relaxed in practice. Komori et al. (Generalized t ‐statistic for two‐group classification. Biometrics 2015, 71: 404–416) proposed the generalized t ‐statistic to obtain a linear discriminant function, which allows for heterogeneity of case group. Their procedure has an optimality property in the class of consideration. We perform a further study of the problem and show that additional improvement is achievable. The approach we propose does not require a parametric distributional assumption on the case group. We further show that the new estimator is efficient, in that no further improvement is possible to construct the linear discriminant function more efficiently. We conduct simulation studies and real data examples to illustrate the finite sample performance and the gain that it produces in comparison with existing methods. 相似文献

18.

A new test for the extreme value distribution

Aydin Öztürk Serdar Korukogu 《统计学通讯:模拟与计算》2013,42(4):1375-1393

In this paper a test statistic which is a modification of the W statistic for testing the goodness of fit for the two paremeter extreme value (smallest element) distribution is proposed. The test statistic Is obtained as the ratio of two linear estimates of the scale parameter. It Is shown that the suggested statistic is computationally simple and has good power properties. Percentage points of the statistic are obtained by performing Monte Carlo experiments. An example is given to illustrate the test procedure. 相似文献

19.

Goodness‐of‐Fit based on Downsampling with Applications to Linear Drift Diffusions

JULIE L. FORMAN BO MARKUSSEN HELLE SØRENSEN 《Scandinavian Journal of Statistics》2011,38(2):288-310

Abstract. A goodness‐of‐fit test for continuous‐time models is developed that examines if the parameter estimates are consistent with another for different sampling frequencies. The test compares parameter estimates obtained from estimating functions for downsamples of the data. We prove asymptotic results for stationary and ergodic processes, and apply the downsampling test to linear drift diffusions. Simulations indicate that the test is quite powerful in detecting non‐Markovian deviations from the linear drift diffusions. 相似文献

20.

Pearson-type goodness-of-fit tests for regression

M. G. Akritas A. F. Torbeyns 《Revue canadienne de statistique》1997,25(3):359-374

A procedure for testing the goodness of fit of linear regression models is introduced. For a given partition of the real line into cells, the proposed test is a quadratic form based on the vector of observed minus expected frequencies of the residuals obtained by maximum-likelihood estimation of the regression parameters. The quadratic form is of the same computational difficulty as the traditional Pearson-type tests with uncensored data. A statistic based on only one cell is particularly easy to apply and is used for testing the normality assumption in a real data set from astronomy. A simulation study examines the finite-sample properties of the proposed tests. 相似文献