共查询到20条相似文献,搜索用时 31 毫秒
1.
Trias Wahyuni Rakhmawati Geert Molenberghs Geert Verbeke Christel Faes 《Journal of applied statistics》2017,44(4):620-641
Since the seminal paper by Cook and Weisberg [9], local influence, next to case deletion, has gained popularity as a tool to detect influential subjects and measurements for a variety of statistical models. For the linear mixed model the approach leads to easily interpretable and computationally convenient expressions, not only highlighting influential subjects, but also which aspect of their profile leads to undue influence on the model's fit [17]. Ouwens et al. [24] applied the method to the Poisson-normal generalized linear mixed model (GLMM). Given the model's nonlinear structure, these authors did not derive interpretable components but rather focused on a graphical depiction of influence. In this paper, we consider GLMMs for binary, count, and time-to-event data, with the additional feature of accommodating overdispersion whenever necessary. For each situation, three approaches are considered, based on: (1) purely numerical derivations; (2) using a closed-form expression of the marginal likelihood function; and (3) using an integral representation of this likelihood. Unlike when case deletion is used, this leads to interpretable components, allowing not only to identify influential subjects, but also to study the cause thereof. The methodology is illustrated in case studies that range over the three data types mentioned. 相似文献
2.
Abhik Ghosh 《Journal of applied statistics》2015,42(9):2056-2072
The density power divergence (DPD) measure, defined in terms of a single parameter α, has proved to be a popular tool in the area of robust estimation [1]. Recently, Ghosh and Basu [5] rigorously established the asymptotic properties of the MDPDEs in case of independent non-homogeneous observations. In this paper, we present an extensive numerical study to describe the performance of the method in the case of linear regression, the most common setup under the case of non-homogeneous data. In addition, we extend the existing methods for the selection of the optimal robustness tuning parameter from the case of independent and identically distributed (i.i.d.) data to the case of non-homogeneous observations. Proper selection of the tuning parameter is critical to the appropriateness of the resulting analysis. The selection of the optimal robustness tuning parameter is explored in the context of the linear regression problem with an extensive numerical study involving real and simulated data. 相似文献
3.
Housila P. Singh 《统计学通讯:理论与方法》2017,46(2):521-531
This paper aimed at providing an efficient new unbiased estimator for estimating the proportion of a potentially sensitive attribute in survey sampling. The suggested randomization device makes use of the means, variances of scrambling variables, and the two scalars lie between “zero” and “one.” Thus, the same amount of information has been used at the estimation stage. The variance formula of the suggested estimator has been obtained. We have compared the proposed unbiased estimator with that of Kuk (1990) and Franklin (1989), and Singh and Chen (2009) estimators. Relevant conditions are obtained in which the proposed estimator is more efficient than Kuk (1990) and Franklin (1989) and Singh and Chen (2009) estimators. The optimum estimator (OE) in the proposed class of estimators has been identified which finally depends on moments ratios of the scrambling variables. The variance of the optimum estimator has been obtained and compared with that of the Kuk (1990) and Franklin (1989) estimator and Singh and Chen (2009) estimator. It is interesting to mention that the “optimum estimator” of the class of estimators due to Singh and Chen (2009) depends on the parameter π under investigation which limits the use of Singh and Chen (2009) OE in practice while the proposed OE in this paper is free from such a constraint. The proposed OE depends only on the moments ratios of scrambling variables. This is an advantage over the Singh and Chen (2009) estimator. Numerical illustrations are given in the support of the present study when the scrambling variables follow normal distribution. Theoretical and empirical results are very sound and quite illuminating in the favor of the present study. 相似文献
4.
The Jackknife-after-bootstrap (JaB) technique originally developed by Efron [8] has been proposed as an approach to improve the detection of influential observations in linear regression models by Martin and Roberts [12] and Beyaztas and Alin [2]. The method is based on the use of percentile-method confidence intervals to provide improved cut-off values for several single case-deletion influence measures. In order to improve JaB, we propose using robust versions of Efron [7]’s bias-corrected and accelerated (BCa) bootstrap confidence intervals. In this study, the performances of robust BCa–JaB and conventional JaB methods are compared in the cases of DFFITS, Welsch's distance and modified Cook's distance influence diagnostics. Comparisons are based on both real data examples and through a simulation study. Our results reveal that under a variety of scenarios, our proposed method provides more accurate and reliable results, and it is more robust to masking effects. 相似文献
5.
Karima Boualam 《统计学通讯:理论与方法》2017,46(18):9218-9229
In this article, we investigate the asymptotic normality of the Hill's estimator of the tail index parameter, when the observations are weakly dependent in the sense of Doukhan and Louhichi (1999) and are drawn from a strictly linear process. We show that the previous result on Hill estimator obtained by Rootzen et al. (1990) and Resnick and Starica (1997) for strong mixing can be extended to weak dependence. 相似文献
6.
We discuss a one-sample location test that can be used when the dimension and the sample size are large. It is well-known that the power of Hotelling’s test decreases when the dimension is close to the sample size. To address this loss of power, some non exact approaches were proposed, e.g., Dempster (1958, 1960), Bai and Saranadasa (1996), and Srivastava and Du (2008). In this article, we focus on Hotelling’s test and Dempster’s test. The comparative merits and demerits of these two tests vary according to the local parameters. In particular, we consider the situation where it is difficult to determine which test should be used, that is, where the two tests are asymptotically equivalent in terms of local power. We propose a new statistic based on the weighted averaging of Hotelling’s T2-statistic and Dempster’s statistic that can be applied in such a situation. Our weight is determined on the basis of the maximum local asymptotic power on a restricted parameter space that induces local asymptotic equivalence between Hotelling’s test and Dempster’s test. Numerical results show that our test is more stable than Hotelling’s T2-statistic and Dempster’s statistic in most parameter settings. 相似文献
7.
Yang Xing 《统计学通讯:理论与方法》2013,42(5):972-982
The introduction of the Hausdorff α-entropy in Xing (2008a), Xing (2008b), Xing (2010), Xing (2011), and Xing and Ranneby (2009) has lead a series of improvements of well-known results on posterior consistency. In this paper we discuss an application of the Hausdorff α-entropy. We construct a universal prior distribution such that the corresponding posterior distribution is almost surely consistent. The approach of the construction of this type of prior distribution is natural, but it works very well for all separable models. We illustrate such prior distributions by examples. In particular, we obtain that if the true density function is known to be some normal probability density function with unknown mean and unknown variance then without any additional assumption one can construct a prior distribution which leads to posterior consistency. 相似文献
8.
This paper addresses a generalization of the bivariate Cauchy distribution discussed by Fang et al. (1990), derived from a trivariate normal distribution with a general correlation matrix. We obtain explicit expressions for the joint distribution function and joint density function, and show that they reduce in a special case to the corresponding expressions of Fang et al. (1990). Finally, we show that this generalized distribution is useful in determining the orthant probability of a bivariate skew-normal distribution of Azzalini and Dalla Valle (1996). 相似文献
9.
In this paper, the focus is on sequential analysis of multivariate financial time series with heavy tails. The mean vector and the covariance matrix of multivariate non linear models are simultaneously monitored by modifying conventional control charts to identify structural changes in the data. The considered target process is a constant conditional correlation model (cf. Bollerslev, 1990), an extended constant conditional correlation model (cf. He and Teräsvirta, 2004), a dynamic conditional correlation model (cf. Engle, 2002), or a generalized dynamic conditional correlation model (cf. Capiello et al., 2006). For statistical surveillance we use control charts based on residuals. Further, the procedures are constructed for t-distribution. The detection speed of these charts is compared via Monte Carlo simulation. In the empirical study, the procedure with the best performance is applied to log-returns of the stock market indices FTSE and CAC. 相似文献
10.
The proposed test detects deviations from randomness, without a priori distributional assumption, when observations are not independent and identically distributed (i.i.d.), which is suitable for our motivating stock market index data. Departures from i.i.d. are tested by subdividing data into subintervals and then using a conditional probability measure within intervals as a binomial test. This nonparametric test is designed to detect deviations of neighboring observations from randomness when the dataset consists of time series observations. Simulation results and a comparison with Lo and MacKinlay's (1988) variance ratio test showed that our proposed test is a competitive alternative. 相似文献
11.
This paper treats the problem of stochastic comparisons for the extreme order statistics arising from heterogeneous beta distributions. Some sufficient conditions involved in majorization-type partial orders are provided for comparing the extreme order statistics in the sense of various magnitude orderings including the likelihood ratio order, the reversed hazard rate order, the usual stochastic order, and the usual multivariate stochastic order. The results established here strengthen and extend those including Kochar and Xu (2007), Mao and Hu (2010), Balakrishnan et al. (2014), and Torrado (2015). A real application in system assembly and some numerical examples are also presented to illustrate the theoretical results. 相似文献
12.
13.
We develop a simple corrected score for logistic regression with errors-in-covariates. The new method is an extension of the consistent functional methods proposed by Huang and Wang (2001) and is closely related to the corrected score method by Nakamura (1990) and Stefanski (1989). The new method requires that the measurement error distribution is known, but does not require normality. The new method yields a consistent and asymptotically normal estimator under regularity conditions. We examine the finite-sample performance of the new estimator through simulation studies. Finally, we illustrate the new method by applying it to an AIDS study. 相似文献
14.
《统计学通讯:理论与方法》2013,42(8):1631-1646
Abstract In this paper we develop a Bayesian analysis for the nonlinear regression model with errors that follow a continuous autoregressive process. In this way, unequally spaced observations do not present a problem in the analysis. We employ the Gibbs sampler, (see Gelfand, A., Smith, A. (1990). Sampling based approaches to calculating marginal densities. J. Amer. Statist. Assoc. 85:398–409.), as the foundation for making Bayesian inferences. We illustrate these Bayesian inferences with an analysis of a real data-set. Using these same data, we contrast the Bayesian approach with a generalized least squares technique. 相似文献
15.
ABSTRACTIn this paper, we extend a variance shift model, previously considered in the linear mixed models, to the linear mixed measurement error models using the corrected likelihood of Nakamura (1990). This model assumes that a single outlier arises from an observation with inflated variance. We derive the score test and the analogue of the likelihood ratio test, to assess whether the ith observation has inflated variance. A parametric bootstrap procedure is implemented to obtain empirical distributions of the test statistics. Finally, results of a simulation study and an example of real data are presented to illustrate the performance of proposed tests. 相似文献
16.
M. A. A. Cox 《统计学通讯:理论与方法》2013,42(20):5050-5057
AbstractThe adoption of control charts can be traced to the classic text by Shewhart (1931) and championed by many writers since then, including Deming (1982). Numerous other texts and publications stress the continuing importance of this area. While tables of key Shewhart control chart parameters are extremely useful they are easily lost or mislaid and can sometimes be difficult to interpret. To address this issue spreadsheet code is implemented to produce all the key control chart factors. 相似文献
17.
This article proposes new symmetric and asymmetric distributions applying methods analogous as the ones in Kim (2005) and Arnold et al. (2009) to the exponentiated normal distribution studied in Durrans (1992), that we call the power-normal (PN) distribution. The proposed bimodal extension, the main focus of the paper, is called the bimodal power-normal model and is denoted by BPN(α) model, where α is the asymmetry parameter. The authors give some properties including moments and maximum likelihood estimation. Two important features of the model proposed is that its normalizing constant has closed and simple form and that the Fisher information matrix is nonsingular, guaranteeing large sample properties of the maximum likelihood estimators. Finally, simulation studies and real applications reveal that the proposed model can perform well in both situations. 相似文献
18.
In this article, our objective is to evaluate the performance of different tests which are used to compare the equality of more than two location parameters. We have considered six tests (including some commonly used) in this study, one of which is parametric and the others are nonparametric. These tests include the usual F test (Fisher and Mackenzie, 1923), Kruskal–Wallis test (Kruskall and Wallis, 1952), Kolmogorov–Smirnov test (David, 1958), the g test (Stekler, 1987), f test (Batchelor, 1990), and Extension of Median test (as given in Daniel, 1990). Performance of these tests are compared under different symmetric, skewed and contaminated probability distributions that include Normal, Cauchy, Uniform, Laplace, Lognormal, Exponential, Weibull, Gamma, t, Chi-square, Half Normal, Mixed Weibull, and Mixed Normal. Performances of these tests are measured in terms of power. We have suggested appropriate tests which may perform better under different situations. It is expected that researchers will find these results useful in decision making. 相似文献
19.
Marek Dvořák 《统计学通讯:理论与方法》2017,46(1):465-484
The aim of this article is the construction of the test statistic for the detection of changes in vector autoregressive (AR) models where both AR parameters and the variance matrix of the error term are the subjects of a change. The approximating distribution of the proposed statistic is the Gumbel distribution. The proof stands on the approximation of weakly dependent random vectors by independent ones and by application of Horváth’s extension of Darling-Erdös extremal result for random vectors, see Darling and Erdös (1956) and Horváth (1993). The test statistic is a modification of the likelihood ratio. 相似文献
20.
This article describes how diagnostic procedures were derived for symmetrical nonlinear regression models, continuing the work carried out by Cysneiros and Vanegas (2008) and Vanegas and Cysneiros (2010), who showed that the parameters estimates in nonlinear models are more robust with heavy-tailed than with normal errors. In this article, we focus on assessing if the robustness of this kind of models is also observed in the inference process (i.e., partial F-test). Symmetrical nonlinear regression models includes all symmetric continuous distributions for errors covering both light- and heavy-tailed distributions such as Student-t, logistic-I and -II, power exponential, generalized Student-t, generalized logistic, and contaminated normal. Firstly, a statistical test is shown to evaluating the assumption that the error terms all have equal variance. The results of simulation studies which describe the behavior of the test for heteroscedasticity proposed in the presence of outliers are then given. To assess the robustness of inference process, we present the results of a simulation study which described the behavior of partial F-test in the presence of outliers. Also, some diagnostic procedures are derived to identify influential observations on the partial F-test. As ilustration, a dataset described in Venables and Ripley (2002), is also analyzed. 相似文献