首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Detection of outliers or influential observations is an important work in statistical modeling, especially for the correlated time series data. In this paper we propose a new procedure to detect patch of influential observations in the generalized autoregressive conditional heteroskedasticity (GARCH) model. Firstly we compare the performance of innovative perturbation scheme, additive perturbation scheme and data perturbation scheme in local influence analysis. We find that the innovative perturbation scheme give better result than other two schemes although this perturbation scheme may suffer from masking effects. Then we use the stepwise local influence method under innovative perturbation scheme to detect patch of influential observations and uncover the masking effects. The simulated studies show that the new technique can successfully detect a patch of influential observations or outliers under innovative perturbation scheme. The analysis based on simulation studies and two real data sets show that the stepwise local influence method under innovative perturbation scheme is efficient for detecting multiple influential observations and dealing with masking effects in the GARCH model.  相似文献   

2.
The purpose of this paper is to develop influence diagnostics for AR(1) models under the innovative and the data perturbation schemes. There are four main contributions. First, we derive analytical expressions for the slope and curvature statistics. Second, we establish a relationship between the slope and curvature showing that the standardised slope and standardised curvature are equal for the innovative perturbation scheme, and these vectors are nearly identical for several values of the autoregressive parameter, for the data perturbation scheme. Third, we present a connection between the influence statistics and the tests for outlier detection. Fourth, for the innovative perturbation scheme, we derive the asymptotic distribution of a new influence statistic, whereas for the data perturbation scheme, the distribution of the influence statistics is obtained via Monte Carlo simulation. We additionally discuss practical guidelines for the use of local influence statistics, which are illustrated on a chemical process data set.  相似文献   

3.
The aim of this paper is to develop a Bayesian local influence method (Zhu et al. 2009, submitted) for assessing minor perturbations to the prior, the sampling distribution, and individual observations in survival analysis. We introduce a perturbation model to characterize simultaneous (or individual) perturbations to the data, the prior distribution, and the sampling distribution. We construct a Bayesian perturbation manifold to the perturbation model and calculate its associated geometric quantities including the metric tensor to characterize the intrinsic structure of the perturbation model (or perturbation scheme). We develop local influence measures based on several objective functions to quantify the degree of various perturbations to statistical models. We carry out several simulation studies and analyze two real data sets to illustrate our Bayesian local influence method in detecting influential observations, and for characterizing the sensitivity to the prior distribution and hazard function.  相似文献   

4.
Longitudinal count responses are often analyzed with a Poisson mixed model. However, under overdispersion, these responses are better described by a negative binomial mixed model. Estimators of the corresponding parameters are usually obtained by the maximum likelihood method. To investigate the stability of these maximum likelihood estimators, we propose a methodology of sensitivity analysis using local influence. As count responses are discrete, we are unable to perturb them with the standard scheme used in local influence. Then, we consider an appropriate perturbation for the means of these responses. The proposed methodology is useful in different applications, but particularly when medical data are analyzed, because the removal of influential cases can change the statistical results and then the medical decision. We study the performance of the methodology by using Monte Carlo simulation and applied it to real medical data related to epilepsy and headache. All of these numerical studies show the good performance and potential of the proposed methodology.  相似文献   

5.
We introduce a new class of heteroscedastic log-exponentiated Weibull (LEW) regression models. The class of regression models can be applied to censored data and be used more effectively in survival analysis. Maximum likelihood estimation of the model parameters with censored data as well as influence diagnostics for the new regression model is investigated. For different parameter settings, sample sizes and censoring percentages, various simulation studies are performed and compared to the performance of the heteroscedastic LEW regression model. The normal curvatures for studying local influence are derived under various perturbation schemes. An empirical application to a real data set is provided to illustrate the usefulness of the new class of heteroscedastic regression models.  相似文献   

6.
We formulate a new cure rate survival model by assuming that the number of competing causes of the event of interest has the Poisson distribution, and the time to this event has the generalized linear failure rate distribution. A new distribution to analyze lifetime data is defined from the proposed cure rate model, and its quantile function as well as a general expansion for the moments is derived. We estimate the parameters of the model with cure rate in the presence of covariates for censored observations using maximum likelihood and derive the observed information matrix. We obtain the appropriate matrices for assessing local influence on the parameter estimates under different perturbation schemes and present some ways to perform global influence analysis. The usefulness of the proposed cure rate survival model is illustrated in an application to real data.  相似文献   

7.
In this paper, we propose a cure rate survival model by assuming that the number of competing causes of the event of interest follows the Poisson distribution and the time to event has the Birnbaum–Saunders (BS) distribution. We define the Poisson BS distribution and provide two useful representations for its density function which facilitate to obtain some mathematical properties. Two closed-form expressions for the moments of the new distribution are given. We estimate the parameters of the model with cure rate using maximum likelihood. For different parameter settings, sample sizes and censoring percentages, several simulations are performed. We derive the appropriate matrices for assessing local influence on the parameter estimates under different perturbation schemes and present some ways to perform a global influence study. We analyse a real data set from the medical area.  相似文献   

8.
In this paper, we discuss the bivariate Birnbaum-Saunders accelerated lifetime model, in which we have modeled the dependence structure of bivariate survival data through the use of frailty models. Specifically, we propose the bivariate model Birnbaum-Saunders with the following frailty distributions: gamma, positive stable and logarithmic series. We present a study of inference and diagnostic analysis for the proposed model, more concisely, are proposed a diagnostic analysis based in local influence and residual analysis to assess the fit model, as well as, to detect influential observations. In this regard, we derived the normal curvatures of local influence under different perturbation schemes and we performed some simulation studies for assessing the potential of residuals to detect misspecification in the systematic component, the presence in the stochastic component of the model and to detect outliers. Finally, we apply the methodology studied to real data set from recurrence in times of infections of 38 kidney patients using a portable dialysis machine, we analyzed these data considering independence within the pairs and using the bivariate Birnbaum-Saunders accelerated lifetime model, so that we could make a comparison and verify the importance of modeling dependence within the times of infection associated with the same patient.  相似文献   

9.
We extend the confidence interval construction procedure for location for symmetric iid data using the one-sample Wilcoxon signed rank statistic (T+) to stationary time series data. We propose a normal approximation procedure when explicit knowledge of the underlying dependence structure/distribution is unknown. By conducting extensive simulations from linear and nonlinear time series models, we show that the extended procedure is a strong contender for use in the construction of confidence intervals in time series analysis. Finally we demonstrate real application implementations in two case studies.  相似文献   

10.
The method of local influence is generalized to the multivariate regression. The scheme of perturbations adopted in multivariate regression is similar in spirit to the perturbation of case-weights in univariate regression case. The method developed here is useful for identifying influential observations in multivariate regression as an exploratory or confirmatory data analysis. An illustrative example is given for the effectiveness of the local influence approach in multivariate regression.  相似文献   

11.
In this paper, a local influence approach is employed to assess adequacy of the growth curve model with an unstructured covariance, based on likelihood displacement. The Hessian matrix of the model is investigated in detail under an abstract perturbation scheme. For illustration, covariance-weighted perturbation is discussed and used to analyze two real-life biological data sets, which show that the criteria presented in this article are useful in practice.  相似文献   

12.
We investigate local influence analysis in functional comparative calibration models with replicated data. A method for selecting appropriate perturbation schemes based on the expected Fisher information matrix with respect to the perturbation vector is proposed. It is shown that arbitrarily perturbing these models may result in misleading inference about the influential subjects. First-order influence measures for identifying the correct influential subjects and replicates on corrected score estimators are defined. We introduce different perturbation schemes including perturbation of subjects and replicates on the corrected likelihood function and obtain the density of the perturbed model from which the methodology is based. Particularly, three perturbation of variances schemes could be a better way to handle badly modeled subjects or replicates. Two real data sets are analyzed to illustrate the use of our local influence measures.  相似文献   

13.
The use of bivariate distributions plays a fundamental role in survival and reliability studies. In this paper, we introduce a location-scale model for bivariate survival times based on the copula to model the dependence of bivariate survival data with cure fraction. We create the correlation structure between the failure times using the Clayton family of copulas, which is assumed to have any distribution. It turns out that the model becomes very flexible with respect to the choice of the marginal distributions. For the proposed model, we consider inferential procedures based on constrained parameters under maximum likelihood. We derive the appropriate matrices for assessing local influence under different perturbation schemes and present some ways to perform global influence analysis. The relevance of the approach is illustrated using a real data set and a diagnostic analysis is performed to select an appropriate model.  相似文献   

14.
The local influence method is adapted to canonical correlation analysis for the purpose of investigating the influence of observations. We consider a perturbation based on the empirical distribution function. An illustrative example is given to show the effectiveness of the local influence method for the identification of influential observations.  相似文献   

15.
We propose a new regression-based filter for extracting signals online from multivariate high frequency time series. It separates relevant signals of several variables from noise and (multivariate) outliers.

Unlike parallel univariate filters, the new procedure takes into account the local covariance structure between the single time series components. It is based on high-breakdown estimates, which makes it robust against (patches of) outliers in one or several of the components as well as against outliers with respect to the multivariate covariance structure. Moreover, the trade-off problem between bias and variance for the optimal choice of the window width is approached by choosing the size of the window adaptively, depending on the current data situation.

Furthermore, we present an advanced algorithm of our filtering procedure that includes the replacement of missing observations in real time. Thus, the new procedure can be applied in online-monitoring practice. Applications to physiological time series from intensive care show the practical effect of the proposed filtering technique.  相似文献   

16.
We study the nonparametric maximum likelihood estimate (NPMLE) of the cdf or sub-distribution functions of the failure time for the failure causes in a series system. The study is motivated by a cancer research data (from the Memorial Sloan-Kettering Cancer Center) with interval-censored time and masked failure cause. The NPMLE based on this data set suggests that the existing masking models are not appropriate. We propose a new model called the random partition masking model, which does not rely on the commonly used symmetry assumption (namely, given the failure cause, the probability of observing the masked failure causes is independent of the failure time; see Flehinger et al. Inference about defects in the presence of masking, Technometrics 38 (1996), pp. 247–255). The RPM model is easier to implement in simulation studies than the existing models. We discuss the algorithms for computing the NPMLE and study its asymptotic properties. Our simulation and data analysis indicate that the NPMLE is feasible for a moderate sample size.  相似文献   

17.
In this paper, we propose a multivariate log-linear Birnbaum–Saunders regression model. We discuss maximum-likelihood estimation of the model parameters and provide closed-form expressions for the score function and for Fisher's information matrix. Hypothesis testing is performed using approximations obtained from the asymptotic normality of the maximum-likelihood estimator. Some influence methods, such as the local influence and generalized leverage are discussed and the normal curvatures for studying local influence are derived under some perturbation schemes. Further, a test for the homogeneity of the shape parameter of the multivariate regression model is investigated. A real data set is presented for illustrative purposes.  相似文献   

18.
The spectral analysis of Gaussian linear time-series processes is usually based on uni-frequential tools because the spectral density functions of degree 2 and higher are identically zero and there is no polyspectrum in this case. In finite samples, such an approach does not allow the resolution of closely adjacent spectral lines, except by using autoregressive models of excessively high order in the method of maximum entropy. In this article, multi-frequential periodograms designed for the analysis of discrete and mixed spectra are defined and studied for their properties in finite samples. For a given vector of frequencies ω, the sum of squares of the corresponding trigonometric regression model fitted to a time series by unweighted least squares defines the multi-frequential periodogram statistic IM(ω). When ω is unknown, it follows from the properties of nonlinear models whose parameters separate (i.e., the frequencies and the cosine and sine coefficients here) that the least-squares estimator of frequencies is obtained by maximizing I M(ω). The first-order, second-order and distribution properties of I M(ω) are established theoretically in finite samples, and are compared with those of Schuster's uni-frequential periodogram statistic. In the multi-frequential periodogram analysis, the least-squares estimator of frequencies is proved to be theoretically unbiased in finite samples if the number of periodic components of the time series is correctly estimated. Here, this number is estimated at the end of a stepwise procedure based on pseudo-Flikelihood ratio tests. Simulations are used to compare the stepwise procedure involving I M(ω) with a stepwise procedure using Schuster's periodogram, to study an approximation of the asymptotic theory for the frequency estimators in finite samples in relation to the proximity and signal-to-noise ratio of the periodic components, and to assess the robustness of I M(ω) against autocorrelation in the analysis of mixed spectra. Overall, the results show an improvement of the new method over the classical approach when spectral lines are adjacent. Finally, three examples with real data illustrate specific aspects of the method, and extensions (i.e., unequally spaced observations, trend modeling, replicated time series, periodogram matrices) are outlined.  相似文献   

19.
Abstract

We propose a cure rate survival model by assuming that the number of competing causes of the event of interest follows the negative binomial distribution and the time to the event of interest has the Birnbaum-Saunders distribution. Further, the new model includes as special cases some well-known cure rate models published recently. We consider a frequentist analysis for parameter estimation of the negative binomial Birnbaum-Saunders model with cure rate. Then, we derive the appropriate matrices for assessing local influence on the parameter estimates under different perturbation schemes. We illustrate the usefulness of the proposed model in the analysis of a real data set from the medical area.  相似文献   

20.
We define the exponentiated power exponential distribution and propose a regression model with different systematic structures based on the new distribution. We show that the new regression model can be applied to dispersion data since it represents a parametric family of models that includes as sub-models some widely-known regression models. It then can be used more effectively in the analysis of real data. We use maximum likelihood estimation and derive the appropriate matrices for assessing local influence on the parameter estimates under different perturbation schemes. Some global-influence measurements are also investigated and simulation studies are performed to evaluate the accuracy of the estimates. We provide an application of the regression model with four systematic structures to nursing activities score data in the Unit of the Medical Clinic of University of São Paulo (USP) Hospital.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号