期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A class of residuals for outlier identification in zero adjusted regression models

Gustavo H. A. Pereira Juliana Scudilio Manoel Santos-Neto Denise A. Botter Mnica C. Sandoval 《Journal of applied statistics》2020,47(10):1833

Zero adjusted regression models are used to fit variables that are discrete at zero and continuous at some interval of the positive real numbers. Diagnostic analysis in these models is usually performed using the randomized quantile residual, which is useful for checking the overall adequacy of a zero adjusted regression model. However, it may fail to identify some outliers. In this work, we introduce a class of residuals for outlier identification in zero adjusted regression models. Monte Carlo simulation studies and two applications suggest that one of the residuals of the class introduced here has good properties and detects outliers that are not identified by the randomized quantile residual. 相似文献

2.

Residuals for log-Burr XII regression models in survival analysis

Giovana O. Silva Gilberto A. Paula 《Journal of applied statistics》2011,38(7):1435-1445

In this paper, we compare three residuals to assess departures from the error assumptions as well as to detect outlying observations in log-Burr XII regression models with censored observations. These residuals can also be used for the log-logistic regression model, which is a special case of the log-Burr XII regression model. For different parameter settings, sample sizes and censoring percentages, various simulation studies are performed and the empirical distribution of each residual is displayed and compared with the standard normal distribution. These studies suggest that the residual analysis usually performed in normal linear regression models can be straightforwardly extended to the modified martingale-type residual in log-Burr XII regression models with censored data. 相似文献

3.

Focused information criterion and model averaging based on weighted composite quantile regression

Ganggang Xu Suojin Wang Jianhua Z. Huang 《Scandinavian Journal of Statistics》2014,41(2):365-381

We study the focused information criterion and frequentist model averaging and their application to post‐model‐selection inference for weighted composite quantile regression (WCQR) in the context of the additive partial linear models. With the non‐parametric functions approximated by polynomial splines, we show that, under certain conditions, the asymptotic distribution of the frequentist model averaging WCQR‐estimator of a focused parameter is a non‐linear mixture of normal distributions. This asymptotic distribution is used to construct confidence intervals that achieve the nominal coverage probability. With properly chosen weights, the focused information criterion based WCQR estimators are not only robust to outliers and non‐normal residuals but also can achieve efficiency close to the maximum likelihood estimator, without assuming the true error distribution. Simulation studies and a real data analysis are used to illustrate the effectiveness of the proposed procedure. 相似文献

4.

Deviance residuals in generalised log-gamma regression models with censored observations

《Journal of Statistical Computation and Simulation》2012,82(8):747-764

In this article, we compare three residuals based on the deviance component in generalised log-gamma regression models with censored observations. For different parameter settings, sample sizes and censoring percentages, various simulation studies are performed and the empirical distribution of each residual is displayed and compared with the standard normal distribution. For all cases studied, the empirical distributions of the proposed residuals are in general symmetric around zero, but only a martingale-type residual presented negligible kurtosis for the majority of the cases studied. These studies suggest that the residual analysis usually performed in normal linear regression models can be straightforwardly extended for the martingale-type residual in generalised log-gamma regression models with censored data. A lifetime data set is analysed under log-gamma regression models and a model checking based on the martingale-type residual is performed. 相似文献

5.

Deviance Residuals for an Angular Response

Francisco A.M. de Souza & Gilberto A. Paula 《Australian & New Zealand Journal of Statistics》2002,44(3):345-356

This paper discusses deviance residual approximations in von Mises regression models. By using a relationship between the von Mises and the wrapped normal distributions, the paper shows that the deviance component of the von Mises distribution is approximately a linear function of the standard normal distribution. Two standardized forms are proposed for the deviance residual, and a simulation study is performed to compare the approximation of the proposed residuals to the standard normal distribution. An illustrative example is given. 相似文献

6.

Extreme values identification in regression using a peaks-over-threshold approach

Tong Siu Tung Wong Wai Keung Li 《Journal of applied statistics》2015,42(3):566-576

The problem of heavy tail in regression models is studied. It is proposed that regression models are estimated by a standard procedure and a statistical check for heavy tail using residuals is conducted as a tool for regression diagnostic. Using the peaks-over-threshold approach, the generalized Pareto distribution quantifies the degree of heavy tail by the extreme value index. The number of excesses is determined by means of an innovative threshold model which partitions the random sample into extreme values and ordinary values. The overall decision on a significant heavy tail is justified by both a statistical test and a quantile–quantile plot. The usefulness of the approach includes justification of goodness of fit of the estimated regression model and quantification of the occurrence of extremal events. The proposed methodology is supplemented by surface ozone level in the city center of Leeds. 相似文献

7.

A log-linear regression model for the odd Weibull distribution with censored data

Edwin M.M. Ortega Gauss M. Cordeiro Elizabeth M. Hashimoto Kahadawala Cooray 《Journal of applied statistics》2014,41(9):1859-1880

We introduce the log-odd Weibull regression model based on the odd Weibull distribution (Cooray, 2006). We derive some mathematical properties of the log-transformed distribution. The new regression model represents a parametric family of models that includes as sub-models some widely known regression models that can be applied to censored survival data. We employ a frequentist analysis and a parametric bootstrap for the parameters of the proposed model. We derive the appropriate matrices for assessing local influence on the parameter estimates under different perturbation schemes and present some ways to assess global influence. Further, for different parameter settings, sample sizes and censoring percentages, some simulations are performed. In addition, the empirical distribution of some modified residuals are given and compared with the standard normal distribution. These studies suggest that the residual analysis usually performed in normal linear regression models can be extended to a modified deviance residual in the proposed regression model applied to censored data. We define martingale and deviance residuals to check the model assumptions. The extended regression model is very useful for the analysis of real data. 相似文献

8.

A scale-location adjustment for proportional hazards deviance residuals

De Wayne R. Derryberry Paul A. Murtaugh 《统计学通讯:理论与方法》2013,42(11):2583-2591

We show that deviance residuals derived using the proportional hazards assumption (including Cox regression) are not asymptotically standard normal, but that a scale-location adjustment makes them nearly standard normal, even for moderate sample sizes. This adjustment should aid in outlier detection, as it allows a more exact assessment of when a deviance residual is unusually large. 相似文献

9.

Adjusted Pearson residuals in exponential family nonlinear models

《Journal of Statistical Computation and Simulation》2012,82(4):411-425

In this paper, we give matrix formulae of order 𝒪(n ^?1), where n is the sample size, for the first two moments of Pearson residuals in exponential family nonlinear regression models [G.M. Cordeiro and G.A. Paula, Improved likelihood ratio statistic for exponential family nonlinear models, Biometrika 76 (1989), pp. 93–100.]. The formulae are applicable to many regression models in common use and generalize the results by Cordeiro [G.M. Cordeiro, On Pearson's residuals in generalized linear models, Statist. Prob. Lett. 66 (2004), pp. 213–219.] and Cook and Tsai [R.D. Cook and C.L. Tsai, Residuals in nonlinear regression, Biometrika 72(1985), pp. 23–29.]. We suggest adjusted Pearson residuals for these models having, to this order, the expected value zero and variance one. We show that the adjusted Pearson residuals can be easily computed by weighted linear regressions. Some numerical results from simulations indicate that the adjusted Pearson residuals are better approximated by the standard normal distribution than the Pearson residuals. 相似文献

10.

The log-odd log-logistic Weibull regression model: modelling,estimation, influence diagnostics and residual analysis

《Journal of Statistical Computation and Simulation》2012,82(8):1516-1538

In applications of survival analysis, the failure rate function may frequently present a unimodal shape. In such cases, the log-normal and log-logistic distributions are used. In this paper, we shall be concerned only with parametric forms, so a location-scale regression model based on the odd log-logistic Weibull distribution is proposed for modelling data with a decreasing, increasing, unimodal and bathtub failure rate function as an alternative to the log-Weibull regression model. For censored data, we consider a classic method to estimate the parameters of the proposed model. We derive the appropriate matrices for assessing local influences on the parameter estimates under different perturbation schemes and present some ways to assess global influences. Further, for different parameter settings, sample sizes and censoring percentages, various simulations are performed. In addition, the empirical distribution of some modified residuals is determined and compared with the standard normal distribution. These studies suggest that the residual analysis usually performed in normal linear regression models can be extended to a modified deviance residual in the new regression model applied to censored data. We analyse a real data set using the log-odd log-logistic Weibull regression model. 相似文献

11.

On confidence intervals for semiparametric expectile regression

Fabian Sobotka Göran Kauermann Linda Schulze Waltrup Thomas Kneib 《Statistics and Computing》2013,23(2):135-148

In regression scenarios there is a growing demand for information on the conditional distribution of the response beyond the mean. In this scenario quantile regression is an established method of tail analysis. It is well understood in terms of asymptotic properties and estimation quality. Another way to look at the tail of a distribution is via expectiles. They provide a valuable alternative since they come with a combination of preferable attributes. The easy weighted least squares estimation of expectiles and the quadratic penalties often used in flexible regression models are natural partners. Also, in a similar way as quantiles can be seen as a generalisation of median regression, expectiles offer a generalisation of mean regression. In addition to regression estimates, confidence intervals are essential for interpretational purposes and to assess the variability of the estimate, but there is a lack of knowledge regarding the asymptotic properties of a semiparametric expectile regression estimate. Therefore confidence intervals for expectiles based on an asymptotic normal distribution are introduced. Their properties are investigated by a simulation study and compared to a boostrap-based gold standard method. Finally the introduced confidence intervals help to evaluate a geoadditive expectile regression model on childhood malnutrition data from India. 相似文献

12.

A bounded influence regression estimator based on the statistics of the hat matrix 总被引：1，自引：0，他引：1

Alan D. Chave David J. Thomson 《Journal of the Royal Statistical Society. Series C, Applied statistics》2003,52(3):307-322

Summary. Many geophysical regression problems require the analysis of large (more than 10⁴ values) data sets, and, because the data may represent mixtures of concurrent natural processes with widely varying statistical properties, contamination of both response and predictor variables is common. Existing bounded influence or high breakdown point estimators frequently lack the ability to eliminate extremely influential data and/or the computational efficiency to handle large data sets. A new bounded influence estimator is proposed that combines high asymptotic efficiency for normal data, high breakdown point behaviour with contaminated data and computational simplicity for large data sets. The algorithm combines a standard M -estimator to downweight data corresponding to extreme regression residuals and removal of overly influential predictor values (leverage points) on the basis of the statistics of the hat matrix diagonal elements. For this, the exact distribution of the hat matrix diagonal elements p _ii for complex multivariate Gaussian predictor data is shown to be β ( p _ii, m , N − m ), where N is the number of data and m is the number of parameters. Real geophysical data from an auroral zone magnetotelluric study which exhibit severe outlier and leverage point contamination are used to illustrate the estimator's performance. The examples also demonstrate the utility of looking at both the residual and the hat matrix distributions through quantile–quantile plots to diagnose robust regression problems. 相似文献

13.

The Kumaraswamy normal linear regression model with applications

Gauss M. Cordeiro Elizabete C. Machado Mônica C. Sandoval 《统计学通讯:模拟与计算》2013,42(10):3062-3082

ABSTRACT

For any continuous baseline G distribution, Cordeiro and Castro pioneered the Kumaraswamy-G family of distributions with two extra positive parameters, which generalizes both Lehmann types I and II classes. We study some mathematical properties of the Kumaraswamy-normal (KwN) distribution including ordinary and incomplete moments, mean deviations, quantile and generating functions, probability weighted moments, and two entropy measures. We propose a new linear regression model based on the KwN distribution, which extends the normal linear regression model. We obtain the maximum likelihood estimates of the model parameters and provide some diagnostic measures such as global influence, local influence, and residuals. We illustrate the potentiality of the introduced models by means of two applications to real datasets. 相似文献

14.

Quantile regression in functional linear semiparametric model

Tang Qingguo Linglong Kong 《Statistics》2017,51(6):1342-1358

This paper proposes nonparametric estimation methods for functional linear semiparametric quantile regression, where the conditional quantile of the scalar responses is modelled by both scalar and functional covariates and an additional unknown nonparametric function term. The slope function is estimated using the functional principal component basis and the nonparametric function is approximated by a piecewise polynomial function. The asymptotic distribution of the estimators of slope parameters is derived and the global convergence rate of the quantile estimator of unknown slope function is established under suitable norm. The asymptotic distribution of the estimator of the unknown nonparametric function is also established. Simulation studies are conducted to investigate the finite-sample performance of the proposed estimators. The proposed methodology is demonstrated by analysing a real data from ADHD-200 sample. 相似文献

15.

Gibbs sampling methods for Bayesian quantile regression

《Journal of Statistical Computation and Simulation》2012,82(11):1565-1578

This paper considers quantile regression models using an asymmetric Laplace distribution from a Bayesian point of view. We develop a simple and efficient Gibbs sampling algorithm for fitting the quantile regression model based on a location-scale mixture representation of the asymmetric Laplace distribution. It is shown that the resulting Gibbs sampler can be accomplished by sampling from either normal or generalized inverse Gaussian distribution. We also discuss some possible extensions of our approach, including the incorporation of a scale parameter, the use of double exponential prior, and a Bayesian analysis of Tobit quantile regression. The proposed methods are illustrated by both simulated and real data. 相似文献

16.

非参数固定效应Panel Data模型的分位数回归推断

吕秀梅《统计与信息论坛》2012,27(6):28-32

利用分位数回归方法,讨论了非参数固定效应Panel Data模型的估计和检验问题,得到了参数估计的渐近正态性及收敛速度。同时,建立一个秩得分(rank score)统计量来检验模型的固定效应,并证明了这个统计量渐近服从标准正态分布。相似文献

17.

Approximating the equilibrium quantity traded and welfare in large markets

Ellen V. Muir Konstantin Borovkov 《随机性模型》2017,33(3):411-429

We consider the efficient outcome of a canonical economic market model involving buyers and sellers with independent and identically distributed random valuations and costs, respectively. When the number of buyers and sellers is large, we show that the joint distribution of the equilibrium quantity traded and welfare is asymptotically normal. Moreover, we bound the approximation rate. The proof proceeds by constructing, on a common probability space, a representation consisting of two independent empirical quantile processes, which in large markets can be approximated by independent Brownian bridges. The distribution of interest can then be approximated by that of a functional of a Gaussian process. This methodology applies to a variety of mechanism design problems. 相似文献

18.

Goodness‐of‐fit methods for matched case‐control studies

Patrick G. Arbogast Danyu Y. Lin 《Revue canadienne de statistique》2004,32(4):373-386

The authors propose graphical and numerical methods for checking the adequacy of the logistic regression model for matched case‐control data. Their approach is based on the cumulative sum of residuals over the covariate or linear predictor. Under the assumed model, the cumulative residual process converges weakly to a centered Gaussian limit whose distribution can be approximated via computer simulation. The observed cumulative residual pattern can then be compared both visually and analytically to a certain number of simulated realizations of the approximate limiting process under the null hypothesis. The proposed techniques allow one to check the functional form of each covariate, the logistic link function as well as the overall model adequacy. The authors assess the performance of the proposed methods through simulation studies and illustrate them using data from a cardiovascular study. 相似文献

19.

Simple correspondence analysis using adjusted residuals

Eric J. Beh 《Journal of statistical planning and inference》2012,142(4):965-973

Correspondence analysis is a versatile statistical technique that allows the user to graphically identify the association that may exist between variables of a contingency table. For two categorical variables, the classical approach involves applying singular value decomposition to the Pearson residuals of the table. These residuals allow for one to use a simple test to determine those cells that deviate from what is expected under independence. However, the assumptions concerning these residuals are not always satisfied and so such results can lead to questionable conclusions.One may consider instead, an adjustment of the Pearson residual, which is known to have properties associated with the standard normal distribution. This paper explores the application of these adjusted residuals to correspondence analysis and determines how they impact upon the configuration of points in the graphical display. 相似文献

20.

A simple formula based on quantiles for the moments of beta generalized distributions

Gauss M. Cordeiro 《Journal of Statistical Computation and Simulation》2013,83(10):1932-1943

In this article, we derive explicit expansions for the moments of beta generalized distributions from power series expansions for the quantile functions of the baseline distributions. We apply our formula to the beta normal, beta Student t, beta gamma and beta beta generalized distributions. We propose a simple way to express the quantile function of any beta generalized distribution as a power series expansion with known coefficients. 相似文献