1.

Robust estimation for linear regression with asymmetric errors

Ana M. Bianco Marta Garcia Ben Víctor J. Yohai 《Revue canadienne de statistique》2005,33(4):511-528

The authors propose a new class of robust estimators for the parameters of a regression model in which the distribution of the error terms belongs to a class of exponential families including the log‐gamma distribution. These estimates, which are a natural extension of the MM‐estimates for ordinary regression, may combine simultaneously high asymptotic efficiency and a high breakdown point. The authors prove the consistency and derive the asymptotic normal distribution of these estimates. A Monte Carlo study allows them to assess the efficiency and robustness of these estimates for finite samples.

2.

A class of finite mixture of quantile regressions with its applications

Yuzhu Tian Maozai Tian 《Journal of applied statistics》2016,43(7):1240-1252

Mixtures of linear regression models provide a popular treatment for modeling nonlinear regression relationships. The traditional estimation of mixture of regression models is based on a Gaussian error assumption. It is well known that such an assumption is sensitive to outliers and extreme values. To overcome this issue, a new class of finite mixture of quantile regressions (FMQR) is proposed in this article. Compared with the existing Gaussian mixture regression models, the proposed FMQR model can provide a complete specification of the conditional distribution of the response variable for each component. From the likelihood point of view, the FMQR model is equivalent to the finite mixture of regression models based on errors following an asymmetric Laplace distribution (ALD), which can be regarded as an extension of the traditional mixture of regression models with normal error terms. An EM algorithm is proposed to obtain the parameter estimates of the FMQR model by using a hierarchical representation of the ALD. Finally, the iterated weighted least squares estimation for each mixture component of the FMQR model is derived. Simulation studies are conducted to illustrate the finite sample performance of the estimation procedure. Analysis of an aphid data set is used to illustrate our methodologies.
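The likelihood equivalence described in this abstract can be illustrated for a single component and a pure location model. The sketch below is not the authors' EM algorithm; it assumes the common ALD parameterization f(y) = τ(1 − τ)/σ · exp(−ρ_τ((y − μ)/σ)), and the function names, the toy data, and the grid search are all hypothetical choices made for illustration. With σ held fixed, the ALD negative log-likelihood differs from the quantile check loss only by a constant, so both have the same minimizer.

```python
import math

def check_loss(u, tau):
    """Quantile-regression check loss rho_tau(u) = u * (tau - 1[u < 0])."""
    return u * (tau - (1.0 if u < 0 else 0.0))

def ald_neg_loglik(y, mu, sigma, tau):
    """Negative log-likelihood under the assumed ALD density
    f(y) = tau * (1 - tau) / sigma * exp(-rho_tau((y - mu) / sigma))."""
    n = len(y)
    const = n * (math.log(sigma) - math.log(tau * (1.0 - tau)))
    return const + sum(check_loss((yi - mu) / sigma, tau) for yi in y)

# Toy data and settings (illustrative only).
y = [0.3, 1.1, 1.9, 2.4, 3.8, 5.0, 7.2]
tau, sigma = 0.25, 1.0
grid = [i / 100.0 for i in range(0, 801)]  # candidate locations 0.00 .. 8.00

# Minimize the check loss and the ALD negative log-likelihood over the grid.
mu_check = min(grid, key=lambda m: sum(check_loss(yi - m, tau) for yi in y))
mu_ald = min(grid, key=lambda m: ald_neg_loglik(y, m, sigma, tau))

print(mu_check, mu_ald)  # both minimizers coincide
```

In a regression setting the location μ would be replaced by a linear predictor, and in a mixture each component would carry its own coefficients and weight; neither extension is shown here.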

3.

An ECM Estimation Approach for Analyzing Multivariate Skew-Normal Data with Dropout

T. Baghfalaki 《Communications in Statistics - Simulation and Computation》2013,42(10):1970-1988

In this article, an ECM algorithm is developed to obtain the maximum likelihood estimates of parameters when a multivariate skew-normal distribution is used for analyzing longitudinal skewed normal regression data with dropout. A simulation study is performed to investigate the performance of the presented algorithm. Also, the methodology is illustrated through two applications, and the results of the proposed methodology are compared with ECM under a multivariate normal assumption using the AIC and BIC criteria. Standard errors of the parameter estimates are obtained from the asymptotic observed information matrix.

4.

Estimation of regression parameters in generalized linear models for cluster correlated data with measurement error

B.C. Sutradhar J.N.K. Rao 《Revue canadienne de statistique》1996,24(2):177-192

Liang and Zeger (1986) introduced a class of estimating equations that gives consistent estimates of regression parameters and of their asymptotic variances in the class of generalized linear models for cluster correlated data. When the independent variables or covariates in such models are subject to measurement errors, the parameter estimates obtained from these estimating equations are no longer consistent. To correct for the effect of measurement errors, an estimator with smaller asymptotic bias is constructed along the lines of Stefanski (1985), assuming that the measurement error variance is either known or estimable. The asymptotic distribution of the bias-corrected estimator and a consistent estimator of its asymptotic variance are also given. The special case of a binary logistic regression model is studied in detail. For this case, methods based on conditional scores and quasilikelihood are also extended to cluster correlated data. Results of a small simulation study on the performance of the proposed estimators and associated tests of hypotheses are reported.

5.

Model selection and parameter estimation of a multinomial logistic regression model

《Journal of Statistical Computation and Simulation》2012,82(7):1412-1426

In the multinomial regression model, we consider the methodology for simultaneous model selection and parameter estimation by using the shrinkage and LASSO (least absolute shrinkage and selection operator) [R. Tibshirani, Regression shrinkage and selection via the LASSO, J. R. Statist. Soc. Ser. B 58 (1996), pp. 267–288] strategies. The shrinkage estimators (SEs) provide significant improvement over their classical counterparts in the case where some of the predictors may or may not be active for the response of interest. The asymptotic properties of the SEs are developed using the notion of asymptotic distributional risk. We then compare the relative performance of the LASSO estimator with two SEs in terms of simulated relative efficiency. A simulation study shows that the shrinkage and LASSO estimators dominate the full model estimator. Further, both SEs perform better than the LASSO estimators when there are many inactive predictors in the model. A real-life data set is used to illustrate the suggested shrinkage and LASSO estimators.

6.

Estimating the p-values of robust tests for the linear model

《Journal of statistical planning and inference》2005,128(1):241-257

In this paper, we study the estimation of p-values for robust tests for the linear regression model. The asymptotic distribution of these tests has only been studied under the restrictive assumption of errors with known scale or symmetric distribution. Since these robust tests are based on robust regression estimates, Efron's bootstrap (1979) presents a number of problems. In particular, it is computationally very expensive, and it is not resistant to outliers in the data. In other words, the tails of the bootstrap distribution estimates obtained by re-sampling the data may be severely affected by outliers. We show how to adapt the Robust Bootstrap (Ann. Statist 30 (2002) 556; Bootstrapping MM-estimators for linear regression with fixed designs, http://mathstat.carleton.ca/~matias/pubs.html) to this problem. This method is very fast to compute, resistant to outliers in the data, and asymptotically correct under weak regularity assumptions. In this paper, we show that the Robust Bootstrap can be used to obtain asymptotically correct, computationally simple p-value estimates. A simulation study indicates that the tests whose p-values are estimated with the Robust Bootstrap have better finite sample significance levels than those obtained from the asymptotic theory based on the symmetry assumption. Although this paper is focussed on robust scores-type tests (in: Directions in Robust Statistics and Diagnostics, Part I, Springer, New York), our approach can be applied to other robust tests (for example, Wald- and dispersion-type also discussed in Markatou et al., 1991).

7.

Semiparametric Regression with Kernel Error Model

AO YUAN JAN G. DE GOOIJER 《Scandinavian Journal of Statistics》2007,34(4):841-869

Abstract. We propose and study a class of regression models, in which the mean function is specified parametrically as in the existing regression methods, but the residual distribution is modelled non-parametrically by a kernel estimator, without imposing any assumption on its distribution. This specification is different from the existing semiparametric regression models. The asymptotic properties of such likelihood and the maximum likelihood estimate (MLE) under this semiparametric model are studied. We show that under some regularity conditions, the MLE under this model is consistent (when compared with the possibly pseudo-consistency of the parameter estimation under the existing parametric regression model), is asymptotically normal with rate and efficient. The non-parametric pseudo-likelihood ratio has the Wilks property as the true likelihood ratio does. Simulated examples are presented to evaluate the accuracy of the proposed semiparametric MLE method.

8.

Estimation of the linear-linear segmented regression model in the presence of measurement error

E. E. Gbur P. F. Dahm 《Communications in Statistics - Theory and Methods》2013,42(4):809-826

A simple segmented regression model in which the independent variable is measured with error is considered. The method of moments is used to obtain parameter estimates and the joint asymptotic distribution of the estimators is presented. The small sample properties of the inference procedures based on the asymptotic distribution of the estimators are studied numerically.

9.

On Semiparametric Mode Regression Estimation

Ali Gannoun Jerome Saracco Keming Yu 《Communications in Statistics - Theory and Methods》2013,42(7):1141-1157

It has been found that, for a variety of probability distributions, there is a surprising linear relation between mode, mean, and median. In this article, the relation between mode, mean, and median regression functions is assumed to follow a simple parametric model. We propose a semiparametric conditional mode (mode regression) estimation for an unknown (unimodal) conditional distribution function in the context of a regression model, so that any m-step-ahead mean and median forecasts can then be substituted into the resultant model to deliver an m-step-ahead mode prediction. In the semiparametric model, least squares estimators (LSEs) for the model parameters and the simultaneous estimation of the unknown mean and median regression functions by the local linear kernel method are combined to make inference about the parametric and nonparametric components of the proposed model. The asymptotic normality of these estimators is derived, and the asymptotic distribution of the parameter estimates is also given and is shown to follow usual parametric rates in spite of the presence of the nonparametric component in the model. These results are applied to obtain a data-based test for the dependence of mode regression over mean and median regression under a regression model.

10.

M-Estimates of regression when the scale is unknown and the error distribution is possibly asymmetric: A minimax result

Bing Li Ruben H. Zamar 《Revue canadienne de statistique》1996,24(2):193-206

Huber (1964) found the minimax-variance M-estimate of location under the assumption that the scale parameter is known; Li and Zamar (1991) extended this result to the case when the scale is unknown. We consider the robust estimation of the regression coefficients (β₁,…,β_p) when the scale and the intercept parameters are unknown. The minimax-variance estimates of (β₁,…,β_p) with respect to the trace of their asymptotic covariance matrix are derived. The maximum is taken over ε-contamination neighbourhoods of a central regression model with Gaussian errors (asymmetric contamination is allowed), and the minimum is taken over a large class of generalized M-estimates of regression of the Mallows type. The optimal choice of estimates for the nuisance parameters (scale and intercept) is also considered.

11.

Discontinuities in robust nonparametric regression with α-mixing dependence

Marie Hušková 《Journal of nonparametric statistics》2017,29(2):447-475

The main idea of the paper is to introduce a robust regression estimation method under an α-mixing dependence assumption, staying free of any parametric model restrictions while also allowing for some sudden changes in the unknown regression function. The sudden changes in the model may correspond to discontinuity points (jumps) or higher order breaks (jumps in corresponding derivatives) as well. We first derive some important statistical properties for local polynomial M-smoother estimates and then propose a statistical test to decide whether a change occurs at a given point of interest. As the asymptotic distribution of the test statistic depends on quantities which are left unknown, we also introduce a bootstrap algorithm which can be used to mimic the target distribution of interest. All necessary proofs are provided together with some experimental results from a simulation study and a real data example.

12.

Restricted likelihood ratio lack-of-fit tests using mixed spline models

Gerda Claeskens 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2004,66(4):909-926

Summary. Penalized regression spline models afford a simple mixed model representation in which variance components control the degree of non-linearity in the smooth function estimates. This motivates the study of lack-of-fit tests based on the restricted maximum likelihood ratio statistic which tests whether variance components are 0 against the alternative of taking on positive values. For this one-sided testing problem a further complication is that the variance component belongs to the boundary of the parameter space under the null hypothesis. Conditions are obtained on the design of the regression spline models under which asymptotic distribution theory applies, and finite sample approximations to the asymptotic distribution are provided. Test statistics are studied for simple as well as multiple-regression models.

13.

THE ANALYSIS OF DISCRETE CHOICE EXPERIMENTS WITH CORRELATED ERROR STRUCTURE

《Communications in Statistics - Theory and Methods》2013,42(4):615-626

In a stated preference discrete choice experiment each subject is typically presented with several choice sets, and each choice set contains a number of alternatives. The alternatives are defined in terms of their name (brand) and their attributes at specified levels. The task for the subject is to choose from each choice set the alternative with highest utility for them. The multinomial is an appropriate distribution for the responses to each choice set since each subject chooses one alternative, and the multinomial logit is a common model. If the responses to the several choice sets are independent, the likelihood function is simply the product of multinomials. The most common and generally preferred method of estimating the parameters of the model is maximum likelihood (that is, selecting as estimates those values that maximize the likelihood function). If the assumption of within-subject independence to successive choice tasks is violated (it is almost surely violated), the likelihood function is incorrect and maximum likelihood estimation is inappropriate. The most serious errors involve the estimation of the variance-covariance matrix of the model parameter estimates, and the corresponding variances of market shares and changes in market shares.

In this paper we present an alternative method of estimation of the model parameter coefficients that incorporates a first-order within-subject covariance structure. The method involves the familiar log-odds transformation and application of the multivariate delta method. Estimation of the model coefficients after the transformation is a straightforward generalized least squares regression, and the corresponding improved estimate of the variance-covariance matrix is in closed form. Estimates of market share (and change in market share) follow from a second application of the multivariate delta method. The method and comparison with maximum likelihood estimation are illustrated with several simulated and actual data examples.

Advantages of the proposed method are: 1) it incorporates the within-subject covariance structure; 2) it is completely data driven; 3) it requires no additional model assumptions; 4) assuming asymptotic normality, it provides a simple procedure for computing confidence regions on market shares and changes in market shares; and 5) it produces results that are asymptotically equivalent to those produced by maximum likelihood when the data are independent.
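The log-odds transformation and delta method named in this abstract can be illustrated in the simplest univariate case. The sketch below is a deliberately reduced stand-in, not the paper's multivariate procedure: it assumes a single binomial share with independent responses (so it ignores the within-subject covariance the paper is about), and the function names and counts are hypothetical. For a share p estimated from n independent choices, the delta method gives Var(log(p/(1 − p))) ≈ 1/(n p (1 − p)); an interval built on the log-odds scale is then mapped back to the share scale.

```python
import math

def log_odds_with_se(successes, n):
    """Log-odds of a binomial share and its delta-method standard error."""
    p = successes / n
    log_odds = math.log(p / (1.0 - p))
    se = math.sqrt(1.0 / (n * p * (1.0 - p)))  # delta-method approximation
    return log_odds, se

def share_interval(successes, n, z=1.96):
    """Approximate interval for the share, built on the log-odds scale
    and transformed back with the inverse logit."""
    log_odds, se = log_odds_with_se(successes, n)
    inv_logit = lambda t: 1.0 / (1.0 + math.exp(-t))
    return inv_logit(log_odds - z * se), inv_logit(log_odds + z * se)

# Hypothetical example: 50 of 100 respondents choose a given alternative.
log_odds, se = log_odds_with_se(50, 100)
low, high = share_interval(50, 100)
print(log_odds, se, low, high)
```

Working on the log-odds scale keeps the back-transformed interval inside (0, 1), which is one practical reason for the transformation.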

14.

Serial correlation and distributions on the sphere

D.A.S. Fraser Irwin Guttman George P.H. Styan 《Communications in Statistics - Theory and Methods》2013,42(2):97-118

Estimation and tests for serial correlation in location and regression models with normal error have been derived from various points of view; for example: Anderson (1948), Durbin & Watson (1950, 1951, 1971), Theil (1965), Durbin (1970), Haq (1970), Kadiyala (1970), Abrahamse & Louter (1971), Levenbach (1972), Berenblut & Webb (1973), Phillips & Harvey (1974), and Sims (1975). In this paper we derive likelihood functions and most powerful tests for serial correlation in location and regression models with arbitrary but specified error; the methods extend to include the determination of the likelihood for the parameter of the error distribution.

In Section 1, we survey the methods that have been used in deriving the various tests and estimates in the literature. In Section 2, we introduce the statistical model that directly describes the error distribution and we obtain the likelihood function for error correlation and determine locally and specifically most powerful tests for correlation. In Section 3 we consider the case with normal error and derive a normal distribution on the sphere by radial projection. The likelihood function and test are then specialized to the case of normal error in Section 4. The computational procedures for the tests and related power functions are examined in Section 5. Power comparisons for the textile data of Theil and Nagar (1961), the consumption data of Klein (1950), and the plums and the wheat data of Hildreth & Lu (1960) are presented in Section 6, while the likelihood functions for correlation in these data are given in Section 7.

15.

Least circular distance regression for directional data

Ulric Lund 《Journal of applied statistics》1999,26(6):723-733

Least-squares regression is not appropriate when the response variable is circular, and can lead to erroneous results. The reason for this is that the squared difference is not an appropriate measure of distance on the circle. In this paper, a circular analog to least-squares regression is presented for predicting a circular response variable by another circular variable and a set of linear covariates. An alternative maximum-likelihood formulation yields the same regression parameter estimates. Under the maximum-likelihood model, asymptotic standard errors of the parameter estimates are obtained. As an example, the regression model is used to model data from a marine biology study.
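The point about squared differences on the circle can be seen in a two-angle example. The sketch below is not the paper's regression model; it assumes the circular distance 1 − cos(α − β), a common choice that the abstract itself does not spell out, and it only handles the intercept-only case, where minimizing the summed circular distance gives the usual circular mean direction. All names are hypothetical.

```python
import math

def circular_distance(a, b):
    """Assumed circular distance between two angles in radians: 1 - cos(a - b)."""
    return 1.0 - math.cos(a - b)

def circular_mean(angles):
    """Direction minimizing the summed circular distance to the angles."""
    s = sum(math.sin(a) for a in angles)
    c = sum(math.cos(a) for a in angles)
    return math.atan2(s, c)

# Two directions that are close on the circle but far apart as raw numbers.
angles = [0.1, 2.0 * math.pi - 0.1]

arithmetic = sum(angles) / len(angles)  # least-squares fit: points the opposite way
circular = circular_mean(angles)        # least circular distance fit: close to 0

print(arithmetic, circular)
print(circular_distance(angles[0], angles[1]))  # small, as it should be
```

The arithmetic mean lands at π, diametrically opposite both observations, while the circular fit lands essentially at 0.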

16.

On asymptotic distribution of prediction in functional linear regression

Omid Khademnoe 《Statistics》2016,50(5):974-990

There has been substantial recent attention on problems involving a functional linear regression model with scalar response. Among them, there have been few works dealing with the asymptotic distribution of prediction in functional linear regression models. In recent literature, the central limit theorem for prediction has been discussed, but the proof and conditions under which the random bias terms for a fixed predictor converge to zero have been ignored, so that the impact of these terms on the convergence of the prediction has not been well understood. Clarifying the proof and conditions under which the bias terms converge to zero, we show that the asymptotic distribution of the prediction is normal. Furthermore, we derive the results related to other terms that were already obtained by others, under milder conditions. Finally, we conduct a simulation study to investigate the performance of the asymptotic distribution under various parameter settings.

17.

Optimum percentile estimating equations for nonlinear random coefficient models

《Journal of statistical planning and inference》2001,97(2):275-292

In nonlinear random coefficients models, the means or variances of response variables may not exist. In such cases, commonly used estimation procedures, e.g., (extended) least-squares (LS) and quasi-likelihood methods, are not applicable. This article solves this problem by proposing an estimate based on percentile estimating equations (PEE). This method does not require full distribution assumptions and leads to efficient estimates within the class of unbiased estimating equations. By minimizing the asymptotic variance of the PEE estimates, the optimum percentile estimating equations (OPEE) are derived. Several examples including Weibull regression show the flexibility of the PEE estimates. Under certain regularity conditions, the PEE estimates are shown to be strongly consistent and asymptotically normal, and the OPEE estimates have the minimal asymptotic variance. Compared with the parametric maximum likelihood estimates (MLE), the asymptotic efficiency of the OPEE estimates is more than 98%, while the LS-type of procedures can have infinite variances. When the observations have outliers or do not follow the distributions considered in model assumptions, the article shows that OPEE is more robust than the MLE, and the asymptotic efficiency in the model misspecification cases can be above 150%.

18.

Efficient estimation in a regression model with missing responses

《Statistical Methodology》2014

This article examines methods to efficiently estimate the mean response in a linear model with an unknown error distribution under the assumption that the responses are missing at random. We show how the asymptotic variance is affected by the estimator of the regression parameter, and by the imputation method. To estimate the regression parameter, the ordinary least squares is efficient only if the error distribution happens to be normal. If the errors are not normal, then we propose a one step improvement estimator or a maximum empirical likelihood estimator to efficiently estimate the parameter. To investigate the imputation’s impact on the estimation of the mean response, we compare the listwise deletion method and the propensity score method (which do not use imputation at all), and two imputation methods. We demonstrate that listwise deletion and the propensity score method are inefficient. Partial imputation, where only the missing responses are imputed, is compared to full imputation, where both missing and non-missing responses are imputed. Our results reveal that, in general, full imputation is better than partial imputation. However, when the regression parameter is estimated very poorly, the partial imputation will outperform full imputation. The efficient estimator for the mean response is the full imputation estimator that utilizes an efficient estimator of the parameter.
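The difference between listwise deletion, partial imputation, and full imputation, as defined in this abstract, can be made concrete with a small sketch. It assumes a simple linear model with one covariate and uses ordinary least squares on the complete cases as the regression estimator; the abstract's one step improvement and maximum empirical likelihood estimators are not implemented, and all function names and data are hypothetical.

```python
import statistics

def ols_fit(xs, ys):
    """Intercept and slope of a simple least-squares line."""
    x_bar, y_bar = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

def mean_response_estimates(xs, ys):
    """Three estimates of the mean response; ys[i] is None when missing."""
    observed = [(x, y) for x, y in zip(xs, ys) if y is not None]
    intercept, slope = ols_fit([x for x, _ in observed], [y for _, y in observed])
    fitted = [intercept + slope * x for x in xs]
    return {
        # Average of the observed responses only.
        "listwise": statistics.fmean([y for _, y in observed]),
        # Observed responses kept, missing ones replaced by fitted values.
        "partial": statistics.fmean(
            [y if y is not None else f for y, f in zip(ys, fitted)]
        ),
        # Every response, observed or not, replaced by its fitted value.
        "full": statistics.fmean(fitted),
    }

# Noise-free toy data y = 2 + 3x, with responses missing for the largest x.
xs = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 5.0, 8.0, 11.0, None, None]
estimates = mean_response_estimates(xs, ys)
print(estimates)
```

In this noise-free toy case both imputation estimates recover the full-sample mean of 9.5, while listwise deletion is pulled toward the observed small-x cases. The efficiency comparison between partial and full imputation that the abstract reports only appears once errors are present.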

19.

Asymptotic distribution theory for the Kalman filter state estimator

James C. Spall Kent D. Wall 《Communications in Statistics - Theory and Methods》2013,42(16):1981-2003

An asymptotic distribution theory for the state estimate from a Kalman filter in the absence of the usual Gaussian assumption is presented. It is found that the stability properties of the state transition matrix play a key role in the distribution theory. Specifically, when the state equation is neutrally stable (i.e., borderline stable-unstable) the state estimate is asymptotically normal when the random terms in the model have arbitrary distributions. This case includes the popular random walk state equation. However, when the state equation is either stable or unstable, at least some of the random terms in the model must be normally distributed beyond some finite time before the state estimate is asymptotically normal.
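The random walk case mentioned in this abstract can be set up in a few lines. The sketch below assumes a scalar model x_t = x_{t-1} + w_t, y_t = x_t + v_t with uniform (hence non-Gaussian) noise; the function name, the noise distributions, and the initial values are hypothetical, and the code only runs the filter recursion. It does not reproduce the paper's distribution theory.

```python
import random

def kalman_random_walk(ys, q, r, x0=0.0, p0=1.0):
    """Scalar Kalman filter for x_t = x_{t-1} + w_t, y_t = x_t + v_t,
    with Var(w_t) = q and Var(v_t) = r. Returns (estimates, gains)."""
    x, p = x0, p0
    estimates, gains = [], []
    for y in ys:
        p_pred = p + q               # predict: the random walk adds variance q
        k = p_pred / (p_pred + r)    # Kalman gain
        x = x + k * (y - x)          # update with the innovation y - x
        p = (1.0 - k) * p_pred       # posterior variance
        estimates.append(x)
        gains.append(k)
    return estimates, gains

# Simulate a random walk observed in noise; Uniform(-1, 1) has variance 1/3.
rng = random.Random(1)
state, ys = 0.0, []
for _ in range(200):
    state += rng.uniform(-1.0, 1.0)            # non-Gaussian state noise
    ys.append(state + rng.uniform(-1.0, 1.0))  # non-Gaussian measurement noise

estimates, gains = kalman_random_walk(ys, q=1.0 / 3.0, r=1.0 / 3.0)
print(gains[-1])  # the gain settles to a constant that depends only on q and r
```

The gain sequence depends only on the variances q and r, not on the shape of the noise distributions, so the filter itself runs unchanged without the Gaussian assumption; what changes is the distribution of the resulting state estimate, which is the subject of the paper.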

20.

Generalised regression estimation via the bootstrap

James G. Booth Alan H. Welsh 《Australian & New Zealand Journal of Statistics》2020,62(1):5-24

A generalised regression estimation procedure is proposed that can lead to much improved estimation of population characteristics, such as quantiles, variances and coefficients of variation. The method involves conditioning on the discrepancy between an estimate of an auxiliary parameter and its known population value. The key distributional assumption is joint asymptotic normality of the estimates of the target and auxiliary parameters. This assumption implies that the relationship between the estimated target and the estimated auxiliary parameters is approximately linear with coefficients determined by their asymptotic covariance matrix. The main contribution of this paper is the use of the bootstrap to estimate these coefficients, which avoids the need for parametric distributional assumptions. First‐order correct conditional confidence intervals based on asymptotic normality can be improved upon using quantiles of a conditional double bootstrap approximation to the distribution of the studentised target parameter estimate.
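One reading of the idea in this abstract is sketched below: bootstrap the target and auxiliary estimates together, take the slope of the target estimate on the auxiliary estimate from the bootstrap replicates, and use it to correct the target estimate for the known discrepancy in the auxiliary parameter. This is an interpretation under stated assumptions, not the authors' procedure: it assumes simple random sampling with replacement, a single auxiliary mean with known population value, and hypothetical names throughout; the conditional double bootstrap intervals are not shown.

```python
import random
import statistics

def bootstrap_greg(pairs, target, aux_known, n_boot=500, seed=0):
    """Adjust target(y) using the known population mean of x.
    pairs: list of (x, y); target: function of a list of y values.
    Returns (adjusted_estimate, bootstrap_slope)."""
    rng = random.Random(seed)
    n = len(pairs)
    t_hat = target([y for _, y in pairs])
    a_hat = statistics.fmean([x for x, _ in pairs])
    t_star, a_star = [], []
    for _ in range(n_boot):
        sample = [pairs[rng.randrange(n)] for _ in range(n)]  # resample pairs
        t_star.append(target([y for _, y in sample]))
        a_star.append(statistics.fmean([x for x, _ in sample]))
    t_bar, a_bar = statistics.fmean(t_star), statistics.fmean(a_star)
    cov = sum((t - t_bar) * (a - a_bar) for t, a in zip(t_star, a_star))
    var = sum((a - a_bar) ** 2 for a in a_star)
    slope = cov / var  # bootstrap estimate of the linear coefficient
    return t_hat - slope * (a_hat - aux_known), slope

# Hypothetical sample: estimate the median of y, with the mean of x known to be 5.
rng = random.Random(42)
xs = [rng.uniform(0.0, 10.0) for _ in range(60)]
pairs = [(x, 2.0 * x + rng.uniform(-1.0, 1.0)) for x in xs]
adjusted, slope = bootstrap_greg(pairs, statistics.median, aux_known=5.0)
print(adjusted, slope)
```

Because the slope comes from the bootstrap replicates rather than from a formula for the asymptotic covariance, the same function works for any target statistic passed in as `target`.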