Similar Articles
20 similar articles found
1.
Fitting a linear regression for a response variable by minimising the sum of absolute deviations, L1 regression, may be viewed as a maximum likelihood procedure applied to the Laplace distribution. An interesting bivariate case is where the conditional distribution of the response X2 given X1 and the marginal distribution of the explanatory variable X1 are both Laplace. In this context we show there is information in the observations to distinguish the direction of dependence between X1 and X2; that is, we may distinguish the model in which X1 depends on X2 from that in which X2 depends on X1. This is not true for L2 regression based on the Normal distribution.
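The Laplace-likelihood view above can be illustrated with a minimal sketch (not the paper's bivariate analysis): simulate data with Laplace errors and minimise the sum of absolute deviations numerically. The variable names, sample size, and the Nelder–Mead solver are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
n = 200
x = rng.standard_normal(n)
# Laplace-distributed errors: L1 regression is the MLE under this model
y = 1.0 + 2.0 * x + rng.laplace(scale=0.5, size=n)

def sad(beta):
    # sum of absolute deviations = negative Laplace log-likelihood up to constants
    return np.abs(y - beta[0] - beta[1] * x).sum()

# derivative-free solver, since the objective is not differentiable everywhere
res = minimize(sad, x0=np.zeros(2), method="Nelder-Mead")
b0, b1 = res.x
```

With Laplace noise the fitted coefficients recover the true intercept and slope closely.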

2.
We propose a new adaptive L1 penalized quantile regression estimator for high-dimensional sparse regression models with heterogeneous error sequences. We show that under weaker conditions compared with alternative procedures, the adaptive L1 quantile regression selects the true underlying model with probability converging to one, and the unique estimates of nonzero coefficients it provides have the same asymptotic normal distribution as the quantile estimator which uses only the covariates with non-zero impact on the response. Thus, the adaptive L1 quantile regression enjoys oracle properties. We propose a completely data driven choice of the penalty level λn, which ensures good performance of the adaptive L1 quantile regression. Extensive Monte Carlo simulation studies have been conducted to demonstrate the finite sample performance of the proposed method.
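The two-step adaptive idea can be sketched in a toy form: a pilot L1-penalised quantile fit, then a refit with penalty weights inversely proportional to the pilot coefficients. The pinball loss, the fixed penalty level (the paper's λn is data-driven), and the derivative-free solver are all assumptions, not the paper's procedure.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(1)
n, p, tau = 300, 5, 0.5
X = rng.standard_normal((n, p))
beta_true = np.array([2.0, 0.0, -1.5, 0.0, 0.0])   # sparse truth
y = X @ beta_true + rng.standard_normal(n)

def pinball(r, tau):
    # quantile check loss
    return np.maximum(tau * r, (tau - 1.0) * r).sum()

def fit(weights, lam):
    obj = lambda b: pinball(y - X @ b, tau) / n + lam * np.abs(weights * b).sum()
    return minimize(obj, np.zeros(p), method="Powell").x

# step 1: ordinary L1-penalised quantile fit gives pilot estimates
pilot = fit(np.ones(p), lam=0.05)
# step 2: adaptive weights ~ 1/|pilot| penalise noise coefficients harder
w = 1.0 / (np.abs(pilot) + 1e-6)
beta_hat = fit(w, lam=0.05)
```

The adaptive reweighting is what drives the oracle behaviour: large pilot coefficients are barely penalised, small ones are shrunk towards zero.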

3.
Because outliers and leverage observations unduly affect the least squares regression, the identification of influential observations is considered an important and integral part of the analysis. However, very few techniques have been developed for the residual analysis and diagnostics for the minimum sum of absolute errors, L1 regression. Although the L1 regression is more resistant to outliers than the least squares regression, it appears that outliers (leverage) in the predictor variables may affect it. In this paper, our objective is to develop an influence measure for the L1 regression based on the likelihood displacement function. We illustrate the proposed influence measure with examples.

4.
The resistance of least absolute values (L1) estimators to outliers and their robustness to heavy-tailed distributions make these estimators useful alternatives to the usual least squares estimators. The recent development of efficient algorithms for L1 estimation in linear models has permitted their use in practical data analysis. Although in general the L1 estimators are not unique, there are a number of properties they all share. The set of all L1 estimators for a given model and data set can be characterized as the convex hull of some extreme estimators. Properties of the extreme estimators and of the L1-estimate set are considered.

5.
Given an unknown function (e.g. a probability density or a regression function) f and a constant c, the problem of estimating the level set L(c) = {f ≥ c} is considered. This problem is tackled in a very general framework, which allows f to be defined on a metric space different from Euclidean space. Such a degree of generality is motivated by practical considerations and, in fact, an example with astronomical data is analyzed where the domain of f is the unit sphere. A plug-in approach is followed; that is, L(c) is estimated by Ln(c) = {fn ≥ c}, where fn is an estimator of f. Two results are obtained concerning consistency and convergence rates, with respect to the Hausdorff metric, of the boundaries ∂Ln(c) towards ∂L(c). Also, the consistency of Ln(c) to L(c) is shown, under mild conditions, with respect to the L1 distance. Special attention is paid to the particular case of spherical data.
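A minimal plug-in sketch on the real line (not the paper's general metric-space setting): estimate a density with a Gaussian kernel and threshold it at the level c. The bandwidth is fixed by hand here; in practice it would be chosen from the data.

```python
import numpy as np

rng = np.random.default_rng(2)
data = rng.normal(size=500)   # sample from a standard normal density f
c = 0.15                      # level defining L(c) = {f >= c}

grid = np.linspace(-4.0, 4.0, 400)
h = 0.3                       # kernel bandwidth (assumed fixed)
# Gaussian kernel density estimate f_n evaluated on the grid
fn = np.exp(-0.5 * ((grid[:, None] - data[None, :]) / h) ** 2).mean(axis=1) \
     / (h * np.sqrt(2.0 * np.pi))

# plug-in level set estimate L_n(c) = {f_n >= c}
Ln = grid[fn >= c]
```

For the standard normal, the true level set at c = 0.15 is an interval of roughly (−1.4, 1.4), and the plug-in estimate recovers an interval close to it.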

6.
The L1-type regularization provides a useful tool for variable selection in high-dimensional regression modeling. Various algorithms have been proposed to solve optimization problems for L1-type regularization. The coordinate descent algorithm in particular has been shown to be effective in sparse regression modeling. Although the algorithm shows a remarkable performance in solving optimization problems for L1-type regularization, it suffers from outliers, since the procedure is based on the inner product of predictor variables and partial residuals obtained in a non-robust manner. To overcome this drawback, we propose a robust coordinate descent algorithm, especially focusing on high-dimensional regression modeling based on the principal components space. We show that the proposed robust algorithm converges to the minimum value of its objective function. Monte Carlo experiments and real data analysis are conducted to examine the efficiency of the proposed robust algorithm. We observe that our robust coordinate descent algorithm performs effectively for high-dimensional regression modeling even in the presence of outliers.
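The inner-product-and-partial-residual step the abstract refers to can be seen in a plain (non-robust) coordinate descent for the lasso; this is the baseline the paper robustifies, not the proposed robust variant.

```python
import numpy as np

def soft_threshold(z, g):
    return np.sign(z) * np.maximum(np.abs(z) - g, 0.0)

def lasso_cd(X, y, lam, n_iter=200):
    """Plain coordinate descent for (1/2n)||y - Xb||^2 + lam*||b||_1."""
    n, p = X.shape
    beta = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0)
    for _ in range(n_iter):
        for j in range(p):
            # partial residual: this inner-product step is the non-robust part
            r = y - X @ beta + X[:, j] * beta[j]
            beta[j] = soft_threshold(X[:, j] @ r, lam * n) / col_sq[j]
    return beta

rng = np.random.default_rng(3)
n, p = 200, 8
X = rng.standard_normal((n, p))
beta_true = np.zeros(p)
beta_true[:2] = [3.0, -2.0]
y = X @ beta_true + 0.1 * rng.standard_normal(n)
beta_hat = lasso_cd(X, y, lam=0.1)
```

On clean data the sweep recovers the two active coefficients and sets the rest exactly to zero; a single gross outlier in y would distort every inner product, which is the weakness the robust version addresses.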

7.
A number of efficient computer codes are available for the simple linear L1 regression problem. However, several of these codes can be made more efficient by utilizing the least squares solution. In fact, a couple of available computer programs already do so.

We report the results of a computational study comparing several openly available computer programs for solving the simple linear L1 regression problem with and without computing and utilizing a least squares solution.

8.
Let (X,Y) be a pair of random variables with supp(X) ⊆ [0,1] and EY² < ∞. Let m be the corresponding regression function. Estimation of m from i.i.d. data is considered. The L2 error with integration with respect to the design measure μ (i.e., the distribution of X) is used as an error criterion. Estimates are constructed by estimating the coefficients of an orthonormal expansion of the regression function. This orthonormal expansion is done with respect to a family of piecewise polynomials which are orthonormal in L2(μn), where μn denotes the empirical design measure. It is shown that the estimates are weakly and strongly consistent for every distribution of (X,Y). Furthermore, the estimates behave nearly as well as an ideal (but not applicable) estimate constructed by fitting a piecewise polynomial to the data, where the partition of the piecewise polynomial is chosen optimally for the underlying distribution. This implies, e.g., that the estimates achieve, up to a logarithmic factor, the rate n^(−2p/(2p+1)) if the underlying regression function is piecewise p-smooth, although their definition depends neither on the smoothness nor on the location of the discontinuities of the regression function.

9.
The performance of nine different nonparametric regression estimates is empirically compared on ten different real datasets. The number of data points in the real datasets varies between 7,900 and 18,000, where each real dataset contains between 5 and 20 variables. The nonparametric regression estimates include kernel, partitioning, nearest neighbor, additive spline, neural network, penalized smoothing splines, local linear kernel, regression trees, and random forests estimates. The main result is a table containing the empirical L2 risks of all nine nonparametric regression estimates on the evaluation part of the different datasets. The neural networks and random forests are the two estimates performing best. The datasets are publicly available, so that any new regression estimate can be easily compared with all nine estimates considered in this article by just applying it to the publicly available data and by computing its empirical L2 risks on the evaluation part of the datasets.

10.
Nonparametric regression techniques such as spline smoothing and local fitting depend implicitly on a parametric model. For instance, the cubic smoothing spline estimate of a regression function μ based on observations (ti, Yi) is the minimizer of Σ{Yi − μ(ti)}² + λ∫(μ″)². Since ∫(μ″)² is zero when μ is a line, the cubic smoothing spline estimate favors the parametric model μ(t) = α0 + α1t. Here the authors consider replacing ∫(μ″)² with the more general expression ∫(Lμ)², where L is a linear differential operator with possibly nonconstant coefficients. The resulting estimate of μ performs well, particularly if Lμ is small. They present an O(n) algorithm for the computation of the estimate, applicable to a wide class of L's. They also suggest a method for the estimation of L. They study their estimates via simulation and apply them to several data sets.
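The penalised criterion above has a simple discrete analogue: with L = d²/dt² (the cubic-smoothing-spline case) the integral penalty becomes a quadratic form in a second-difference matrix, and the minimiser is a linear solve. This is a sketch of the criterion only, not the paper's O(n) algorithm or its general operator L; the grid, λ, and noise level are assumptions.

```python
import numpy as np

rng = np.random.default_rng(4)
n = 100
t = np.linspace(0.0, 1.0, n)
y = np.sin(2.0 * np.pi * t) + 0.2 * rng.standard_normal(n)

# Discrete analogue of minimising sum_i (y_i - mu(t_i))^2 + lam * int (L mu)^2
# with L = d^2/dt^2, using a second-difference matrix D:
#   mu_hat = (I + lam * D'D)^{-1} y
D = np.diff(np.eye(n), n=2, axis=0)    # (n-2) x n second-difference matrix
lam = 5.0
mu_hat = np.linalg.solve(np.eye(n) + lam * D.T @ D, y)
```

Because the sine curve has small second differences, the penalty removes noise with little bias, and the smoothed fit is closer to the truth than the raw observations.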

11.
The authors consider the problem of estimating a regression function g0 of several variables by the element of a prescribed class G that is closest to it in the L1 norm. They propose a new estimator ĝ based on independent observations and give explicit finite-sample bounds for the L1 distance between ĝ and g0. They apply their estimation procedure to the problem of selecting the smoothing parameter in nonparametric regression.

12.
We present an estimating framework for quantile regression where the usual L1-norm objective function is replaced by its smooth parametric approximation. An exact path-following algorithm is derived, leading to the well-known ‘basic’ solutions interpolating exactly a number of observations equal to the number of parameters being estimated. We discuss briefly possible practical implications of the proposed approach, such as early stopping for large data sets, confidence intervals, and additional topics for future research.
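One common way to smooth the quantile objective (an illustrative choice, not necessarily the parametric approximation or path-following algorithm of the paper) replaces the check function ρτ(r) with τr + ε·log(1 + exp(−r/ε)), which converges to ρτ as ε → 0 and is everywhere differentiable:

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(7)
n, tau = 400, 0.75
x = rng.standard_normal(n)
y = 1.0 + 2.0 * x + rng.standard_normal(n)

def smooth_pinball(r, tau, eps=1e-2):
    # smooth approximation of the check function rho_tau;
    # logaddexp keeps the computation numerically stable for large |r|/eps
    return tau * r + eps * np.logaddexp(0.0, -r / eps)

def obj(b):
    return smooth_pinball(y - b[0] - b[1] * x, tau).sum()

# smooth objective, so a gradient-based solver applies
b = minimize(obj, np.zeros(2), method="BFGS").x
```

At τ = 0.75 the fitted intercept absorbs the 0.75-quantile of the standard normal errors (about 0.674), so the estimates approach intercept ≈ 1.674 and slope ≈ 2.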

13.
In this paper, the regression model with a nonnegativity constraint on the dependent variable is considered. Under weak conditions, L1 estimates of the regression coefficients are shown to be consistent.

14.
A fast routine for converting regression algorithms into corresponding orthogonal regression (OR) algorithms was introduced in Ammann and Van Ness (1988). The present paper discusses the properties of various ordinary and robust OR procedures created using this routine. OR minimizes the sum of the orthogonal distances from the regression plane to the data points. OR has three types of applications. First, L2 OR is the maximum likelihood solution of the Gaussian errors-in-variables (EV) regression problem. This L2 solution is unstable, thus the robust OR algorithms created from robust regression algorithms should prove very useful. Secondly, OR is intimately related to principal components analysis. Therefore, the routine can also be used to create L1, robust, etc. principal components algorithms. Thirdly, OR treats the x and y variables symmetrically, which is important in many modeling problems. Using Monte Carlo studies this paper compares the performance of standard regression, robust regression, OR, and robust OR on Gaussian EV data, contaminated Gaussian EV data, heavy-tailed EV data, and contaminated heavy-tailed EV data.
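The link between L2 OR and principal components can be shown in a few lines: the orthogonal regression line is the first principal axis of the centred data. This is a sketch of plain L2 OR on simulated errors-in-variables data, not the robust procedures of the paper.

```python
import numpy as np

rng = np.random.default_rng(5)
n = 500
# errors-in-variables data: both coordinates are observed with noise
x_true = rng.uniform(-3.0, 3.0, n)
x = x_true + 0.3 * rng.standard_normal(n)
y = 2.0 * x_true + 1.0 + 0.3 * rng.standard_normal(n)

# L2 orthogonal regression = direction of the largest principal component
Z = np.column_stack([x, y])
Zc = Z - Z.mean(axis=0)
_, _, Vt = np.linalg.svd(Zc, full_matrices=False)
direction = Vt[0]                       # first principal axis
slope = direction[1] / direction[0]
intercept = Z[:, 1].mean() - slope * Z[:, 0].mean()
```

With equal error variances in x and y, OR recovers the true slope, whereas ordinary least squares of y on x would be attenuated towards zero.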

15.
The varying coefficient model (VCM) is an important generalization of the linear regression model, and many existing estimation procedures for the VCM were built on the L2 loss, which is popular for its mathematical beauty but is not robust to non-normal errors and outliers. In this paper, we address both the robustness and the efficiency of estimation and variable selection procedures based on a convex combination of the L1 and L2 losses, instead of only the quadratic loss, for the VCM. Using the local linear modeling method, the asymptotic normality of the estimator is derived, and a useful selection method is proposed for the weight of the composite L1 and L2 loss. The variable selection procedure is then given by combining local kernel smoothing with the adaptive group LASSO. With appropriate selection of tuning parameters by the Bayesian information criterion (BIC), the theoretical properties of the new procedure, including consistency in variable selection and the oracle property in estimation, are established. The finite sample performance of the new method is investigated through simulation studies and the analysis of body fat data. Numerical studies show that the new method is better than, or at least as good as, the least-squares-based method in terms of both robustness and efficiency for variable selection.

16.
Estimation of a regression function from independent and identically distributed data is considered. The L2 error with integration with respect to the design measure is used as the error criterion. Upper bounds on the L2 error of least squares regression estimates are presented, which bound the error of the estimate in the case that, in the sample given to the estimate, the values of the independent and the dependent variables are perturbed by some arbitrary procedure. The bounds are applied to analyze regression-based Monte Carlo methods for pricing American options in the case of errors in modelling the price process.

17.
In this paper a new multivariate regression estimate is introduced. It is based on ideas derived in the context of wavelet estimates and is constructed by hard thresholding of estimates of coefficients of a series expansion of the regression function. Multivariate functions constructed analogously to the classical Haar wavelets are used for the series expansion. These functions are orthogonal in L2(μn), where μn denotes the empirical design measure. The construction can be considered as designing adapted Haar wavelets.

18.
Let π1, …, πk be k (≥ 2) independent populations, where πi denotes the uniform distribution over the interval (0, θi) and θi > 0 (i = 1, …, k) is an unknown scale parameter. The population associated with the largest scale parameter is called the best population. For selecting the best population, we use a selection rule based on the natural estimators of θi, i = 1, …, k, for the case of unequal sample sizes. Consider the problem of estimating the scale parameter θL of the selected uniform population when sample sizes are unequal and the loss is measured by the squared log error (SLE) loss function. We derive the uniformly minimum risk unbiased (UMRU) estimator of θL under the SLE loss function, and two natural estimators of θL are also studied. For k = 2, we derive a sufficient condition for inadmissibility of an estimator of θL. Using this condition, we conclude that the UMRU estimator and the natural estimator are inadmissible. Finally, the risk functions of various competing estimators of θL are compared through simulation.

19.

The paper proposes a Bayesian interpretation of quantile regression that is shown to be equivalent to scale mixtures of normals leading to a skewed Laplace distribution. This representation of the model facilitates Bayesian analysis by means of Gibbs sampling with data augmentation, and nests regression in the L1 norm as a special case. The new methods are applied to an analysis of the patents–R&D relationship for U.S. firms and to unit root inference for the dollar–deutschemark exchange rate.

20.
By modifying the direct method to solve the overdetermined linear system we are able to present an algorithm for L1 estimation which appears to be superior computationally to any other known algorithm for the simple linear regression problem.
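For reference, the standard linear-programming formulation of simple linear L1 regression (not necessarily the article's modified direct method): split each residual into nonnegative parts u_i − v_i and minimise their sum.

```python
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(6)
n = 100
x = rng.uniform(0.0, 10.0, n)
y = 3.0 + 0.7 * x + rng.laplace(scale=1.0, size=n)

# L1 regression as an LP:
#   minimise sum_i (u_i + v_i)
#   subject to  y_i = b0 + b1*x_i + u_i - v_i,  u_i, v_i >= 0,  b0, b1 free
A_eq = np.hstack([np.ones((n, 1)), x[:, None], np.eye(n), -np.eye(n)])
b_eq = y
c = np.concatenate([np.zeros(2), np.ones(2 * n)])
bounds = [(None, None)] * 2 + [(0, None)] * (2 * n)
res = linprog(c, A_eq=A_eq, b_eq=b_eq, bounds=bounds, method="highs")
b0, b1 = res.x[:2]
```

The optimal basic solution interpolates as many observations as there are parameters, which is the geometric fact specialised algorithms for the simple linear case exploit.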


Copyright © Beijing Qinyun Technology Development Co., Ltd.  京ICP备09084417号