期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Automatic and asymptotically optimal data sharpening for nonparametric regression

Fang Yao Thomas C.M. Lee 《Journal of statistical planning and inference》2009,139(12):4017-4030

In this article we consider data-sharpening methods for nonparametric regression. In particular modifications are made to existing methods in the following two directions. First, we introduce a new tuning parameter to control the extent to which the data are to be sharpened, so that the amount of sharpening is adaptive and can be tuned to best suit the data at hand. We call this new parameter the sharpening parameter. Second, we develop automatic methods for jointly choosing the value of this sharpening parameter as well as the values of other required smoothing parameters. These automatic parameter selection methods are shown to be asymptotically optimal in a well defined sense. Numerical experiments were also conducted to evaluate their finite-sample performances. To the best of our knowledge, there is no bandwidth selection method developed in the literature for sharpened nonparametric regression. 相似文献

2.

Asymptotic normality of locally modelled regression estimator for functional data

Zhiyong Zhou Zhengyan Lin 《Journal of nonparametric statistics》2016,28(1):116-131

We focus on the nonparametric regression of a scalar response on a functional explanatory variable. As an alternative to the well-known Nadaraya-Watson estimator for regression function in this framework, the locally modelled regression estimator performs very well [cf. [Barrientos-Marin, J., Ferraty, F., and Vieu, P. (2010), ‘Locally Modelled Regression and Functional Data’, Journal of Nonparametric Statistics, 22, 617–632]. In this paper, the asymptotic properties of locally modelled regression estimator for functional data are considered. The mean-squared convergence as well as asymptotic normality for the estimator are established. We also adapt the empirical likelihood method to construct the point-wise confidence intervals for the regression function and derive the Wilk's phenomenon for the empirical likelihood inference. Furthermore, a simulation study is presented to illustrate our theoretical results. 相似文献

3.

Nonparametric regression for dependent data in the errors-in-variables problem

Toshio Honda 《Journal of statistical planning and inference》2010

We consider the nonparametric estimation of the regression functions for dependent data. Suppose that the covariates are observed with additive errors in the data and we employ nonparametric deconvolution kernel techniques to estimate the regression functions in this paper. We investigate how the strength of time dependence affects the asymptotic properties of the local constant and linear estimators. We treat both short-range dependent and long-range dependent linear processes in a unified way and demonstrate that the long-range dependence (LRD) of the covariates affects the asymptotic properties of the nonparametric estimators as well as the LRD of regression errors does. 相似文献

4.

Rate-optimal nonparametric estimation in classical and Berkson errors-in-variables problems

Aurore Delaigle Alexander Meister 《Journal of statistical planning and inference》2011,141(1):102-114

We consider nonparametric estimation of a regression curve when the data are observed with Berkson errors or with a mixture of classical and Berkson errors. In this context, other existing nonparametric procedures can either estimate the regression curve consistently on a very small interval or require complicated inversion of an estimator of the Fourier transform of a nonparametric regression estimator. We introduce a new estimation procedure which is simpler to implement, and study its asymptotic properties. We derive convergence rates which are faster than those previously obtained in the literature, and we prove that these rates are optimal. We suggest a data-driven bandwidth selector and apply our method to some simulated examples. 相似文献

5.

Asymptotic Distribution of Robust Estimator for Functional Nonparametric Models

Mohammed Attouch Ali Laksaci 《统计学通讯:理论与方法》2013,42(8):1317-1335

We propose a family of robust nonparametric estimators for regression function based on kernel method. We establish the asymptotic normality of the estimator under the concentration properties on small balls of the probability measure of the functional explanatory variables. Useful applications to prediction, discrimination in a semi-metric space, and confidence curves are given. In addition, to highlight the generality of our purpose and to emphasize the role of each of our hypotheses, several special cases of our general conditions are also discussed. Finally, some numerical study in chemiometrical real data are carried out to compare the sensitivity to outliers between the classical and robust regression. 相似文献

6.

Estimating nonlinear additive models with nonstationarities and correlated errors

Michael Vogt Christopher Walsh 《Scandinavian Journal of Statistics》2019,46(1):160-199

In this paper, we study a nonparametric additive regression model suitable for a wide range of time series applications. Our model includes a periodic component, a deterministic time trend, various component functions of stochastic explanatory variables, and an AR(p) error process that accounts for serial correlation in the regression error. We propose an estimation procedure for the nonparametric component functions and the parameters of the error process based on smooth backfitting and quasimaximum likelihood methods. Our theory establishes convergence rates and the asymptotic normality of our estimators. Moreover, we are able to derive an oracle‐type result for the estimators of the AR parameters: Under fairly mild conditions, the limiting distribution of our parameter estimators is the same as when the nonparametric component functions are known. Finally, we illustrate our estimation procedure by applying it to a sample of climate and ozone data collected on the Antarctic Peninsula. 相似文献

7.

Free-knot polynomial splines with confidence intervals

Wenxin Mao Linda H. Zhao 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2003,65(4):901-919

Summary. We construct approximate confidence intervals for a nonparametric regression function, using polynomial splines with free-knot locations. The number of knots is determined by generalized cross-validation. The estimates of knot locations and coefficients are obtained through a non-linear least squares solution that corresponds to the maximum likelihood estimate. Confidence intervals are then constructed based on the asymptotic distribution of the maximum likelihood estimator. Average coverage probabilities and the accuracy of the estimate are examined via simulation. This includes comparisons between our method and some existing methods such as smoothing spline and variable knots selection as well as a Bayesian version of the variable knots method. Simulation results indicate that our method works well for smooth underlying functions and also reasonably well for discontinuous functions. It also performs well for fairly small sample sizes. 相似文献

8.

Estimation for semi-functional linear regression

Tang Qingguo 《Statistics》2015,49(6):1262-1278

This paper studies estimation in semi-functional linear regression. A general formulation is used to treat mean regression, median regression, quantile regression and robust mean regression in one setting. The linear slope function is estimated by the functional principal component basis and the nonparametric component is approximated by a B-spline function. The global convergence rates of the estimators of unknown slope function and nonparametric component are established under suitable norm. The convergence rate of the mean-squared prediction error for the proposed estimators is also established. Finite sample properties of our procedures are studied through Monte Carlo simulations. A real data example about Berkeley growth data is used to illustrate our proposed methodology. 相似文献

9.

Nonparametric geometric outlier detection

Matias Heikkil 《Scandinavian Journal of Statistics》2019,46(4):1300-1314

Outlier detection is a major topic in robust statistics due to the high practical significance of anomalous observations. Many existing methods, however, either are parametric or cease to perform well when the data are far from linearly structured. In this paper, we propose a quantity, Delaunay outlyingness, that is a nonparametric outlyingness score applicable to data with complicated structure. The approach is based on a well‐known triangulation of the sample, which seems to reflect the sparsity of the pointset to different directions in a useful way. We derive results on the asymptotic behavior of Delaunay outlyingness in case of a sufficiently simple set of observations. Simulations and an application to empirical data are also discussed. 相似文献

10.

Multiple-output quantile regression through optimal quantization

Isabelle Charlier Davy Paindaveine Jérôme Saracco 《Scandinavian Journal of Statistics》2020,47(1):250-278

A new nonparametric quantile regression method based on the concept of optimal quantization was developed recently and was showed to provide estimators that often dominate their classical, kernel-type, competitors. In the present work, we extend this method to multiple-output regression problems. We show how quantization allows approximating population multiple-output regression quantiles based on halfspace depth. We prove that this approximation becomes arbitrarily accurate as the size of the quantization grid goes to infinity. We also derive a weak consistency result for a sample version of the proposed regression quantiles. Through simulations, we compare the performances of our estimators with (local constant and local bilinear) kernel competitors. The results reveal that the proposed quantization-based estimators, which are local constant in nature, outperform their kernel counterparts and even often dominate their local bilinear kernel competitors. The various approaches are also compared on artificial and real data. 相似文献

11.

Empirical likelihood confidence intervals for nonparametric functional data analysis

Heng Lian 《Journal of statistical planning and inference》2012

We consider the problem of constructing confidence intervals for nonparametric functional data analysis using empirical likelihood. In this doubly infinite-dimensional context, we demonstrate the Wilk's phenomenon and propose a bias-corrected construction that requires neither undersmoothing nor direct bias estimation. We also extend our results to partially linear regression models involving functional data. Our numerical results demonstrate improved performance of the empirical likelihood methods over normal approximation-based methods. 相似文献

12.

A ‘nondecimated’ lifting transform

Marina I. Knight Guy P. Nason 《Statistics and Computing》2009,19(1):1-16

Classical nondecimated wavelet transforms are attractive for many applications. When the data comes from complex or irregular designs, the use of second generation wavelets in nonparametric regression has proved superior to that of classical wavelets. However, the construction of a nondecimated second generation wavelet transform is not obvious. In this paper we propose a new ‘nondecimated’ lifting transform, based on the lifting algorithm which removes one coefficient at a time, and explore its behavior. Our approach also allows for embedding adaptivity in the transform, i.e. wavelet functions can be constructed such that their smoothness adjusts to the local properties of the signal. We address the problem of nonparametric regression and propose an (averaged) estimator obtained by using our nondecimated lifting technique teamed with empirical Bayes shrinkage. Simulations show that our proposed method has higher performance than competing techniques able to work on irregular data. Our construction also opens avenues for generating a ‘best’ representation, which we shall explore. 相似文献

13.

The average area under correlated receiver operating characteristic curves: a nonparametric approach based on generalized two-sample Wilcoxon statistics 总被引：2，自引：0，他引：2

Mei-Ling Ting Lee & Bernard A. Rosner 《Journal of the Royal Statistical Society. Series C, Applied statistics》2001,50(3):337-344

It is well known that, when sample observations are independent, the area under the receiver operating characteristic (ROC) curve corresponds to the Wilcoxon statistics if the area is calculated by the trapezoidal rule. Correlated ROC curves arise often in medical research and have been studied by various parametric methods. On the basis of the Mann–Whitney U-statistics for clustered data proposed by Rosner and Grove, we construct an average ROC curve and derive nonparametric methods to estimate the area under the average curve for correlated ROC curves obtained from multiple readers. For the more complicated case where, in addition to multiple readers examining results on the same set of individuals, two or more diagnostic tests are involved, we derive analytic methods to compare the areas under correlated average ROC curves for these diagnostic tests. We demonstrate our methods in an example and compare our results with those obtained by other methods. The nonparametric average ROC curve and the analytic methods that we propose are easy to explain and simple to implement. 相似文献

14.

Smooth backfitting in practice

Jens Perch Nielsen Stefan Sperlich 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2005,67(1):43-61

Summary. Compared with the classical backfitting of Buja, Hastie and Tibshirani, the smooth backfitting estimator (SBE) of Mammen, Linton and Nielsen not only provides complete asymptotic theory under weaker conditions but is also more efficient, robust and easier to calculate. However, the original paper describing the SBE method is complex and the practical as well as the theoretical advantages of the method have still neither been recognized nor accepted by the statistical community. We focus on a clear presentation of the idea, the main theoretical results and practical aspects like implementation and simplification of the algorithm. We introduce a feasible cross-validation procedure and apply it to the problem of data-driven bandwidth choice for the SBE. By simulations it is shown that the SBE and our cross-validation work very well indeed. In particular, the SBE is less affected by sparseness of data in high dimensional regression problems or strongly correlated designs. The SBE has reasonable performance even in 100-dimensional additive regression problems. 相似文献

15.

Local polynomial regression and simulation–extrapolation

John Staudenmayer David Ruppert 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2004,66(1):17-30

Summary. The paper introduces a new local polynomial estimator and develops supporting asymptotic theory for nonparametric regression in the presence of covariate measurement error. We address the measurement error with Cook and Stefanski's simulation–extrapolation (SIMEX) algorithm. Our method improves on previous local polynomial estimators for this problem by using a bandwidth selection procedure that addresses SIMEX's particular estimation method and considers higher degree local polynomial estimators. We illustrate the accuracy of our asymptotic expressions with a Monte Carlo study, compare our method with other estimators with a second set of Monte Carlo simulations and apply our method to a data set from nutritional epidemiology. SIMEX was originally developed for parametric models. Although SIMEX is, in principle, applicable to nonparametric models, a serious problem arises with SIMEX in nonparametric situations. The problem is that smoothing parameter selectors that are developed for data without measurement error are no longer appropriate and can result in considerable undersmoothing. We believe that this is the first paper to address this difficulty. 相似文献

16.

Robust difference-based outlier detection

Chun Gun Park 《统计学通讯:理论与方法》2020,49(22):5553-5577

Abstract

In this paper, we propose an outlier-detection approach that uses the properties of an intercept estimator in a difference-based regression model (DBRM) that we first introduce. This DBRM uses multiple linear regression, and invented it to detect outliers in a multiple linear regression. Our outlier-detection approach uses only the intercept; it does not require estimates for the other parameters in the DBRM. In this paper, we first employed a difference-based intercept estimator to study the outlier-detection problem in a multiple regression model. We compared our approach with several existing methods in a simulation study and the results suggest that our approach outperformed the others. We also demonstrated the advantage of our approach using a real data application. Our approach can extend to nonparametric regression models for outliers detection. 相似文献

17.

Prediction intervals for ordinary and dual generalized order statistics from two independent sequences

M. M. Mohie El-Din 《统计学通讯:模拟与计算》2013,42(9):2703-2723

ABSTRACT

Based on the observed dual generalized order statistics drawn from an arbitrary unknown distribution, nonparametric two-sided prediction intervals as well as prediction upper and lower bounds for an ordinary and a dual generalized order statistic from another iid sequence with the same distribution are developed. The prediction intervals for dual generalized order statistics based on the observed ordinary generalized order statistics are also developed. The coverage probabilities of these prediction intervals are exact and free of the parent distribution, F. Finally, numerical computations and real examples of the coverage probabilities are presented for choosing the appropriate limits of the prediction. 相似文献

18.

Local comparison of empirical distributions via nonparametric regression

《Journal of Statistical Computation and Simulation》2012,82(12):2384-2405

Given two independent samples of size n and m drawn from univariate distributions with unknown densities f and g, respectively, we are interested in identifying subintervals where the two empirical densities deviate significantly from each other. The solution is built by turning the nonparametric density comparison problem into a comparison of two regression curves. Each regression curve is created by binning the original observations into many small size bins, followed by a suitable form of root transformation to the binned data counts. Turned as a regression comparison problem, several nonparametric regression procedures for detection of sparse signals can be applied. Both multiple testing and model selection methods are explored. Furthermore, an approach for estimating larger connected regions where the two empirical densities are significantly different is also derived, based on a scale-space representation. The proposed methods are applied on simulated examples as well as real-life data from biology. 相似文献

19.

Bayesian spectral analysis models for quantile regression with Dirichlet process mixtures

Seongil Jo Taeyoung Roh 《Journal of nonparametric statistics》2016,28(1):177-206

This paper presents a Bayesian analysis of partially linear additive models for quantile regression. We develop a semiparametric Bayesian approach to quantile regression models using a spectral representation of the nonparametric regression functions and the Dirichlet process (DP) mixture for error distribution. We also consider Bayesian variable selection procedures for both parametric and nonparametric components in a partially linear additive model structure based on the Bayesian shrinkage priors via a stochastic search algorithm. Based on the proposed Bayesian semiparametric additive quantile regression model referred to as BSAQ, the Bayesian inference is considered for estimation and model selection. For the posterior computation, we design a simple and efficient Gibbs sampler based on a location-scale mixture of exponential and normal distributions for an asymmetric Laplace distribution, which facilitates the commonly used collapsed Gibbs sampling algorithms for the DP mixture models. Additionally, we discuss the asymptotic property of the sempiparametric quantile regression model in terms of consistency of posterior distribution. Simulation studies and real data application examples illustrate the proposed method and compare it with Bayesian quantile regression methods in the literature. 相似文献

20.

Component selection in additive quantile regression models

《Journal of the Korean Statistical Society》2014,43(3):439-452

Nonparametric additive models are powerful techniques for multivariate data analysis. Although many procedures have been developed for estimating additive components both in mean regression and quantile regression, the problem of selecting relevant components has not been addressed much especially in quantile regression. We present a doubly-penalized estimation procedure for component selection in additive quantile regression models that combines basis function approximation with a ridge-type penalty and a variant of the smoothly clipped absolute deviation penalty. We show that the proposed estimator identifies relevant and irrelevant components consistently and achieves the nonparametric optimal rate of convergence for the relevant components. We also provide an accurate and efficient computation algorithm to implement the estimator and demonstrate its performance through simulation studies. Finally, we illustrate our method via a real data example to identify important body measurements to predict percentage of body fat of an individual. 相似文献