期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Flexible empirical Bayes estimation for wavelets

Merlise Clyde & Edward I. George 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2000,62(4):681-698

Wavelet shrinkage estimation is an increasingly popular method for signal denoising and compression. Although Bayes estimators can provide excellent mean-squared error (MSE) properties, the selection of an effective prior is a difficult task. To address this problem, we propose empirical Bayes (EB) prior selection methods for various error distributions including the normal and the heavier-tailed Student t -distributions. Under such EB prior distributions, we obtain threshold shrinkage estimators based on model selection, and multiple-shrinkage estimators based on model averaging. These EB estimators are seen to be computationally competitive with standard classical thresholding methods, and to be robust to outliers in both the data and wavelet domains. Simulated and real examples are used to illustrate the flexibility and improved MSE performance of these methods in a wide variety of settings. 相似文献

2.

Robust estimation for boundary correction in wavelet regression

《Journal of Statistical Computation and Simulation》2012,82(10):1531-1544

For boundary problems present in wavelet regression, two common methods are usually considered: polynomial wavelet regression (PWR) and hybrid local polynomial wavelet regression (LPWR). Normality assumption played a key role for making such choices for the order of the low-order polynomial, the wavelet thresholding value and other calculations involved in LPWR. However, in practice, the normality assumption may not be valid. In this paper, for PWR, we propose three automatic robust methods based on: MM-estimator, bootstrap and robust threshold procedure. For LPWR, the use of a robust local polynomial (RLP) estimator with a robust threshold procedure has been investigated. The proposed methods do not require any knowledge of noise distribution, are easy to implement and achieve high performances when only a small amount of data is in hand. A simulation study is conducted to assess the numerical performance of the proposed methods. 相似文献

3.

Outlier detection and robust variable selection via the penalized weighted LAD-LASSO method

Yunlu Jiang Yan Wang Jiantao Zhang Baojian Xie Jibiao Liao Wenhui Liao 《Journal of applied statistics》2021,48(2):234

This paper studies the outlier detection and robust variable selection problem in the linear regression model. The penalized weighted least absolute deviation (PWLAD) regression estimation method and the adaptive least absolute shrinkage and selection operator (LASSO) are combined to simultaneously achieve outlier detection, and robust variable selection. An iterative algorithm is proposed to solve the proposed optimization problem. Monte Carlo studies are evaluated the finite-sample performance of the proposed methods. The results indicate that the finite sample performance of the proposed methods performs better than that of the existing methods when there are leverage points or outliers in the response variable or explanatory variables. Finally, we apply the proposed methodology to analyze two real datasets. 相似文献

4.

Regularisation Parameter Selection Via Bootstrapping

下载免费PDF全文

Zhen Pang Bingqing Lin Jiming Jiang 《Australian & New Zealand Journal of Statistics》2016,58(3):335-356

Penalised likelihood methods, such as the least absolute shrinkage and selection operator (Lasso) and the smoothly clipped absolute deviation penalty, have become widely used for variable selection in recent years. These methods impose penalties on regression coefficients to shrink a subset of them towards zero to achieve parameter estimation and model selection simultaneously. The amount of shrinkage is controlled by the regularisation parameter. Popular approaches for choosing the regularisation parameter include cross‐validation, various information criteria and bootstrapping methods that are based on mean square error. In this paper, a new data‐driven method for choosing the regularisation parameter is proposed and the consistency of the method is established. It holds not only for the usual fixed‐dimensional case but also for the divergent setting. Simulation results show that the new method outperforms other popular approaches. An application of the proposed method to motif discovery in gene expression analysis is included in this paper. 相似文献

5.

Wavelet threshold based on Stein's unbiased risk estimators of restricted location parameter in multivariate normal

H. Karamikabir M. Afshari F. Lak 《Journal of applied statistics》2021,48(10):1712

In this paper, the problem of estimating the mean vector under non-negative constraints on location vector of the multivariate normal distribution is investigated. The value of the wavelet threshold based on Stein''s unbiased risk estimators is calculated for the shrinkage estimator in restricted parameter space. We suppose that covariance matrix is unknown and we find the dominant class of shrinkage estimators under Balance loss function. The performance evaluation of the proposed class of estimators is checked through a simulation study by using risk and average mean square error values. 相似文献

6.

Bayesian False Discovery Rate Wavelet Shrinkage: Theory and Applications

Ilya Lavrik Yoon Young Jung Brani Vidakovic 《统计学通讯:模拟与计算》2013,42(6):1086-1100

Statistical inference in the wavelet domain remains a vibrant area of contemporary statistical research because of desirable properties of wavelet representations and the need of scientific community to process, explore, and summarize massive data sets. Prime examples are biomedical, geophysical, and internet related data. We propose two new approaches to wavelet shrinkage/thresholding.

In the spirit of Efron and Tibshirani's recent work on local false discovery rate, we propose Bayesian Local False Discovery Rate (BLFDR), where the underlying model on wavelet coefficients does not assume known variances. This approach to wavelet shrinkage is shown to be connected with shrinkage based on Bayes factors. The second proposal, Bayesian False Discovery Rate (BaFDR), is based on ordering of posterior probabilities of hypotheses on true wavelets coefficients being null, in Bayesian testing of multiple hypotheses.

We demonstrate that both approaches result in competitive shrinkage methods by contrasting them to some popular shrinkage techniques. 相似文献

7.

A modified generalized lasso algorithm to detect local spatial clusters for count data

Hosik Choi Eunjung Song Seung-sik Hwang Woojoo Lee 《AStA Advances in Statistical Analysis》2018,102(4):537-563

Detecting local spatial clusters for count data is an important task in spatial epidemiology. Two broad approaches—moving window and disease mapping methods—have been suggested in some of the literature to find clusters. However, the existing methods employ somewhat arbitrarily chosen tuning parameters, and the local clustering results are sensitive to the choices. In this paper, we propose a penalized likelihood method to overcome the limitations of existing local spatial clustering approaches for count data. We start with a Poisson regression model to accommodate any type of covariates, and formulate the clustering problem as a penalized likelihood estimation problem to find change points of intercepts in two-dimensional space. The cost of developing a new algorithm is minimized by modifying an existing least absolute shrinkage and selection operator algorithm. The computational details on the modifications are shown, and the proposed method is illustrated with Seoul tuberculosis data. 相似文献

8.

Multivariate Bayes Wavelet shrinkage and applications

Gabriel Huerta 《Journal of applied statistics》2005,32(5):529-542

In recent years, wavelet shrinkage has become a very appealing method for data de-noising and density function estimation. In particular, Bayesian modelling via hierarchical priors has introduced novel approaches for Wavelet analysis that had become very popular, and are very competitive with standard hard or soft thresholding rules. In this sense, this paper proposes a hierarchical prior that is elicited on the model parameters describing the wavelet coefficients after applying a Discrete Wavelet Transformation (DWT). In difference to other approaches, the prior proposes a multivariate Normal distribution with a covariance matrix that allows for correlations among Wavelet coefficients corresponding to the same level of detail. In addition, an extra scale parameter is incorporated that permits an additional shrinkage level over the coefficients. The posterior distribution for this shrinkage procedure is not available in closed form but it is easily sampled through Markov chain Monte Carlo (MCMC) methods. Applications on a set of test signals and two noisy signals are presented. 相似文献

9.

WLAD-LASSO method for robust estimation and variable selection in partially linear models

Hu Yang 《统计学通讯:理论与方法》2018,47(20):4958-4976

This paper focuses on robust estimation and variable selection for partially linear models. We combine the weighted least absolute deviation (WLAD) regression with the adaptive least absolute shrinkage and selection operator (LASSO) to achieve simultaneous robust estimation and variable selection for partially linear models. Compared with the LAD-LASSO method, the WLAD-LASSO method will resist to the heavy-tailed errors and outliers in the parametric components. In addition, we estimate the unknown smooth function by a robust local linear regression. Under some regular conditions, the theoretical properties of the proposed estimators are established. We further examine finite-sample performance of the proposed procedure by simulation studies and a real data example. 相似文献

10.

Statistical properties of parametric estimators for Markov chain vectors based on copula models

Wende Yi Stephen Shaoyi Liao 《Journal of statistical planning and inference》2010

To estimate and measure risks, two key classes of dependence relationship must be identified: temporal dependence and contemporaneous dependence. In this paper, we propose a parametric estimation model that uses a three-stage pseudo maximum likelihood estimation (3SPMLE), and we investigate the consistency and asymptotic normality of parametric estimators. The proposed model combines the concept of a copula and the methods of parametric estimators of two-stage pseudo maximum likelihood estimation (2SPMLE). The selection of a copula model that best captures the dependence structure is a critical problem. To solve this problem, we propose a model selection method that is based on the parametric pseudo-likelihood ratio under the 3SPMLE for stationary Markov vector-type models. 相似文献

11.

On the advantages of the non-concave penalized likelihood model selection method with minimum prediction errors in large-scale medical studies

A. Karagrigoriou K. Mylona 《Journal of applied statistics》2010,37(1):13-24

Variable and model selection problems are fundamental to high-dimensional statistical modeling in diverse fields of sciences. Especially in health studies, many potential factors are usually introduced to determine an outcome variable. This paper deals with the problem of high-dimensional statistical modeling through the analysis of the trauma annual data in Greece for 2005. The data set is divided into the experiment and control sets and consists of 6334 observations and 112 factors that include demographic, transport and intrahospital data used to detect possible risk factors of death. In our study, different model selection techniques are applied to the experiment set and the notion of deviance is used on the control set to assess the fit of the overall selected model. The statistical methods employed in this work were the non-concave penalized likelihood methods, smoothly clipped absolute deviation, least absolute shrinkage and selection operator, and Hard, the generalized linear logistic regression, and the best subset variable selection.The way of identifying the significant variables in large medical data sets along with the performance and the pros and cons of the various statistical techniques used are discussed. The performed analysis reveals the distinct advantages of the non-concave penalized likelihood methods over the traditional model selection techniques. 相似文献

12.

A fast wavelet approach for recovering damaged images

Donghoh Kim Youngjo Lee 《Journal of applied statistics》2008,35(8):927-938

A wavelet method is proposed for recovering damaged images. The proposed method combines wavelet shrinkage with preprocessing based on a binning process and an imputation procedure that is designed to extend the scope of wavelet shrinkage to data with missing values and perturbed locations. The proposed algorithm, termed as the BTW algorithm is simple to implement and efficient for recovering an image. Furthermore, this algorithm can be easily applied to wavelet regression for one-dimensional (1-D) signal estimation with irregularly spaced data. Simulation studies and real examples show that the proposed method can produce substantially effective results. 相似文献

13.

Real nonparametric regression using complex wavelets

Stuart Barber Guy P. Nason 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2004,66(4):927-939

Summary. Wavelet shrinkage is an effective nonparametric regression technique, especially when the underlying curve has irregular features such as spikes or discontinuities. The basic idea is simple: take the discrete wavelet transform of data consisting of a signal corrupted by noise; shrink or remove the wavelet coefficients to remove the noise; then invert the discrete wavelet transform to form an estimate of the true underlying curve. Various researchers have proposed increasingly sophisticated methods of doing this by using real-valued wavelets. Complex-valued wavelets exist but are rarely used. We propose two new complex-valued wavelet shrinkage techniques: one based on multiwavelet style shrinkage and the other using Bayesian methods. Extensive simulations show that our methods almost always give significantly more accurate estimates than methods based on real-valued wavelets. Further, our multiwavelet style shrinkage method is both simpler and dramatically faster than its competitors. To understand the excellent performance of this method we present a new risk bound on its hard thresholded coefficients. 相似文献

14.

M-estimator-based robust estimation of the number of components of a superimposed sinusoidal signal model

Sharmishtha Mitra 《Journal of applied statistics》2014,41(4):853-878

In this paper, we consider the problem of estimating the number of components of a superimposed nonlinear sinusoids model of a signal in the presence of additive noise. We propose and provide a detailed empirical comparison of robust methods for estimation of the number of components. The proposed methods, which are robust modifications of the commonly used information theoretic criteria, are based on various M-estimator approaches and are robust with respect to outliers present in the data and heavy-tailed noise. The proposed methods are compared with the usual non-robust methods through extensive simulations under varied model scenarios. We also present real signal analysis of two speech signals to show the usefulness of the proposed methodology. 相似文献

15.

Adaptive density estimation: A curse of support?

Patricia Reynaud-Bouret Vincent Rivoirard Christine Tuleau-Malot 《Journal of statistical planning and inference》2011,141(1):115-139

This paper deals with the classical problem of density estimation on the real line. Most of the existing papers devoted to minimax properties assume that the support of the underlying density is bounded and known. But this assumption may be very difficult to handle in practice. In this work, we show that, exactly as a curse of dimensionality exists when the data lie in R^d, there exists a curse of support as well when the support of the density is infinite. As for the dimensionality problem where the rates of convergence deteriorate when the dimension grows, the minimax rates of convergence may deteriorate as well when the support becomes infinite. This problem is not purely theoretical since the simulations show that the support-dependent methods are really affected in practice by the size of the density support, or by the weight of the density tail. We propose a method based on a biorthogonal wavelet thresholding rule that is adaptive with respect to the nature of the support and the regularity of the signal, but that is also robust in practice to this curse of support. The threshold, that is proposed here, is very accurately calibrated so that the gap between optimal theoretical and practical tuning parameters is almost filled. 相似文献

16.

基于拟似然方法的股票收益与波动率关系及其应用研究

林金官等《统计研究》2018,35(5):99-109

股票市场中收益与波动率的关系研究在金融证券领域起着很重要的作用,而随机波动率模型能够很好地拟合这种关系。本文将拟似然方法和渐近拟似然方法运用在随机波动率模型的参数估计方面,渐近拟似然方法可以避免因为人为的结构错误指定而造成的偏差,比较稳健。本文采用拟似然和渐近拟似然方法对随机波动率模型的参数估计进行了模拟探索,并和两种已有估计方法进行了对比,结果表明拟似然和渐近拟似然方法在模型的参数估计方面有着很好的估计结果。实证研究中,选取2000-2015年标普500指数作为研究对象,结果显示所选数据具有金融时间序列的常见特征。本文为金融证券领域中股票收益与波动率关系及其应用研究提供了一定的启示。相似文献

17.

Shrinkage tuning parameter selection in precision matrices estimation

Heng Lian 《Journal of statistical planning and inference》2011,141(8):2839-2848

Recent literature provides many computational and modeling approaches for covariance matrices estimation in a penalized Gaussian graphical models but relatively little study has been carried out on the choice of the tuning parameter. This paper tries to fill this gap by focusing on the problem of shrinkage parameter selection when estimating sparse precision matrices using the penalized likelihood approach. Previous approaches typically used K-fold cross-validation in this regard. In this paper, we first derived the generalized approximate cross-validation for tuning parameter selection which is not only a more computationally efficient alternative, but also achieves smaller error rate for model fitting compared to leave-one-out cross-validation. For consistency in the selection of nonzero entries in the precision matrix, we employ a Bayesian information criterion which provably can identify the nonzero conditional correlations in the Gaussian model. Our simulations demonstrate the general superiority of the two proposed selectors in comparison with leave-one-out cross-validation, 10-fold cross-validation and Akaike information criterion. 相似文献

18.

Robust estimation of complicated profiles using wavelets

Hamid Shahriari 《统计学通讯:理论与方法》2017,46(4):1573-1593

Some quality characteristics are well defined when treated as the response variables and their relationships are identified to some independent variables. This relationship is called a profile. The parametric models, such as linear models, may be used to model the profiles. However, due to the complexity of many processes in practical applications, it is inappropriate to model the process using parametric models. In these cases non parametric methods are used to model the processes. One of the most applicable non parametric methods used to model complicated profiles is the wavelet. Many authors considered the use of the wavelet transformation only for monitoring the processes in phase II. The problem of estimating the in-control profile in phase I using wavelet transformation is not deeply addressed. Usually classical estimators are used in phase I to estimate the in-control profiles, even when the wavelet transformation is used. These estimators are suitable if the data do not contain outliers. However, when the outliers exist, these estimators cannot estimate the in-control profile properly. In this research, a robust method of estimating the in-control profiles is proposed, which is insensitive to the presence of outliers and could be applied when the wavelet transformation is used. The proposed estimator is the combination of the robust clustering and the S-estimator. This estimator is compared with the classical estimator of the in-control profile in the presence of outliers. The results from a large simulation study show that using the proposed method, one can estimate the in-control profile precisely when the data are contaminated either locally or globally. 相似文献

19.

Segmentation of the mean of heteroscedastic data via cross-validation 总被引：1，自引：0，他引：1

Sylvain Arlot Alain Celisse 《Statistics and Computing》2011,21(4):613-632

This paper tackles the problem of detecting abrupt changes in the mean of a heteroscedastic signal by model selection, without knowledge on the variations of the noise. A new family of change-point detection procedures is proposed, showing that cross-validation methods can be successful in the heteroscedastic framework, whereas most existing procedures are not robust to heteroscedasticity. The robustness to heteroscedasticity of the proposed procedures is supported by an extensive simulation study, together with recent partial theoretical results. An application to Comparative Genomic Hybridization (CGH) data is provided, showing that robustness to heteroscedasticity can indeed be required for their analysis. 相似文献

20.

Robust principal component analysis via ES-algorithm

《Journal of the Korean Statistical Society》2014,43(1):149-159

In this paper, a new method for robust principal component analysis (PCA) is proposed. PCA is a widely used tool for dimension reduction without substantial loss of information. However, the classical PCA is vulnerable to outliers due to its dependence on the empirical covariance matrix. To avoid such weakness, several alternative approaches based on robust scatter matrix were suggested. A popular choice is ROBPCA that combines projection pursuit ideas with robust covariance estimation via variance maximization criterion. Our approach is based on the fact that PCA can be formulated as a regression-type optimization problem, which is the main difference from the previous approaches. The proposed robust PCA is derived by substituting square loss function with a robust penalty function, Huber loss function. A practical algorithm is proposed in order to implement an optimization computation, and furthermore, convergence properties of the algorithm are investigated. Results from a simulation study and a real data example demonstrate the promising empirical properties of the proposed method. 相似文献