11 search results (retrieved in 15 ms)
1.
In this article, we present a compressive sensing based framework for generalized linear model regression that employs a two-component noise model and convex optimization techniques to simultaneously detect outliers and determine optimally sparse representations of noisy data from arbitrary sets of basis functions. We then extend our model to include model order reduction capabilities that can uncover inherent sparsity in regression coefficients and achieve simple, superior fits. Next, we use the mixed ℓ2/ℓ1 norm to develop another model that can efficiently uncover block-sparsity in regression coefficients. By performing model order reduction over all independent variables and basis functions, our algorithms successfully deemphasize the effect of independent variables that become uncorrelated with dependent variables. This desirable property has various applications in real-time anomaly detection, such as faulty sensor detection and sensor jamming in wireless sensor networks. After developing our framework and inheriting a stable recovery theorem from compressive sensing theory, we present two simulation studies on sparse or block-sparse problems that demonstrate the superior performance of our algorithms with respect to (1) classic outlier-robust regression techniques such as least absolute value and iteratively reweighted least squares and (2) classic sparsity-regularized regression techniques such as the LASSO.
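A minimal sketch of the core idea: a two-component noise model y = Xβ + e + n with sparse coefficients β and a sparse outlier vector e reduces to an ℓ1-penalized least-squares problem on the augmented design [X, I]. The sketch below solves it with plain ISTA (proximal gradient) rather than the authors' solver; all names and parameter choices are illustrative, not from the paper.

```python
import numpy as np

def soft(v, t):
    """Soft-thresholding operator, the proximal map of the l1 norm."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def robust_sparse_fit(X, y, lam=0.2, iters=2000):
    """Jointly estimate sparse coefficients and sparse outliers by solving
    min_z 0.5*||y - A z||^2 + lam*||z||_1 with A = [X, I] via ISTA."""
    n, p = X.shape
    A = np.hstack([X, np.eye(n)])     # augmented design: [coefficients | outliers]
    L = np.linalg.norm(A, 2) ** 2     # Lipschitz constant of the smooth part
    z = np.zeros(p + n)
    for _ in range(iters):
        z = soft(z - A.T @ (A @ z - y) / L, lam / L)
    return z[:p], z[p:]               # (beta_hat, outlier_hat)

rng = np.random.default_rng(0)
X = rng.standard_normal((60, 10))
beta = np.zeros(10); beta[:2] = [3.0, -2.0]   # truly sparse coefficients
y = X @ beta + 0.01 * rng.standard_normal(60)
y[5] += 10.0                                  # one gross outlier
b, e = robust_sparse_fit(X, y)
```

Because the outlier is absorbed into e, the coefficient estimate b is barely affected by the corrupted observation, unlike a plain LASSO fit on the same data.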
2.
Neil A. Butler & Michael C. Denham 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2000,62(3):585-593
Partial least squares regression has been widely adopted in some areas as a useful alternative to ordinary least squares regression, in the manner of other shrinkage methods such as principal components regression and ridge regression. In this paper we examine the nature of this shrinkage and demonstrate that partial least squares regression exhibits some undesirable properties.
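To make the method concrete, here is a minimal NIPALS-style PLS1 implementation (a sketch, not the authors' code). A classical sanity check, used below, is that with a full set of components PLS reproduces the ordinary least squares fit, while fewer components shrink toward simpler fits.

```python
import numpy as np

def pls1(X, y, n_components=2):
    """Minimal NIPALS-style PLS1 regression: extract components that
    maximize covariance with y, then express the fit as coefficients
    in the original predictor space via beta = W (P^T W)^{-1} q."""
    X = X - X.mean(0)
    y = y - y.mean()
    Xk = X.copy()
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xk.T @ y                      # weight: direction of max covariance
        w = w / np.linalg.norm(w)
        t = Xk @ w                        # score vector
        P.append(Xk.T @ t / (t @ t))      # loading
        q.append((t @ y) / (t @ t))       # regression of y on the score
        Xk = Xk - np.outer(t, P[-1])      # deflate X
        W.append(w)
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    return W @ np.linalg.solve(P.T @ W, q)
```

Note that, in line with the paper's point, the implied shrinkage is not as well behaved as ridge regression's: the PLS shrinkage factors need not all lie below one.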
3.
Based on the Skellam (Poisson difference) distribution, an extended binomial distribution is introduced as a byproduct of extending Moran's characterization of the Poisson distribution to the Skellam distribution. Basic properties of the distribution are investigated, and estimators of the distribution parameters are obtained. Applications to real data are also described.
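For reference, the Skellam distribution is the law of K = N1 − N2 for independent Poisson variables with means μ1 and μ2. A direct, unoptimized sketch of its pmf via the truncated convolution sum (log-space Poisson terms avoid overflow):

```python
import math

def poisson_pmf(lam, n):
    """Poisson pmf computed in log space for numerical stability."""
    return math.exp(n * math.log(lam) - lam - math.lgamma(n + 1))

def skellam_pmf(k, mu1, mu2, terms=200):
    """Skellam (Poisson difference) pmf from its definition K = N1 - N2,
    truncating the convolution sum over N2 = j (a sketch, not optimized)."""
    return sum(poisson_pmf(mu1, k + j) * poisson_pmf(mu2, j)
               for j in range(max(0, -k), terms))
```

The mean and variance of the Skellam distribution are μ1 − μ2 and μ1 + μ2, which gives a quick consistency check on the implementation.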
4.
In investigating the correlation between an alcohol biomarker and self-report, we developed a method to estimate the canonical correlation between two high-dimensional random vectors with a small sample size. In reviewing the relevant literature, we found that our method is somewhat similar to an existing method, but that the existing method has been criticized as lacking theoretical grounding in comparison with an alternative approach. We provide theoretical and empirical grounding for our method, and we customize it for our application to produce a novel method, which selects linear combinations that are step functions with a small number of steps.
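Classical canonical correlation analysis, on which the paper's high-dimensional small-sample variant builds, can be computed stably via the QR/SVD approach; a small illustrative sketch (this is the textbook method, not the authors' procedure):

```python
import numpy as np

def canonical_correlations(X, Y):
    """Canonical correlations between two centered data matrices via the
    QR/SVD (Bjorck-Golub) method: orthonormalize each block, then take
    the singular values of the cross-product of the orthonormal bases."""
    X = X - X.mean(0)
    Y = Y - Y.mean(0)
    Qx, _ = np.linalg.qr(X)
    Qy, _ = np.linalg.qr(Y)
    return np.linalg.svd(Qx.T @ Qy, compute_uv=False)
```

When the dimension approaches or exceeds the sample size this estimator degenerates (correlations are driven toward one), which is the problem the paper's regularized, step-function-constrained method addresses.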
5.
Gareth M. James & Trevor J. Hastie 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2001,63(3):533-550
We introduce a technique for extending the classical method of linear discriminant analysis (LDA) to data sets where the predictor variables are curves or functions. This procedure, which we call functional linear discriminant analysis (FLDA), is particularly useful when only fragments of the curves are observed. All the techniques associated with LDA can be extended for use with FLDA. In particular, FLDA can be used to produce classifications of new (test) curves, give an estimate of the discriminant function between classes, and provide a one- or two-dimensional pictorial representation of a set of curves. We also extend this procedure to provide generalizations of quadratic and regularized discriminant analysis.
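The basic ingredients can be sketched as follows: project each (possibly fragmentary) curve onto a fixed basis by least squares, then apply Fisher's LDA to the coefficient vectors. A polynomial basis stands in for the spline basis the method would normally use, and all names are illustrative; the actual FLDA procedure fits a random-effects model rather than per-curve projections.

```python
import numpy as np

def basis(t, degree=3):
    """Polynomial basis evaluated at time points t (a simple stand-in
    for a spline basis)."""
    return np.vander(t, degree + 1, increasing=True)

def curve_coeffs(t, x, degree=3):
    """Project an observed curve fragment onto the basis by least squares."""
    return np.linalg.lstsq(basis(t, degree), x, rcond=None)[0]

def lda_direction(C0, C1):
    """Fisher discriminant direction between two groups of coefficient
    vectors (small ridge term guards against a singular scatter matrix)."""
    m0, m1 = C0.mean(0), C1.mean(0)
    Sw = np.cov(C0.T) * (len(C0) - 1) + np.cov(C1.T) * (len(C1) - 1)
    return np.linalg.solve(Sw + 1e-6 * np.eye(len(m0)), m1 - m0)
```

New curves are classified by computing their basis coefficients and projecting onto the discriminant direction.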
6.
We develop our previous work concerning the identification of the collection of significant factors determining a (generally nonbinary) random response variable. Such identification is important, e.g., in biological and medical studies. Our approach is to examine the quality with which functions in (a certain subset of) the factors predict the response variable. The prediction error estimation requires a cross-validation procedure, a prediction algorithm, and estimation of the penalty function. Using simulated data, we demonstrate the efficiency of our method. We also prove a new central limit theorem for the introduced regularized estimates under natural conditions for arrays of exchangeable random variables.
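The generic cross-validated prediction-error estimate underlying this approach can be sketched as follows; the penalty-function and regularization parts of the method are omitted, and all names are illustrative.

```python
import numpy as np

def cv_error(X, y, fit, folds=5):
    """K-fold cross-validated mean squared prediction error.
    `fit(Xtr, ytr)` must return a predictor function."""
    idx = np.arange(len(y))
    errs = []
    for k in range(folds):
        test = idx[k::folds]                     # interleaved fold assignment
        train = np.setdiff1d(idx, test)
        f = fit(X[train], y[train])
        errs.append(np.mean((f(X[test]) - y[test]) ** 2))
    return float(np.mean(errs))

def ls_on(cols):
    """Least-squares predictor restricted to a chosen subset of factors."""
    def fit(Xtr, ytr):
        b = np.linalg.lstsq(Xtr[:, cols], ytr, rcond=None)[0]
        return lambda Xte: Xte[:, cols] @ b
    return fit
```

Comparing cv_error across factor subsets is the basic mechanism for flagging significant factors: subsets containing the truly relevant factors yield markedly lower estimated prediction error.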
7.
We consider a regularized D-classification rule for high-dimensional binary classification, which adopts the linear shrinkage estimator of the covariance matrix as an alternative to the sample covariance matrix in the D-classification rule (D-rule for short). We find an asymptotic expression for the misclassification rate of the regularized D-rule when the sample size and the dimension both increase and their ratio approaches a positive constant. In addition, we compare its misclassification rate to that of the standard D-rule under various settings via simulation.
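A sketch of the key construction: when the dimension exceeds the sample size, the pooled sample covariance is singular, but its linear shrinkage toward a scaled identity is always invertible and can be plugged into a Mahalanobis-type distance rule. The exact form of the D-rule and the data-driven choice of shrinkage intensity are assumptions here; the sketch fixes the intensity for simplicity.

```python
import numpy as np

def shrink_cov(S, alpha):
    """Linear shrinkage of a covariance estimate toward a scaled identity,
    (1-alpha)*S + alpha*(tr(S)/p)*I, as in Ledoit-Wolf-type estimators."""
    p = S.shape[0]
    return (1 - alpha) * S + alpha * (np.trace(S) / p) * np.eye(p)

def d_classify(x, m0, m1, Sigma):
    """Assign x to the class with the smaller Mahalanobis-type distance
    (a sketch of a D-classification rule; details assumed)."""
    Si = np.linalg.inv(Sigma)
    d0 = (x - m0) @ Si @ (x - m0)
    d1 = (x - m1) @ Si @ (x - m1)
    return int(d1 < d0)   # 1 if closer to class 1
```

The shrinkage term is what makes the rule well defined in the p > n regime the paper studies.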
8.
9.
The study of regularized learning algorithms associated with the least-squares loss is an important issue. Wu et al. [2006. Learning rates of least-square regularized regression. Found. Comput. Math. 6, 171–192] established fast learning rates of order m^(−θ) for least-squares regularized regression in reproducing kernel Hilbert spaces under some assumptions on the Mercer kernels and on the regression functions, where m denotes the number of samples and θ may be arbitrarily close to 1. They assumed, as in most existing works, that the samples were drawn independently from the underlying probability distribution. However, independence is a very restrictive assumption. Without independence of the samples, the study of learning algorithms is more involved, and little progress has been made. The aim of this paper is to establish the above results of Wu et al. for dependent samples, where the dependence is expressed in terms of an exponentially strongly mixing sequence.
10.
Finding an appropriate low-dimensional representation of complex data is one of the central problems in machine learning and data analysis. In this paper, a nonlinear dimensionality reduction algorithm called regularized Laplacian eigenmaps (RLEM) is proposed, motivated by the method of regularized spectral clustering. The algorithm provides a natural out-of-sample extension for dealing with points not in the original data set. The consistency of the RLEM algorithm is investigated, and a convergence rate is established that depends on the approximation property and the capacity of the reproducing kernel Hilbert space, measured by covering numbers. Experiments are given to illustrate the algorithm.
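For orientation, here is a minimal version of the underlying (unregularized) Laplacian eigenmaps step: heat-kernel weights, normalized graph Laplacian, and an embedding from the eigenvectors of the smallest nonzero eigenvalues. The paper's RLEM additionally regularizes in an RKHS, which is what yields the out-of-sample extension; that part is not sketched here.

```python
import numpy as np

def laplacian_eigenmaps(X, dim=2, sigma=1.0):
    """Basic Laplacian eigenmaps: Gaussian heat-kernel weight matrix,
    symmetric normalized Laplacian L = I - D^{-1/2} W D^{-1/2}, and the
    eigenvectors of the smallest nonzero eigenvalues as coordinates."""
    d2 = ((X[:, None] - X[None]) ** 2).sum(-1)
    W = np.exp(-d2 / (2 * sigma ** 2))
    D = W.sum(1)
    L = np.eye(len(X)) - W / np.sqrt(np.outer(D, D))
    vals, vecs = np.linalg.eigh(L)          # eigenvalues in ascending order
    return vecs[:, 1:dim + 1]               # skip the trivial eigenvector
```

On data with two well-separated clusters, the first embedding coordinate (the Fiedler vector) takes opposite signs on the two clusters, which is the spectral-clustering behavior the paper's regularized variant inherits.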