期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Estimation in a linear regression model under the Kullback–Leibler loss and its application to model selection

Tatsuya Kubokawa Hisayuki Tsukuma 《Journal of statistical planning and inference》2007

This paper is concerned with the problem of constructing a good predictive distribution relative to the Kullback–Leibler information in a linear regression model. The problem is equivalent to the simultaneous estimation of regression coefficients and error variance in terms of a complicated risk, which yields a new challenging issue in a decision-theoretic framework. An estimator of the variance is incorporated here into a loss for estimating the regression coefficients. Several estimators of the variance and of the regression coefficients are proposed and shown to improve on usual benchmark estimators both analytically and numerically. Finally, the prediction problem of a distribution is noted to be related to an information criterion for model selection like the Akaike information criterion (AIC). Thus, several AIC variants are obtained based on proposed and improved estimators and are compared numerically with AIC as model selection procedures. 相似文献

2.

SHRINKAGE, PRETEST AND ABSOLUTE PENALTY ESTIMATORS IN PARTIALLY LINEAR MODELS

S. Ejaz Ahmed Kjell A. Doksum S. Hossain Jinhong You 《Australian & New Zealand Journal of Statistics》2007,49(4):435-454

We consider a partially linear model in which the vector of coefficients β in the linear part can be partitioned as ( β ₁, β ₂) , where β ₁ is the coefficient vector for main effects (e.g. treatment effect, genetic effects) and β ₂ is a vector for ‘nuisance’ effects (e.g. age, laboratory). In this situation, inference about β ₁ may benefit from moving the least squares estimate for the full model in the direction of the least squares estimate without the nuisance variables (Steinian shrinkage), or from dropping the nuisance variables if there is evidence that they do not provide useful information (pretesting). We investigate the asymptotic properties of Stein‐type and pretest semiparametric estimators under quadratic loss and show that, under general conditions, a Stein‐type semiparametric estimator improves on the full model conventional semiparametric least squares estimator. The relative performance of the estimators is examined using asymptotic analysis of quadratic risk functions and it is found that the Stein‐type estimator outperforms the full model estimator uniformly. By contrast, the pretest estimator dominates the least squares estimator only in a small part of the parameter space, which is consistent with the theory. We also consider an absolute penalty‐type estimator for partially linear models and give a Monte Carlo simulation comparison of shrinkage, pretest and the absolute penalty‐type estimators. The comparison shows that the shrinkage method performs better than the absolute penalty‐type estimation method when the dimension of the β ₂ parameter space is large. 相似文献

3.

Focused information criteria for copulas

Vinnie Ko Nils Lid Hjort Ingrid Hobk Haff 《Scandinavian Journal of Statistics》2019,46(4):1117-1140

In this paper, we extend the focused information criterion (FIC) to copula models. Copulas are often used for applications where the joint tail behavior of the variables is of particular interest, and selecting a copula that captures this well is then essential. Traditional model selection methods such as the Akaike information criterion (AIC) and the Bayesian information criterion (BIC) aim at finding the overall best‐fitting model, which is not necessarily the one best suited for the application at hand. The FIC, on the other hand, evaluates and ranks candidate models based on the precision of their point estimates of a context‐given focus parameter. This could be any quantity of particular interest, for example, the mean, a correlation, conditional probabilities, or measures of tail dependence. We derive FIC formulae for the maximum likelihood estimator, the two‐stage maximum likelihood estimator, and the so‐called pseudo‐maximum‐likelihood (PML) estimator combined with parametric margins. Furthermore, we confirm the validity of the AIC formula for the PML estimator combined with parametric margins. To study the numerical behavior of FIC, we have carried out a simulation study, and we have also analyzed a multivariate data set pertaining to abalones. The results from the study show that the FIC successfully ranks candidate models in terms of their performance, defined as how well they estimate the focus parameter. In terms of estimation precision, FIC clearly outperforms AIC, especially when the focus parameter relates to only a specific part of the model, such as the conditional upper‐tail probability. 相似文献

4.

Extending chipman's marl estimator to cases of ignorance in one or more parameter dimensions

Ron C. Mittelhammer Roger K. Conway 《统计学通讯:理论与方法》2013,42(8):1879-1888

In this note we examine the sense in which Chipman's (1964) minimum average risk linear (MARL) estimator can be extended to cases where a prior probability distribution on B in the linear model Y = XB + E is proper only on a set of linear combinations of having a smaller dimension than the dimension of the B parameter space. We define the estimator that can be considered MARL in the class of estimators for which the average risk matrix is defined. The MARL-type estimator then becomes operational in cases where there is ignorance about one or more dimensions of the parameter space. 相似文献

5.

Selection of Variables in Multivariate Regression Models for Large Dimensions

《统计学通讯:理论与方法》2012,41(13-14):2465-2489

The Akaike information criterion, AIC, and Mallows’ C _p statistic have been proposed for selecting a smaller number of regressors in the multivariate regression models with fully unknown covariance matrix. All of these criteria are, however, based on the implicit assumption that the sample size is substantially larger than the dimension of the covariance matrix. To obtain a stable estimator of the covariance matrix, it is required that the dimension of the covariance matrix is much smaller than the sample size. When the dimension is close to the sample size, it is necessary to use ridge-type estimators for the covariance matrix. In this article, we use a ridge-type estimators for the covariance matrix and obtain the modified AIC and modified C _p statistic under the asymptotic theory that both the sample size and the dimension go to infinity. It is numerically shown that these modified procedures perform very well in the sense of selecting the true model in large dimensional cases. 相似文献

6.

More on testing the normality assumptionin the Tobit Model

S. B. Caudill 《Journal of applied statistics》2009,36(12):1345-1352

In a recent volume of this journal, Holden [Testing the normality assumption in the Tobit Model, J. Appl. Stat. 31 (2004) pp. 521–532] presents Monte Carlo evidence comparing several tests for departures from normality in the Tobit Model. This study adds to the work of Holden by considering another test, and several information criteria, for detecting departures from normality in the Tobit Model. The test given here is a modified likelihood ratio statistic based on a partially adaptive estimator of the Censored Regression Model using the approach of Caudill [A partially adaptive estimator for the Censored Regression Model based on a mixture of normal distributions, Working Paper, Department of Economics, Auburn University, 2007]. The information criteria examined include the Akaike’s Information Criterion (AIC), the Consistent AIC (CAIC), the Bayesian information criterion (BIC), and the Akaike’s BIC (ABIC). In terms of fewest ‘rejections’ of a true null, the best performance is exhibited by the CAIC and the BIC, although, like some of the statistics examined by Holden, there are computational difficulties with each. 相似文献

7.

Adaptive nonparametric estimation in the presence of dependence

Nicolas Asin 《Journal of nonparametric statistics》2017,29(4):694-730

We consider nonparametric estimation problems in the presence of dependent data, notably nonparametric regression with random design and nonparametric density estimation. The proposed estimation procedure is based on a dimension reduction. The minimax optimal rate of convergence of the estimator is derived assuming a sufficiently weak dependence characterised by fast decreasing mixing coefficients. We illustrate these results by considering classical smoothness assumptions. However, the proposed estimator requires an optimal choice of a dimension parameter depending on certain characteristics of the function of interest, which are not known in practice. The main issue addressed in our work is an adaptive choice of this dimension parameter combining model selection and Lepski's method. It is inspired by the recent work of Goldenshluger and Lepski [(2011), ‘Bandwidth Selection in Kernel Density Estimation: Oracle Inequalities and Adaptive Minimax Optimality’, The Annals of Statistics, 39, 1608–1632]. We show that this data-driven estimator can attain the lower risk bound up to a constant provided a fast decay of the mixing coefficients. 相似文献

8.

Dimension reduced kernel estimation for distribution function with incomplete data

Hu Z Follmann DA Qin J 《Journal of statistical planning and inference》2011,141(9):3084-3093

This work focuses on the estimation of distribution functions with incomplete data, where the variable of interest Y has ignorable missingness but the covariate X is always observed. When X is high dimensional, parametric approaches to incorporate X—information is encumbered by the risk of model misspecification and nonparametric approaches by the curse of dimensionality. We propose a semiparametric approach, which is developed under a nonparametric kernel regression framework, but with a parametric working index to condense the high dimensional X—information for reduced dimension. This kernel dimension reduction estimator has double robustness to model misspecification and is most efficient if the working index adequately conveys the X—information about the distribution of Y. Numerical studies indicate better performance of the semiparametric estimator over its parametric and nonparametric counterparts. We apply the kernel dimension reduction estimation to an HIV study for the effect of antiretroviral therapy on HIV virologic suppression. 相似文献

9.

Shrinkage and pretest estimators for longitudinal data analysis under partially linear models

S. Hossain S. Ejaz Ahmed Grace Y. Yi B. Chen 《Journal of nonparametric statistics》2016,28(3):531-549

In this paper, we develop marginal analysis methods for longitudinal data under partially linear models. We employ the pretest and shrinkage estimation procedures to estimate the mean response parameters as well as the association parameters, which may be subject to certain restrictions. We provide the analytic expressions for the asymptotic biases and risks of the proposed estimators, and investigate their relative performance to the unrestricted semiparametric least-squares estimator (USLSE). We show that if the dimension of association parameters exceeds two, the risk of the shrinkage estimators is strictly less than that of the USLSE in most of the parameter space. On the other hand, the risk of the pretest estimator depends on the validity of the restrictions of association parameters. A simulation study is conducted to evaluate the performance of the proposed estimators relative to that of the USLSE. A real data example is applied to illustrate the practical usefulness of the proposed estimation procedures. 相似文献

10.

Shrinkage and penalized estimators in weighted least absolute deviations regression models

M. Norouzirad M. Arashi 《Journal of Statistical Computation and Simulation》2018,88(8):1557-1575

In this paper, we consider the estimation problem of the weighted least absolute deviation (WLAD) regression parameter vector when there are some outliers or heavy-tailed errors in the response and the leverage points in the predictors. We propose the pretest and James–Stein shrinkage WLAD estimators when some of the parameters may be subject to certain restrictions. We derive the asymptotic risk of the pretest and shrinkage WLAD estimators and show that if the shrinkage dimension exceeds two, the asymptotic risk of the shrinkage WLAD estimator is strictly less than the unrestricted WLAD estimator. On the other hand, the risk of the pretest WLAD estimator depends on the validity of the restrictions on the parameters. Furthermore, we study the WLAD absolute shrinkage and selection operator (WLAD-LASSO) and compare its relative performance with the pretest and shrinkage WLAD estimators. A simulation study is conducted to evaluate the performance of the proposed estimators relative to that of the unrestricted WLAD estimator. A real-life data example using body fat study is used to illustrate the performance of the suggested estimators. 相似文献

11.

Adaptive nonparametric empirical Bayes estimation via wavelet series: The minimax study

Rida Benhaddou Marianna Pensky 《Journal of statistical planning and inference》2013

In the present paper, we derive lower bounds for the risk of the nonparametric empirical Bayes estimators. In order to attain the optimal convergence rate, we propose generalization of the linear empirical Bayes estimation method which takes advantage of the flexibility of the wavelet techniques. We present an empirical Bayes estimator as a wavelet series expansion and estimate coefficients by minimizing the prior risk of the estimator. As a result, estimation of wavelet coefficients requires solution of a well-posed low-dimensional sparse system of linear equations. The dimension of the system depends on the size of wavelet support and smoothness of the Bayes estimator. An adaptive choice of the resolution level is carried out using Lepski et al. (1997) method. The method is computationally efficient and provides asymptotically optimal adaptive EB estimators. The theory is supplemented by numerous examples. 相似文献

12.

The exact risk performance of a pre-test estimator in a heteroskedastic linear regression model under the balanced loss function

Kazuhiro Ohtani David E. A. Giles Judith A. Giles 《Econometric Reviews》1997,16(1):119-130

We examine the risk of a pre-test estimator for regression coefficients after a pre-test for homoskedasticity under the Balanced Loss Function (BLF). We show analytically that the two stage Aitken estimator is dominated by the pre-test estimator with the critical value of unity, even if the BLF is used. We also show numerically that both the two stage Aitken estimator and the pre-test estimator can be dominated by the ordinary least squares estimator when “goodness of fit” is regarded as more important than precision of estimation. 相似文献

13.

Limit expressions for the risk of james-stein estimators

George Casella Jiunn Tzon Hwang 《Revue canadienne de statistique》1982,10(4):305-309

Limit expressions (as the dimension p ← ∞ ) are derived for the relative risk of the James-Stein estimator and its positive-part version. The limit is simple to evaluate, and gives the amount of improvement in risk that is possible. The technique used is to bound the risk, both above and below. with bounds that converge to the same limit. For the James-Stein estimator these bounds are simple to calculate, and are quite accurate even for moderate dimensions. 相似文献

14.

Shrinkage estimation of location parameters in a multivariate skew-normal distribution

Tatsuya Kubokawa William E. Strawderman Ryota Yuasa 《统计学通讯:理论与方法》2020,49(8):2008-2024

Abstract

This paper studies decision theoretic properties of Stein type shrinkage estimators in simultaneous estimation of location parameters in a multivariate skew-normal distribution with known skewness parameters under a quadratic loss. The benchmark estimator is the best location equivariant estimator which is minimax. A class of shrinkage estimators improving on the best location equivariant estimator is constructed when the dimension of the location parameters is larger than or equal to four. An empirical Bayes estimator is also derived, and motivated from the Bayesian procedure, we suggest a simple skew-adjusted shrinkage estimator and show its dominance property. The performances of these estimators are investigated by simulation. 相似文献

15.

Improved estimation of the mean vector for student-t model

Shahjahan Khan 《统计学通讯:理论与方法》2013,42(3):507-527

Improved James-Stein type estimation of the mean vector μ of a multovaroate Student-t population of dimension p with ν degrees of freedom is considered. In addition to the sample data, uncertain prior information on the value of the mean vector, in the form of a null hypothesis, is used for the estiamtion. The usual maximum liklihood estimator((mle) of μ is obtained and a test statistic for testing H₀:μ=μ₀ is derived. Based on the mle of μ and the tes statistic the preliminary test estimator (PTE), Stein-type shrinkage estimator (SE) and positive-rule shrinkage esiimator (PRSE) are defined. The bias and the quadratic risk of the estimators are evaiuated. The relative performances of the estimators are mvestigated by analyzing the risks under different condltlons It is observed that the FRSE dommates over he other three estimators, regardless of the vaiidity of the null hypothesis and the value ν. 相似文献

16.

Article: 2

Yingcun Xia Howell Tong W. K. Li Li-Xing Zhu 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2002,64(3):363-410

Summary. Searching for an effective dimension reduction space is an important problem in regression, especially for high dimensional data. We propose an adaptive approach based on semiparametric models, which we call the (conditional) minimum average variance estimation (MAVE) method, within quite a general setting. The MAVE method has the following advantages. Most existing methods must undersmooth the nonparametric link function estimator to achieve a faster rate of consistency for the estimator of the parameters (than for that of the nonparametric function). In contrast, a faster consistency rate can be achieved by the MAVE method even without undersmoothing the nonparametric link function estimator. The MAVE method is applicable to a wide range of models, with fewer restrictions on the distribution of the covariates, to the extent that even time series can be included. Because of the faster rate of consistency for the parameter estimators, it is possible for us to estimate the dimension of the space consistently. The relationship of the MAVE method with other methods is also investigated. In particular, a simple outer product gradient estimator is proposed as an initial estimator. In addition to theoretical results, we demonstrate the efficacy of the MAVE method for high dimensional data sets through simulation. Two real data sets are analysed by using the MAVE approach. 相似文献

17.

Alternative estimators of the common regression matrix in two GMANOVA models under weighted quadratic losses

《Journal of statistical planning and inference》2006,136(4):1331-1348

We consider the problem of estimating the common regression matrix of two GMANOVA models with different unknown covariance matrices under certain type of loss functions which include a weighted quadratic loss function as a special case. We consider a class of estimators, which contains the Graybill–Deal-type estimator proposed by Sugiura and Kubokawa (Ann. Inst. Statist. Math. 40 (1988) 119), and we give its risk representation via Kubokawa and Srivastava's (Ann. Statist. 27 (1999) 600; J. Multivariate Anal. 76 (2001) 138) identities when the error matrices follow the elliptically contoured distributions. Using the method similar to an approximate minimization of the unbiased risk estimate due to Stein (Studies in the Statistical Theory of Estimation, vol. 74, Nauka, Leningrad, 1977, p. 4), we obtain an alternative estimator to the Graybill–Deal-type estimator which was given under the normality assumption. However, it seems difficult to evaluate the risk of our proposed estimator analytically because of complex nature of its risk function. Instead, we conduct a Monte-Carlo simulation to evaluate the performance of our proposed estimator. The results indicate that our proposed estimator compares favorably with the Graybill–Deal-type estimator. 相似文献

18.

A formal bayes multiple shrinkage estimator

Edward I. George 《统计学通讯:理论与方法》2013,42(7):2099-2114

A new minimax multiple shrinkage estimator is constructed. This estimator which can adaptively shrink towards many subspace targets, is formal Bayes with respect to a mixture of harmonic priors. Unbiased estimates of risk and simulation results suggest that the risk properties of this estimator are very similar to those of the multiple shrinkage Stein estimator proposed by George (1986a). A special case is seen to be admissible. 相似文献

19.

Dynamic factor analysis for short panels: estimating performance trajectories for water utilities

Nikolaos Zirogiannis Yorghos Tripodis 《Statistical Methods and Applications》2018,27(1):131-150

We develop a novel estimation algorithm for a dynamic factor model (DFM) applied to panel data with a short time dimension and a large cross sectional dimension. Current DFMs usually require panels with a minimum of 20 years of quarterly data (80 time observations per panel). In contrast, the application we consider includes panels with a median of 8 annual observations. As a result, the time dimension in our paper is substantially shorter than previous work in the DFM literature. This difference increases the computational challenges of the estimation process which we address by developing the “Two-Cycle Conditional Expectation - Maximization” (2CCEM) algorithm which is a variant of the EM algorithm and its extensions. We analyze the conditions under which our model is identified and provide simulation results demonstrating consistency of our 2CCEM estimator. We apply the DFM to a dataset of 802 water and sanitation utilities from 43 countries and use the 2CCEM algorithm in order to estimate dynamic performance trajectories for each utility. 相似文献

20.

Adaptive-Cox model averaging for right-censored data

Yu-Mei Chang Pao-Sheng Shen 《统计学通讯:理论与方法》2017,46(19):9364-9376

In medical studies, Cox proportional hazards model is a commonly used method to deal with the right-censored survival data accompanied by many explanatory covariates. In practice, the Akaike's information criterion (AIC) or the Bayesian information criterion (BIC) is usually used to select an appropriate subset of covariates. It is well known that neither the AIC criterion nor the BIC criterion dominates for all situations. In this paper, we propose an adaptive-Cox model averaging procedure to get a more robust hazard estimator. First, by applying AIC and BIC criteria to perturbed datasets, we obtain two model averaging (MA) estimated survival curves, called AIC-MA and BIC-MA. Then, based on Kullback–Leibler loss, a better estimate of survival curve between AIC-MA and BIC-MA is chosen, which results in an adaptive-Cox estimate of survival curve. Simulation results show the superiority of our approach and an application of the proposed method is also presented by analyzing the German Breast Cancer Study dataset. 相似文献