
1.

Bayesian generalized fused lasso modeling via NEG distribution

Kaito Shimamura Masao Ueki Sadanori Konishi 《Communications in Statistics - Theory and Methods》2019,48(16):4132-4153

The fused lasso penalizes a loss function by the L₁ norm of both the regression coefficients and their successive differences to encourage sparsity of both. In this paper, we propose Bayesian generalized fused lasso modeling based on a normal-exponential-gamma (NEG) prior distribution. The NEG prior is placed on the differences of successive regression coefficients. The proposed method enables us to construct a more versatile sparse model than the ordinary fused lasso by using a flexible regularization term. Simulation studies and real data analyses show that the proposed method has superior performance to the ordinary fused lasso.
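The penalty structure described in the first sentence can be sketched as follows. This is only the ordinary fused lasso objective, not the NEG-based Bayesian model of the paper; the squared-error loss, the factor 1/2, and the names `fused_lasso_objective`, `lam1` and `lam2` are assumptions made for illustration.

```python
def fused_lasso_objective(y, X, beta, lam1, lam2):
    """Squared-error loss plus L1 penalties on beta and on its successive differences."""
    rss = 0.0
    for row, target in zip(X, y):
        fitted = sum(x_j * b_j for x_j, b_j in zip(row, beta))
        rss += (target - fitted) ** 2
    # L1 norm of the coefficients encourages individual coefficients to be zero.
    sparsity = sum(abs(b) for b in beta)
    # L1 norm of successive differences encourages neighbouring coefficients to be equal.
    fusion = sum(abs(beta[j] - beta[j - 1]) for j in range(1, len(beta)))
    return 0.5 * rss + lam1 * sparsity + lam2 * fusion
```

With `lam2 = 0` the expression reduces to the usual lasso objective, so the second tuning parameter is what controls how strongly adjacent coefficients are pulled together.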

2.

Rejoinder to 'Ahmed, M.S. (1998). A note on regression-type estimators using multiple auxiliary information.'

Rahul Mukerjee T.J. Rao & K. Vijayan 《Australian & New Zealand Journal of Statistics》2000,42(2):245-245

In the estimators t₃, t₄, t₅ of Mukerjee, Rao & Vijayan (1987), b_{yx} and b_{yz} are partial regression coefficients of y on x and z, respectively, based on the smaller sample. With the above interpretation of b_{yx} and b_{yz} in t₃, t₄, t₅, all the calculations in Mukerjee et al. (1987) are correct. In this connection, we also wish to make it explicit that b_{xz} in t₅ is an ordinary and not a partial regression coefficient. The 'corrected' MSEs of t₃, t₄, t₅, as given in Ahmed (1998, Section 3), are computed assuming that our b_{yx} and b_{yz} are ordinary and not partial regression coefficients. Indeed, we had no intention of giving estimators using the corresponding ordinary regression coefficients, which would lead to estimators inferior to those given by Kiregyera (1984). We accept responsibility for any notational confusion created by us and express regret to readers who have been confused by our notation. Finally, in consideration of the above, it may be noted that Tripathi & Ahmed's (1995) estimator t₀, quoted also in Ahmed (1998), is no better than t₅ of Mukerjee et al. (1987).

3.

IMPROVING UPON THE BEST INVARIANT ESTIMATOR IN MULTIVARIATE LOCATION PROBLEMS

Madan L. Puri Dan A. Ralescu 《Australian & New Zealand Journal of Statistics》1983,25(3):453-462

We are concerned with estimators which improve upon the best invariant estimator in estimating a location parameter θ. If the loss function is L(θ − a) with L convex, we give sufficient conditions for the inadmissibility of δ₀(X) = X. If the loss is a weighted sum of squared errors, we find various classes of estimators δ which are better than δ₀. In general, δ is the convolution of δ₁ (an estimator which improves upon δ₀ outside of a compact set) with a suitable probability density in R^p. The critical dimension of inadmissibility depends on the estimator δ₁. We also give several examples of estimators δ obtained in this way and state some open problems.

4.

Tilting methods for assessing the influence of components in a classifier

Peter Hall D. M. Titterington Jing-Hao Xue 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2009,71(4):783-803

Summary. Many contemporary classifiers are constructed to provide good performance for very high dimensional data. However, an issue that is at least as important as good classification is determining which of the many potential variables provide key information for good decisions. Responding to this issue can help us to determine which aspects of the data-generating mechanism (e.g. which genes in a genomic study) are of greatest importance in terms of distinguishing between populations. We introduce tilting methods for addressing this problem. We apply weights to the components of data vectors, rather than to the data vectors themselves (as is commonly the case in related work). In addition we tilt in a way that is governed by L₂-distance between weight vectors, rather than by the more commonly used Kullback–Leibler distance. It is shown that this approach, together with the added constraint that the weights should be non-negative, produces an algorithm which eliminates vector components that have little influence on the classification decision. In particular, use of the L₂-distance in this problem produces properties that are reminiscent of those that arise when L₁-penalties are employed to eliminate explanatory variables in very high dimensional prediction problems, e.g. those involving the lasso. We introduce techniques that can be implemented very rapidly, and we show how to use bootstrap methods to assess the accuracy of our variable ranking and variable elimination procedures.

5.

Likelihood Ratio Tests Under Local and Fixed Alternatives in Monotone Function Problems

MOULINATH BANERJEE 《Scandinavian Journal of Statistics》2005,32(4):507-525

Abstract. We focus on a class of non-standard problems involving non-parametric estimation of a monotone function that is characterized by n^{1/3} rate of convergence of the maximum likelihood estimator, non-Gaussian limit distributions and the non-existence of √n-regular estimators. We have shown elsewhere that under a null hypothesis of the type ψ(z₀) = θ₀ (ψ being the monotone function of interest) in non-standard problems of the above kind, the likelihood ratio statistic has a 'universal' limit distribution that is free of the underlying parameters in the model. In this paper, we illustrate its limiting behaviour under local alternatives of the form ψ_n(z), where ψ_n(·) and ψ(·) vary in O(n^{−1/3}) neighbourhoods around z₀ and ψ_n converges to ψ at rate n^{1/3} in an appropriate metric. Apart from local alternatives, we also consider the behaviour of the likelihood ratio statistic under fixed alternatives and establish the convergence in probability of an appropriately scaled version of the same to a constant involving a Kullback–Leibler distance.

6.

Ordering of Sequentially Sampled Exponential Experiments

Eitan Greenshtein & Erik Torgersen 《Scandinavian Journal of Statistics》1998,25(2):325-329

Let X₁, X₂, ... be a sequence of i.i.d. random variables, X_i ∼ F_θ, θ ∈ Θ. Let N₁ and N₂ be two stopping rules. For a class of exponential families {F_θ: θ ∈ Θ} we show that the experiment Y₁ = (X₁, ..., X_{N₁}) carries more statistical information than Y₂ = (X₁, ..., X_{N₂}) only if N₁ is stochastically larger than N₂.

7.

Semiparametric Likelihood Based Method for Goodness of Fit Tests and Estimation in Upgraded Mixture Models

Jing Qin 《Scandinavian Journal of Statistics》1998,25(4):681-691

We use Owen's (1988, 1990) empirical likelihood method in upgraded mixture models. Two groups of independent observations are available. One is z₁, ..., z_n, which is observed directly from a distribution F(z). The other one is x₁, ..., x_m, which is observed indirectly from F(z), where the x_i's have density ∫ p(x | z) dF(z) and p(x | z) is a conditional density function. We are interested in testing H₀: p(x | z) = p(x | z; θ), for some specified smooth density function. A semiparametric likelihood ratio based statistic is proposed and it is shown that it converges to a chi-squared distribution. This is a simple method for doing goodness of fit tests, especially when x is a discrete variable with finitely many values. In addition, we discuss estimation of θ and F(z) when H₀ is true. The connection between upgraded mixture models and general estimating equations is pointed out.

8.

Empirical Likelihood for Non-Smooth Criterion Functions

ELISA M. MOLANES LOPEZ INGRID VAN KEILEGOM NOËL VERAVERBEKE 《Scandinavian Journal of Statistics》2009,36(3):413-432

Abstract. Suppose that X₁, …, X_n is a sequence of independent random vectors, identically distributed as a d-dimensional random vector X. Let μ₀ be a parameter of interest and ν₀ be some nuisance parameter. The unknown, true parameters (μ₀, ν₀) are uniquely determined by the system of equations E{g(X, μ₀, ν₀)} = 0, where g = (g₁, …, g_{p+q}) is a vector of p + q functions. In this paper we develop an empirical likelihood (EL) method to do inference for the parameter μ₀. The results in this paper are valid under very mild conditions on the vector of criterion functions g. In particular, we do not require that g₁, …, g_{p+q} are smooth in μ or ν. This offers the advantage that the criterion function may involve indicators, which are encountered when considering, e.g., differences of quantiles, copulas and ROC curves, to mention just a few examples. We prove the asymptotic limit of the empirical log-likelihood ratio, and carry out a small simulation study to test the performance of the proposed EL method for small samples.

9.

Exact Slopes of Test Statistics for the Multivariate Exponential Family

Gie-Whan Kim 《Scandinavian Journal of Statistics》1997,24(3):387-406

The objective of this paper is to investigate exact slopes of test statistics {T_n} when the random vectors X₁, ..., X_n are distributed according to an unknown member of an exponential family {P_θ; θ ∈ Ω}. Here Ω is a parameter set. We will be concerned with the hypothesis testing problem of H₀: θ ∈ Ω₀ vs H₁: θ ∉ Ω₀, where Ω₀ is a subset of Ω. It will be shown that for an important class of problems and test statistics the exact slope of {T_n} at η in Ω − Ω₀ is determined by the shortest Kullback–Leibler distance from {θ: T_n(λ(θ)) = T_n(λ(η))} to Ω₀, where λ(θ) = E_θ(X).

10.

Nonparametric multistep-ahead prediction in time series analysis

Rong Chen Lijian Yang Christian Hafner 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2004,66(3):669-686

Summary. We consider the problem of multistep-ahead prediction in time series analysis by using nonparametric smoothing techniques. Forecasting is always one of the main objectives in time series analysis. Research has shown that non-linear time series models have certain advantages in multistep-ahead forecasting. Traditionally, nonparametric k-step-ahead least squares prediction for non-linear autoregressive AR(d) models is done by estimating E(X_{t+k} | X_t, …, X_{t−d+1}) via nonparametric smoothing of X_{t+k} on (X_t, …, X_{t−d+1}) directly. We propose a multistage nonparametric predictor. We show that the new predictor has smaller asymptotic mean-squared error than the direct smoother, though the convergence rate is the same. Hence, the predictor proposed is more efficient. Some simulation results, advice for practical bandwidth selection and a real data example are provided.
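The traditional direct smoother mentioned in the abstract can be sketched for the simplest case d = 1. This is an illustration of the baseline only, not the multistage predictor proposed in the paper; the Nadaraya-Watson form, the Gaussian kernel and the name `direct_k_step_forecast` are assumptions chosen for the example.

```python
import math

def direct_k_step_forecast(series, k, x0, bandwidth):
    """Nadaraya-Watson estimate of E(X_{t+k} | X_t = x0) for lag order d = 1."""
    num = 0.0
    den = 0.0
    for t in range(len(series) - k):
        # Gaussian kernel weight: pairs whose X_t is close to x0 count most.
        u = (series[t] - x0) / bandwidth
        weight = math.exp(-0.5 * u * u)
        num += weight * series[t + k]
        den += weight
    if den == 0.0:
        raise ValueError("no kernel mass near x0; increase the bandwidth")
    return num / den
```

The estimate is a weighted average of the observed values k steps after each time point whose value lies near `x0`, so the bandwidth controls the usual trade-off between bias and variance.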

11.

Detecting changes in the mean of functional observations

István Berkes Robertas Gabrys Lajos Horváth Piotr Kokoszka 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2009,71(5):927-946

Summary. Principal component analysis has become a fundamental tool of functional data analysis. It represents the functional data as X_i(t) = μ(t) + Σ_{1≤l<∞} η_{i,l} v_l(t), where μ is the common mean, v_l are the eigenfunctions of the covariance operator and the η_{i,l} are the scores. Inferential procedures assume that the mean function μ(t) is the same for all values of i. If, in fact, the observations do not come from one population, but rather their mean changes at some point(s), the results of principal component analysis are confounded by the change(s). It is therefore important to develop a methodology to test the assumption of a common functional mean. We develop such a test using quantities which can be readily computed in the R package fda. The null distribution of the test statistic is asymptotically pivotal with a well-known asymptotic distribution. The asymptotic test has excellent finite sample performance. Its application is illustrated on temperature data from England.

12.

On the robustness of the generalized fused lasso to prior specifications

Vivian Viallon Sophie Lambert-Lacroix Hölger Hoefling Franck Picard 《Statistics and Computing》2016,26(1-2):285-301

Using networks as prior knowledge to guide model selection is a way to reach structured sparsity. In particular, the fused lasso, which was originally designed to penalize differences of coefficients corresponding to successive features, has been generalized to handle features whose effects are structured according to a given network. As with any prior information, the network provided in the penalty may contain misleading edges that connect coefficients whose difference is not zero, and the extent to which the performance of the method depends on the suitability of the graph has never been clearly assessed. In this work we investigate the theoretical and empirical properties of the adaptive generalized fused lasso in the context of generalized linear models. In the fixed p setting, we show that, asymptotically, adding misleading edges in the graph does not prevent the adaptive generalized fused lasso from enjoying asymptotic oracle properties, while forgetting suitable edges can be more problematic. These theoretical results are complemented by an extensive simulation study that assesses the robustness of the adaptive generalized fused lasso against misspecification of the network as well as its applicability when theoretical coefficients are not exactly equal. Our contribution is also to evaluate the applicability of the generalized fused lasso for the joint modeling of multiple sparse regression functions. Illustrations are provided on two real data examples.

13.

Estimating smooth monotone functions

J. O. Ramsay 《Journal of the Royal Statistical Society. Series B, Statistical methodology》1998,60(2):365-375

Many situations call for a smooth strictly monotone function f of arbitrary flexibility. The family of functions defined by the differential equation D²f = w Df, where w is an unconstrained coefficient function, comprises the strictly monotone twice differentiable functions. The solution to this equation is f = C₀ + C₁ D⁻¹{exp(D⁻¹w)}, where C₀ and C₁ are arbitrary constants and D⁻¹ is the partial integration operator. A basis for expanding w is suggested that permits explicit integration in the expression of f. In fitting data, it is also useful to regularize f by penalizing the integral of w², since this is a measure of the relative curvature in f. Applications are discussed to monotone nonparametric regression, to the transformation of the dependent variable in non-linear regression and to density estimation.
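The closed-form solution above can be evaluated numerically as a quick check. The sketch below assumes that D⁻¹ denotes integration from 0 and uses the trapezoid rule on an equally spaced grid; the function name `monotone_from_w` and its parameters are hypothetical, and the basis expansion for w suggested in the paper is not used here.

```python
import math

def monotone_from_w(w, x_max, n_steps=1000, c0=0.0, c1=1.0):
    """Return grid xs and f(xs) = c0 + c1 * int_0^x exp(int_0^u w(v) dv) du."""
    h = x_max / n_steps
    xs = [i * h for i in range(n_steps + 1)]
    # Inner integral W(u) = int_0^u w(v) dv by the trapezoid rule.
    inner = [0.0]
    for i in range(1, n_steps + 1):
        inner.append(inner[-1] + 0.5 * h * (w(xs[i - 1]) + w(xs[i])))
    # Outer integral of exp(W(u)); the integrand is positive whatever w is.
    g = [math.exp(v) for v in inner]
    fs = [c0]
    for i in range(1, n_steps + 1):
        fs.append(fs[-1] + c1 * 0.5 * h * (g[i - 1] + g[i]))
    return xs, fs
```

Because exp(·) is always positive, f is strictly increasing for any c1 > 0 regardless of the sign or shape of w, which is why w can be left unconstrained. For the constant choice w ≡ 1 with c0 = 0 and c1 = 1 the construction gives f(x) = eˣ − 1.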

14.

Computation of the Generalized F Distribution

Charles F. Dunkl & Donald E. Ramirez 《Australian & New Zealand Journal of Statistics》2001,43(1):21-31

Exact expressions for the cumulative distribution function of a random variable of the form (α₁X₁ + α₂X₂)/Y are given, where X₁, X₂ and Y are independent chi-squared random variables. The expressions are applied to the detection of joint outliers and Hotelling's mis-specified T² distribution.

15.

Penalized Projection Estimator for Volatility Density

F. COMTE V. GENON-CATALOT 《Scandinavian Journal of Statistics》2006,33(4):875-893

Abstract. In this paper, we consider a stochastic volatility model (Y_t, V_t), where the volatility (V_t) is a positive stationary Markov process. We assume that (ln V_t) admits a stationary density f that we want to estimate. Only the price process Y_t is observed at n discrete times with regular sampling interval Δ. We propose a non-parametric estimator for f obtained by a penalized projection method. Under mixing assumptions on (V_t), we derive bounds for the quadratic risk of the estimator. Assuming that Δ = Δ_n tends to 0 while the number of observations and the length of the observation time tend to infinity, we discuss the rate of convergence of the risk. Examples of models included in this framework are given.

16.

Estimation and testing stationarity for double-autoregressive models

Shiqing Ling 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2004,66(1):63-78

Summary. The paper considers the double-autoregressive model y_t = φ y_{t−1} + ɛ_t with ɛ_t = η_t √(ω + α y²_{t−1}). Consistency and asymptotic normality of the estimated parameters are proved under the condition E ln|φ + √α η_t| < 0, which includes the cases with |φ| = 1 or |φ| > 1 as well as E(ɛ_t²) = ∞. It is well known that all kinds of estimators of φ in these cases are not normal when ɛ_t are independent and identically distributed. Our result is novel and surprising. Two tests are proposed for testing stationarity of the model and their asymptotic distributions are shown to be a function of bivariate Brownian motions. Critical values of the tests are tabulated and some simulation results are reported. An application to the US 90-day treasury bill rate series is given.
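The condition E ln|φ + √α η_t| < 0 can be explored by simulation. The sketch below assumes the usual double-autoregressive specification ɛ_t = η_t √(ω + α y²_{t−1}) with standard normal η_t; the normality of η_t, the starting value y₀ = 0 and the function names `log_moment` and `simulate_dar` are assumptions made for illustration, not details taken from the abstract.

```python
import math
import random

def log_moment(phi, alpha, n_draws=20000, seed=0):
    """Monte Carlo estimate of E ln|phi + sqrt(alpha) * eta| with eta ~ N(0, 1)."""
    rng = random.Random(seed)
    root_alpha = math.sqrt(alpha)
    total = 0.0
    for _ in range(n_draws):
        total += math.log(abs(phi + root_alpha * rng.gauss(0.0, 1.0)))
    return total / n_draws

def simulate_dar(phi, omega, alpha, n, seed=0):
    """Simulate y_t = phi*y_{t-1} + eta_t*sqrt(omega + alpha*y_{t-1}^2) from y_0 = 0."""
    rng = random.Random(seed)
    path = [0.0]
    for _ in range(n):
        prev = path[-1]
        # Conditional standard deviation depends on the previous level.
        scale = math.sqrt(omega + alpha * prev * prev)
        path.append(phi * prev + rng.gauss(0.0, 1.0) * scale)
    return path
```

Under these assumptions the estimated log moment is negative for φ = 1 and α = 1, which illustrates how a unit-root value of φ can still fall inside the region covered by the condition, while a case such as φ = 3 with α = 0.01 gives a positive value.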

17.

Goodness-of-fit Tests for Semi-Markov and Markov Survival Models with One Intermediate State

I-Shou Chang Yuan-Chuan Chuang & Chao A. Hsiung 《Scandinavian Journal of Statistics》2001,28(3):505-525

Survival data with one intermediate state are described by semi-Markov and Markov models for counting processes whose intensities are defined in terms of two stopping times T₁ < T₂. Problems of goodness-of-fit for these models are studied. The test statistics are proposed by comparing Nelson–Aalen estimators for data stratified according to T₁. Asymptotic distributions of these statistics are established in terms of the weak convergence of some random fields. Asymptotic consistency of these test statistics is also established. Simulation studies are included to indicate their numerical performance.

18.

Prior elicitation, variable selection and Bayesian computation for logistic regression models

M.-H. Chen J. G. Ibrahim & C. Yiannoutsos 《Journal of the Royal Statistical Society. Series B, Statistical methodology》1999,61(1):223-242

Bayesian selection of variables is often difficult to carry out because of the challenge in specifying prior distributions for the regression parameters for all possible models, specifying a prior distribution on the model space and computations. We address these three issues for the logistic regression model. For the first, we propose an informative prior distribution for variable selection. Several theoretical and computational properties of the prior are derived and illustrated with several examples. For the second, we propose a method for specifying an informative prior on the model space, and for the third we propose novel methods for computing the marginal distribution of the data. The new computational algorithms only require Gibbs samples from the full model to facilitate the computation of the prior and posterior model probabilities for all possible models. Several properties of the algorithms are also derived. The prior specification for the first challenge focuses on the observables in that the elicitation is based on a prior prediction y₀ for the response vector and a quantity a₀ quantifying the uncertainty in y₀. Then, y₀ and a₀ are used to specify a prior for the regression coefficients semi-automatically. Examples using real data are given to demonstrate the methodology.

19.

Estimation of Diffusion Processes by Simulated Moment Methods

Emmanuelle Clement 《Scandinavian Journal of Statistics》1997,24(3):353-369

We consider the parameter estimation of a diffusion process and we suppose that the trend and the diffusion coefficient depend on the parameter θ. The process is observed at times (t_i)_{i=0,...,n} with Δ = t_{i+1} − t_i fixed and we propose here to estimate θ by simulated moment methods.

20.

L_p-Estimators as Estimates of a Parameter of Location for a Sharp-pointed Symmetric Density

Miguel A. Arcones 《Scandinavian Journal of Statistics》1998,25(4):693-715

We study the asymptotics of L_p estimators, p > 0, over a sample having a symmetric density with a sharp point at the centre of symmetry of the distribution. The rates of convergence of the L_p estimators in this situation depend on p and on the shape of the density. To obtain some of the limit distributions, we present new results in the asymptotics of M-estimators. We extend the delta method to the case when the Euclidean norm of the conveniently normalized M-estimators converges to a power of the Euclidean norm of a (possibly Gaussian) stable distribution.