1.

Goodness-of-fit Tests Based on the Kernel Density Estimator

RICARDO CAO GÁBOR LUGOSI 《Scandinavian Journal of Statistics》2005,32(4):599-616

Abstract. Given an i.i.d. sample drawn from a density f on the real line, the problem of testing whether f is in a given class of densities is considered. Testing procedures constructed on the basis of minimizing the L₁-distance between a kernel density estimate and any density in the hypothesized class are investigated. General non-asymptotic bounds are derived for the power of the test. It is shown that the concentration of the data-dependent smoothing factor and the 'size' of the hypothesized class of densities play a key role in the performance of the test. Consistency and non-asymptotic performance bounds are established in several special cases, including testing simple hypotheses, translation/scale classes and symmetry. Simulations are also carried out to compare the behaviour of the method with the Kolmogorov-Smirnov test and an L₂ density-based approach due to Fan [Econ. Theory 10 (1994) 316].
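The quantity at the centre of this abstract, the L₁-distance between a kernel density estimate and a hypothesized density, can be illustrated with a minimal sketch. It is not the authors' testing procedure: the Gaussian kernel, the rule-of-thumb bandwidth, the integration grid and all function names are assumptions made for illustration, and no minimization over a class of densities or critical value is computed.

```python
import math
import random


def gaussian_kde(sample, bandwidth):
    """Return a function x -> kernel density estimate with a Gaussian kernel."""
    n = len(sample)
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))

    def f_hat(x):
        return norm * sum(math.exp(-0.5 * ((x - xi) / bandwidth) ** 2) for xi in sample)

    return f_hat


def normal_pdf(mean, sd):
    """Return the density of N(mean, sd^2) as a function."""
    def pdf(x):
        return math.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))

    return pdf


def l1_distance(f, g, lo, hi, steps=2000):
    """Approximate the integral of |f - g| over [lo, hi] with the trapezoidal rule."""
    width = (hi - lo) / steps
    values = [abs(f(lo + i * width) - g(lo + i * width)) for i in range(steps + 1)]
    return width * (sum(values) - 0.5 * (values[0] + values[-1]))


random.seed(0)
sample = [random.gauss(0.0, 1.0) for _ in range(200)]

# Rule-of-thumb bandwidth (an assumption; the paper studies data-dependent choices).
mean = sum(sample) / len(sample)
sd = math.sqrt(sum((x - mean) ** 2 for x in sample) / (len(sample) - 1))
h = 1.06 * sd * len(sample) ** (-0.2)

f_hat = gaussian_kde(sample, h)
# Distance to the density that generated the data, and to a shifted alternative.
d_true = l1_distance(f_hat, normal_pdf(0.0, 1.0), -10.0, 13.0)
d_shifted = l1_distance(f_hat, normal_pdf(3.0, 1.0), -10.0, 13.0)
print(d_true, d_shifted)
```

A small distance to the hypothesized density is evidence in favour of the hypothesis and a large one is evidence against it. A test in the spirit of the abstract would additionally minimize the distance over the hypothesized class and compare the result with a threshold.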

2.

Empirical Likelihood for Non-Smooth Criterion Functions

ELISA M. MOLANES LOPEZ INGRID VAN KEILEGOM NOËL VERAVERBEKE 《Scandinavian Journal of Statistics》2009,36(3):413-432

Abstract. Suppose that X₁,…, X_n is a sequence of independent random vectors, identically distributed as a d-dimensional random vector X. Let μ be a parameter of interest and ν be some nuisance parameter. The unknown, true parameters (μ₀, ν₀) are uniquely determined by the system of equations E{g(X, μ₀, ν₀)} = 0, where g = (g₁,…, g_{p+q}) is a vector of p + q functions. In this paper we develop an empirical likelihood (EL) method to do inference for the parameter μ₀. The results in this paper are valid under very mild conditions on the vector of criterion functions g. In particular, we do not require that g₁,…, g_{p+q} are smooth in μ or ν. This offers the advantage that the criterion function may involve indicators, which are encountered when considering, e.g. differences of quantiles, copulas, ROC curves, to mention just a few examples. We prove the asymptotic limit of the empirical log-likelihood ratio, and carry out a small simulation study to test the performance of the proposed EL method for small samples.

3.

Exact Slopes of Test Statistics for the Multivariate Exponential Family

Gie-Whan Kim 《Scandinavian Journal of Statistics》1997,24(3):387-406

The objective of this paper is to investigate exact slopes of test statistics {T_n} when the random vectors X₁, ..., X_n are distributed according to an unknown member of an exponential family {P_θ; θ∈Ω}. Here Ω is a parameter set. We will be concerned with the hypothesis testing problem of H₀: θ∈Ω₀ vs H₁: θ∉Ω₀, where Ω₀ is a subset of Ω. It will be shown that for an important class of problems and test statistics the exact slope of {T_n} at η in Ω−Ω₀ is determined by the shortest Kullback–Leibler distance from {θ: T_n(λ(θ)) = T_n(λ(π))} to Ω₀, where λ(θ) = E_θ(X).

4.

Detecting changes in the mean of functional observations

István Berkes Robertas Gabrys Lajos Horváth Piotr Kokoszka 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2009,71(5):927-946

Summary. Principal component analysis has become a fundamental tool of functional data analysis. It represents the functional data as X_i(t) = μ(t) + Σ_{1≤l<∞} η_{i,l} v_l(t), where μ is the common mean, v_l are the eigenfunctions of the covariance operator and the η_{i,l} are the scores. Inferential procedures assume that the mean function μ(t) is the same for all values of i. If, in fact, the observations do not come from one population, but rather their mean changes at some point(s), the results of principal component analysis are confounded by the change(s). It is therefore important to develop a methodology to test the assumption of a common functional mean. We develop such a test using quantities which can be readily computed in the R package fda. The null distribution of the test statistic is asymptotically pivotal with a well-known asymptotic distribution. The asymptotic test has excellent finite sample performance. Its application is illustrated on temperature data from England.

5.

Computation of the Generalized F Distribution

Charles F. Dunkl & Donald E. Ramirez 《Australian & New Zealand Journal of Statistics》2001,43(1):21-31

Exact expressions for the cumulative distribution function of a random variable of the form (α₁X₁ + α₂X₂)/Y are given where X₁, X₂ and Y are independent chi-squared random variables. The expressions are applied to the detection of joint outliers and Hotelling's mis-specified T² distribution.
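For readers who want a numerical feel for this distribution, the following sketch estimates the cumulative distribution function of (α₁X₁ + α₂X₂)/Y by plain Monte Carlo simulation. This is a swapped-in technique and not the exact expressions derived in the paper; the function names, the number of draws and the chosen degrees of freedom are assumptions made for illustration.

```python
import random


def chi_squared(df, rng):
    """Draw one chi-squared variate: chi2(df) is Gamma(shape=df/2, scale=2)."""
    return rng.gammavariate(df / 2.0, 2.0)


def ratio_cdf_mc(t, a1, a2, df1, df2, df_y, n_draws=20000, seed=0):
    """Monte Carlo estimate of P((a1*X1 + a2*X2) / Y <= t)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_draws):
        x1 = chi_squared(df1, rng)
        x2 = chi_squared(df2, rng)
        y = chi_squared(df_y, rng)
        if (a1 * x1 + a2 * x2) / y <= t:
            hits += 1
    return hits / n_draws


# Sanity check: with a1 = a2 = 1 and degrees of freedom (2, 2, 4) the ratio
# follows an F(4, 4) distribution, whose median is 1, so the estimate
# should be close to 0.5.
p_half = ratio_cdf_mc(1.0, 1.0, 1.0, 2, 2, 4)
print(p_half)
```

Such a simulation is mainly useful as an independent check on an exact formula, since its accuracy improves only with the square root of the number of draws.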

6.

Non-parametric Regression with Dependent Censored Data (cited by: 1; self-citations: 0; citations by others: 1)

ANOUAR EL GHOUCH INGRID VAN KEILEGOM 《Scandinavian Journal of Statistics》2008,35(2):228-247

Abstract. Let ( X _i, Y _i) ( i = 1 ,…, n ) be n replications of a random vector ( X , Y ), where Y is supposed to be subject to random right censoring. The data ( X _i, Y _i) are assumed to come from a stationary α -mixing process. We consider the problem of estimating the function m ( x ) = E ( φ ( Y ) | X = x ), for some known transformation φ . This problem is approached in the following way: first, we introduce a transformed variable , that is not subject to censoring and satisfies the relation , and then we estimate m ( x ) by applying local linear regression techniques. As a by-product, we obtain a general result on the uniform rate of convergence of kernel type estimators of functionals of an unknown distribution function, under strong mixing assumptions.

7.

IMPROVING UPON THE BEST INVARIANT ESTIMATOR IN MULTIVARIATE LOCATION PROBLEMS

Madan L. Puri Dan A. Ralescu 《Australian & New Zealand Journal of Statistics》1983,25(3):453-462

We are concerned with estimators which improve upon the best invariant estimator, in estimating a location parameter θ. If the loss function is L(θ - a) with L convex, we give sufficient conditions for the inadmissibility of δ₀(X) = X. If the loss is a weighted sum of squared errors, we find various classes of estimators δ which are better than δ₀. In general, δ is the convolution of δ₁ (an estimator which improves upon δ₀ outside of a compact set) with a suitable probability density in R^p. The critical dimension of inadmissibility depends on the estimator δ₁. We also give several examples of estimators δ obtained in this way and state some open problems.

8.

Frequency Domain Tests of Semiparametric Hypotheses for Locally Stationary Processes

MARIOS SERGIDES EFSTATHIOS PAPARODITIS 《Scandinavian Journal of Statistics》2009,36(4):800-821

Abstract. Many time series in applied sciences obey a time-varying spectral structure. In this article, we focus on locally stationary processes and develop tests of the hypothesis that the time-varying spectral density has a semiparametric structure, including the interesting case of a time-varying autoregressive moving-average (tvARMA) model. The test introduced is based on an L₂-distance measure of a kernel smoothed version of the local periodogram rescaled by the time-varying spectral density of the estimated semiparametric model. The asymptotic distribution of the test statistic under the null hypothesis is derived. As an interesting special case, we focus on the problem of testing for the presence of a tvAR model. A semiparametric bootstrap procedure to approximate more accurately the distribution of the test statistic under the null hypothesis is proposed. Some simulations illustrate the behaviour of our testing methodology in finite sample situations.

9.

Semiparametric Likelihood Based Method for Goodness of Fit Tests and Estimation in Upgraded Mixture Models

Jing Qin 《Scandinavian Journal of Statistics》1998,25(4):681-691

We use Owen's (1988, 1990) empirical likelihood method in upgraded mixture models. Two groups of independent observations are available. One is z₁, ..., z_n which is observed directly from a distribution F(z). The other one is x₁, ..., x_m which is observed indirectly from F(z), where the x_i's have density ∫ p(x | z) dF(z) and p(x | z) is a conditional density function. We are interested in testing H₀: p(x | z) = p(x | z; θ), for some specified smooth density function. A semiparametric likelihood ratio based statistic is proposed and it is shown that it converges to a chi-squared distribution. This is a simple method for doing goodness of fit tests, especially when x is a discrete variable with finitely many values. In addition, we discuss estimation of θ and F(z) when H₀ is true. The connection between upgraded mixture models and general estimating equations is pointed out.

10.

Bootstrapping frequency domain tests in multivariate time series with an application to comparing spectral densities

Holger Dette Efstathios Paparoditis 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2009,71(4):831-857

Summary. We propose a general bootstrap procedure to approximate the null distribution of non-parametric frequency domain tests about the spectral density matrix of a multivariate time series. Under a set of easy-to-verify conditions, we establish asymptotic validity of the bootstrap procedure proposed. We apply a version of this procedure together with a new statistic to test the hypothesis that the spectral densities of not necessarily independent time series are equal. The test statistic proposed is based on an L ₂-distance between the non-parametrically estimated individual spectral densities and an overall, 'pooled' spectral density, the latter being obtained by using the whole set of m time series considered. The effects of the dependence between the time series on the power behaviour of the test are investigated. Some simulations are presented and a real life data example is discussed.

11.

Rejoinder to 'Ahmed, M.S. (1998). A note on regression-type estimators using multiple auxiliary information.'

Rahul Mukerjee T.J. Rao & K. Vijayan 《Australian & New Zealand Journal of Statistics》2000,42(2):245-245

In the estimators t₃, t₄, t₅ of Mukerjee, Rao & Vijayan (1987), b_{yx} and b_{yz} are partial regression coefficients of y on x and z, respectively, based on the smaller sample. With the above interpretation of b_{yx} and b_{yz} in t₃, t₄, t₅, all the calculations in Mukerjee et al. (1987) are correct. In this connection, we also wish to make it explicit that b_{xz} in t₅ is an ordinary and not a partial regression coefficient. The 'corrected' MSEs of t₃, t₄, t₅, as given in Ahmed (1998, Section 3) are computed assuming that our b_{yx} and b_{yz} are ordinary and not partial regression coefficients. Indeed, we had no intention of giving estimators using the corresponding ordinary regression coefficients which would lead to estimators inferior to those given by Kiregyera (1984). We accept responsibility for any notational confusion created by us and express regret to readers who have been confused by our notation. Finally, in consideration of the above, it may be noted that Tripathi & Ahmed's (1995) estimator t₀, quoted also in Ahmed (1998), is no better than t₅ of Mukerjee et al. (1987).

12.

Modelling and smoothing parameter estimation with multiple quadratic penalties (cited by: 1; self-citations: 0; citations by others: 1)

S. N. Wood 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2000,62(2):413-428

Penalized likelihood methods provide a range of practical modelling tools, including spline smoothing, generalized additive models and variants of ridge regression. Selecting the correct weights for penalties is a critical part of using these methods and in the single-penalty case the analyst has several well-founded techniques to choose from. However, many modelling problems suggest a formulation employing multiple penalties, and here general methodology is lacking. A wide family of models with multiple penalties can be fitted to data by iterative solution of the generalized ridge regression problem: minimize ||W^{1/2}(Xp − y)||² ρ + Σ_{i=1}^m θ_i p'S_i p (p is a parameter vector, X a design matrix, S_i a non-negative definite coefficient matrix defining the i th penalty with associated smoothing parameter θ_i, W a diagonal weight matrix, y a vector of data or pseudodata and ρ an 'overall' smoothing parameter included for computational efficiency). This paper shows how smoothing parameter selection can be performed efficiently by applying generalized cross-validation to this problem and how this allows non-linear, generalized linear and linear models to be fitted using multiple penalties, substantially increasing the scope of penalized modelling methods. Examples of non-linear modelling, generalized additive modelling and anisotropic smoothing are given.

13.

Stochastic Monotonicity and Conditioning in the Limit

Olle Nerman 《Scandinavian Journal of Statistics》1998,25(3):569-572

Suppose that {(X_n, Y_n)} is a sequence of pairs of vector-valued stochastic variables which converges weakly to (X, Y), and that {y_n} converges to y. Sufficient conditions for the conditional distribution of X_n given Y = y are given in terms of stochastic monotonicity. Conditions, which guarantee that also moments of the conditional distributions converge to the moments of the ones of the limit, are also derived.

14.

Ordering of Sequentially Sampled Exponential Experiments

Eitan Greenshtein & Erik Torgersen 《Scandinavian Journal of Statistics》1998,25(2):325-329

Let X₁, X₂, ... be a sequence of i.i.d. random variables, X_i ∼ F_θ, θ∈Θ. Let N₁ and N₂ be two stopping rules. For a class of exponential families {F_θ: θ∈Θ} we show that the experiment Y₁ = (X₁, ..., X_{N₁}) carries more statistical information than Y₂ = (X₁, ..., X_{N₂}) only if N₁ is stochastically larger than N₂.

15.

Weighted Wilcoxon Estimates for Autoregression (cited by: 1; self-citations: 0; citations by others: 1)

Jeffrey T. Terpstra Joseph W. McKean & Joshua D. Naranjo 《Australian & New Zealand Journal of Statistics》2001,43(4):399-419

This paper explores the class of weighted Wilcoxon (WW) estimates in the context of autoregressive parameter estimation, giving special attention to three sub-classes of so-called WW-estimates. When the weights are constant, the estimate is equivalent to using Jaeckel's estimate with Wilcoxon scores. The paper presents asymptotic linearity properties for the three sub-classes of WW-estimates. These properties imply that the estimates are asymptotically normal at rate n ^½. Tests of hypotheses as well as standard errors for confidence interval procedures can be based on such results. Furthermore, the estimates can be computed with an L ₁ regression routine once the weights have been calculated. Examples and a Monte Carlo study over innovation and additive outlier models suggest that WW-estimates can be both robust and highly efficient.

16.

An Analysis of Swendsen–Wang and Related Sampling Methods

George S. Fishman 《Journal of the Royal Statistical Society. Series B, Statistical methodology》1999,61(3):623-641

Convergence rates, statistical efficiency and sampling costs are studied for the original and extended Swendsen–Wang methods of generating a sample path {S_j, j ≥ 1} with equilibrium distribution π, with r distinct elements, on a finite state space X of size N₁. Given S_{j−1}, each method uses auxiliary random variables to identify the subset of X from which S_j is to be randomly sampled. Let π_min and π_max denote respectively the smallest and largest elements in π and let N_r denote the number of elements in π with value π_max. For a single auxiliary variable, uniform sampling from the subset and (N₁ − N_r)π_min + N_r π_max ≈ 1, our results show rapid convergence and high statistical efficiency for large π_min/π_max or N_r/N₁ and slow convergence and poor statistical efficiency for small π_min/π_max and N_r/N₁. Other examples provide additional insight. For extended Swendsen–Wang methods with non-uniform subset sampling, the analysis identifies the properties of a decomposition of π(x) that favour fast convergence and high statistical efficiency. In the absence of exploitable special structure, subset sampling can be costly regardless of which of these methods is employed.

17.

Penalized Projection Estimator for Volatility Density

F. COMTE V. GENON-CATALOT 《Scandinavian Journal of Statistics》2006,33(4):875-893

Abstract. In this paper, we consider a stochastic volatility model ( Y _t, V _t), where the volatility (V_t) is a positive stationary Markov process. We assume that ( ln V _t) admits a stationary density f that we want to estimate. Only the price process Y _t is observed at n discrete times with regular sampling interval Δ . We propose a non-parametric estimator for f obtained by a penalized projection method. Under mixing assumptions on ( V _t), we derive bounds for the quadratic risk of the estimator. Assuming that Δ=Δ_n tends to 0 while the number of observations and the length of the observation time tend to infinity, we discuss the rate of convergence of the risk. Examples of models included in this framework are given.

18.

Prior elicitation, variable selection and Bayesian computation for logistic regression models

M.-H. Chen J. G. Ibrahim & C. Yiannoutsos 《Journal of the Royal Statistical Society. Series B, Statistical methodology》1999,61(1):223-242

Bayesian selection of variables is often difficult to carry out because of the challenge in specifying prior distributions for the regression parameters for all possible models, specifying a prior distribution on the model space and computations. We address these three issues for the logistic regression model. For the first, we propose an informative prior distribution for variable selection. Several theoretical and computational properties of the prior are derived and illustrated with several examples. For the second, we propose a method for specifying an informative prior on the model space, and for the third we propose novel methods for computing the marginal distribution of the data. The new computational algorithms only require Gibbs samples from the full model to facilitate the computation of the prior and posterior model probabilities for all possible models. Several properties of the algorithms are also derived. The prior specification for the first challenge focuses on the observables in that the elicitation is based on a prior prediction y ₀ for the response vector and a quantity a ₀ quantifying the uncertainty in y ₀. Then, y ₀ and a ₀ are used to specify a prior for the regression coefficients semi-automatically. Examples using real data are given to demonstrate the methodology.

19.

Estimation and testing stationarity for double-autoregressive models

Shiqing Ling 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2004,66(1):63-78

Summary. The paper considers the double-autoregressive model y _t = φ y _{t −1}+ ɛ _t with ɛ _t = . Consistency and asymptotic normality of the estimated parameters are proved under the condition E ln | φ +√ α η _t|<0, which includes the cases with | φ |=1 or | φ |>1 as well as . It is well known that all kinds of estimators of φ in these cases are not normal when ɛ _t are independent and identically distributed. Our result is novel and surprising. Two tests are proposed for testing stationarity of the model and their asymptotic distributions are shown to be a function of bivariate Brownian motions. Critical values of the tests are tabulated and some simulation results are reported. An application to the US 90-day treasury bill rate series is given.

20.

Outlier Identification Procedures for Contingency Tables using Maximum Likelihood and L₁ Estimates

Sonja Kuhnt 《Scandinavian Journal of Statistics》2004,31(3):431-442

Abstract. Observed cell counts in contingency tables are perceived as outliers if they have low probability under an anticipated loglinear Poisson model. New procedures for the identification of such outliers are derived using the classical maximum likelihood estimator and an estimator based on the L ₁ norm.