首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Abstract.  Given an i.i.d. sample drawn from a density f on the real line, the problem of testing whether f is in a given class of densities is considered. Testing procedures constructed on the basis of minimizing the L 1-distance between a kernel density estimate and any density in the hypothesized class are investigated. General non-asymptotic bounds are derived for the power of the test. It is shown that the concentration of the data-dependent smoothing factor and the 'size' of the hypothesized class of densities play a key role in the performance of the test. Consistency and non-asymptotic performance bounds are established in several special cases, including testing simple hypotheses, translation/scale classes and symmetry. Simulations are also carried out to compare the behaviour of the method with the Kolmogorov-Smirnov test and an L 2 density-based approach due to Fan [ Econ. Theory 10 (1994) 316].  相似文献   

2.
Abstract.  Suppose that X 1 ,…,  X n is a sequence of independent random vectors, identically distributed as a d -dimensional random vector X . Let     be a parameter of interest and     be some nuisance parameter. The unknown, true parameters ( μ 0 , ν 0 ) are uniquely determined by the system of equations E { g ( X , μ 0 , ν 0 )} =   0 , where g  =  ( g 1 ,…, g p + q ) is a vector of p + q functions. In this paper we develop an empirical likelihood (EL) method to do inference for the parameter μ 0 . The results in this paper are valid under very mild conditions on the vector of criterion functions g . In particular, we do not require that g 1 ,…, g p + q are smooth in μ or ν . This offers the advantage that the criterion function may involve indicators, which are encountered when considering, e.g. differences of quantiles, copulas, ROC curves, to mention just a few examples. We prove the asymptotic limit of the empirical log-likelihood ratio, and carry out a small simulation study to test the performance of the proposed EL method for small samples.  相似文献   

3.
The objective of this paper is to investigate exact slopes of test statistics { Tn } when the random vectors X 1, ..., Xn are distributed according to an unknown member of an exponential family { P θ; θ∈Ω. Here Ω is a parameter set. We will be concerned with the hypothesis testing problem of H 0θ∈Ω0 vs H 1: θ∉Ω0 where Ω0 is a subset of Ω. It will be shown that for an important class of problems and test statistics the exact slope of { Tn } at η in Ω−Ω0 is determined by the shortest Kullback–Leibler distance from {θ: Tn (λ(θ)) = Tn (λ(π))} to Ω0, λθ = E θ)( X ).  相似文献   

4.
Summary.  Principal component analysis has become a fundamental tool of functional data analysis. It represents the functional data as X i ( t )= μ ( t )+Σ1≤ l <∞ η i ,  l +  v l ( t ), where μ is the common mean, v l are the eigenfunctions of the covariance operator and the η i ,  l are the scores. Inferential procedures assume that the mean function μ ( t ) is the same for all values of i . If, in fact, the observations do not come from one population, but rather their mean changes at some point(s), the results of principal component analysis are confounded by the change(s). It is therefore important to develop a methodology to test the assumption of a common functional mean. We develop such a test using quantities which can be readily computed in the R package fda. The null distribution of the test statistic is asymptotically pivotal with a well-known asymptotic distribution. The asymptotic test has excellent finite sample performance. Its application is illustrated on temperature data from England.  相似文献   

5.
Exact expressions for the cumulative distribution function of a random variable of the form ( α 1 X 1+ α 2 X 2)/ Y are given where X 1, X 2 and Y are independent chi-squared random variables. The expressions are applied to the detection of joint outliers and Hotelling's mis-specified T 2 distribution.  相似文献   

6.
Non-parametric Regression with Dependent Censored Data   总被引:1,自引:0,他引:1  
Abstract.  Let ( X i , Y i ) ( i = 1 ,…, n ) be n replications of a random vector ( X , Y  ), where Y is supposed to be subject to random right censoring. The data ( X i , Y i ) are assumed to come from a stationary α -mixing process. We consider the problem of estimating the function m ( x ) = E ( φ ( Y ) |  X = x ), for some known transformation φ . This problem is approached in the following way: first, we introduce a transformed variable     , that is not subject to censoring and satisfies the relation     , and then we estimate m ( x ) by applying local linear regression techniques. As a by-product, we obtain a general result on the uniform rate of convergence of kernel type estimators of functionals of an unknown distribution function, under strong mixing assumptions.  相似文献   

7.
We are concerned with estimators which improve upon the best invariant estimator, in estimating a location parameter θ. If the loss function is L(θ - a) with L convex, we give sufficient conditions for the inadmissibility of δ0(X) = X. If the loss is a weighted sum of squared errors, we find various classes of estimators δ which are better than δ0. In general, δ is the convolution of δ1 (an estimator which improves upon δ0 outside of a compact set) with a suitable probability density in Rp. The critical dimension of inadmissibility depends on the estimator δ1 We also give several examples of estimators δ obtained in this way and state some open problems.  相似文献   

8.
Abstract.  Many time series in applied sciences obey a time-varying spectral structure. In this article, we focus on locally stationary processes and develop tests of the hypothesis that the time-varying spectral density has a semiparametric structure, including the interesting case of a time-varying autoregressive moving-average (tvARMA) model. The test introduced is based on a L 2 -distance measure of a kernel smoothed version of the local periodogram rescaled by the time-varying spectral density of the estimated semiparametric model. The asymptotic distribution of the test statistic under the null hypothesis is derived. As an interesting special case, we focus on the problem of testing for the presence of a tvAR model. A semiparametric bootstrap procedure to approximate more accurately the distribution of the test statistic under the null hypothesis is proposed. Some simulations illustrate the behaviour of our testing methodology in finite sample situations.  相似文献   

9.
We use Owen's (1988, 1990) empirical likelihood method in upgraded mixture models. Two groups of independent observations are available. One is z 1, ..., z n which is observed directly from a distribution F ( z ). The other one is x 1, ..., x m which is observed indirectly from F ( z ), where the x i s have density ∫ p ( x | z ) dF ( z ) and p ( x | z ) is a conditional density function. We are interested in testing H 0: p ( x | z ) = p ( x | z ; θ ), for some specified smooth density function. A semiparametric likelihood ratio based statistic is proposed and it is shown that it converges to a chi-squared distribution. This is a simple method for doing goodness of fit tests, especially when x is a discrete variable with finitely many values. In addition, we discuss estimation of θ and F ( z ) when H 0 is true. The connection between upgraded mixture models and general estimating equations is pointed out.  相似文献   

10.
Summary.  We propose a general bootstrap procedure to approximate the null distribution of non-parametric frequency domain tests about the spectral density matrix of a multivariate time series. Under a set of easy-to-verify conditions, we establish asymptotic validity of the bootstrap procedure proposed. We apply a version of this procedure together with a new statistic to test the hypothesis that the spectral densities of not necessarily independent time series are equal. The test statistic proposed is based on an L 2-distance between the non-parametrically estimated individual spectral densities and an overall, 'pooled' spectral density, the latter being obtained by using the whole set of m time series considered. The effects of the dependence between the time series on the power behaviour of the test are investigated. Some simulations are presented and a real life data example is discussed.  相似文献   

11.
In the estimators t 3 , t 4 , t 5 of Mukerjee, Rao & Vijayan (1987), b y x and b y z are partial regression coefficients of y on x and z , respectively, based on the smaller sample. With the above interpretation of b y x and b y z in t 3 , t 4 , t 5 , all the calculations in Mukerjee at al. (1987) are correct. In this connection, we also wish to make it explicit that b x z in t 5 is an ordinary and not a partial regression coefficient. The 'corrected' MSEs of t 3 , t 4 , t 5 , as given in Ahmed (1998 Section 3) are computed assuming that our b y x and b y z are ordinary and not partial regression coefficients. Indeed, we had no intention of giving estimators using the corresponding ordinary regression coefficients which would lead to estimators inferior to those given by Kiregyera (1984). We accept responsibility for any notational confusion created by us and express regret to readers who have been confused by our notation. Finally, in consideration of the above, it may be noted that Tripathi & Ahmed's (1995) estimator t 0 , quoted also in Ahmed (1998), is no better than t 5 of Mukerjee at al. (1987).  相似文献   

12.
Penalized likelihood methods provide a range of practical modelling tools, including spline smoothing, generalized additive models and variants of ridge regression. Selecting the correct weights for penalties is a critical part of using these methods and in the single-penalty case the analyst has several well-founded techniques to choose from. However, many modelling problems suggest a formulation employing multiple penalties, and here general methodology is lacking. A wide family of models with multiple penalties can be fitted to data by iterative solution of the generalized ridge regression problem minimize || W 1/2 ( Xp − y ) ||2ρ+Σ i =1 m  θ i p ' S i p ( p is a parameter vector, X a design matrix, S i a non-negative definite coefficient matrix defining the i th penalty with associated smoothing parameter θ i , W a diagonal weight matrix, y a vector of data or pseudodata and ρ an 'overall' smoothing parameter included for computational efficiency). This paper shows how smoothing parameter selection can be performed efficiently by applying generalized cross-validation to this problem and how this allows non-linear, generalized linear and linear models to be fitted using multiple penalties, substantially increasing the scope of penalized modelling methods. Examples of non-linear modelling, generalized additive modelling and anisotropic smoothing are given.  相似文献   

13.
Suppose that {( X n , Y n )} is a sequence of pairs of cector-valued stochastic variables which converges weakly to ( X , Y ), and that { y n } converges to y . Sufficient conditions for the conditional distribution of X n given Y = y are given in terms of stochastic monotonicity. Conditions, which guarantee that also moments of the conditional distributions converge to the moments of the ones of the limit, are also derived.  相似文献   

14.
Let X 1, X 2, ... be a sequence of i.i.d. random variables, X i∼ F θ, θ∈Θ. Let N 1 and N 2 be two stopping rules. For a class of exponential families { F θ: θ∈Θ} we show that the experiment Y 1 = ( X 1, ..., X N1) carries more statistical information than Y 2 = ( X 1, ..., x N2) only if N 1 is stochastically larger then N 2  相似文献   

15.
Weighted Wilcoxon Estimates for Autoregression   总被引:1,自引:0,他引:1  
This paper explores the class of weighted Wilcoxon (WW) estimates in the context of autoregressive parameter estimation, giving special attention to three sub-classes of so-called WW-estimates. When the weights are constant, the estimate is equivalent to using Jaeckel's estimate with Wilcoxon scores. The paper presents asymptotic linearity properties for the three sub-classes of WW-estimates. These properties imply that the estimates are asymptotically normal at rate n ½. Tests of hypotheses as well as standard errors for confidence interval procedures can be based on such results. Furthermore, the estimates can be computed with an L 1 regression routine once the weights have been calculated. Examples and a Monte Carlo study over innovation and additive outlier models suggest that WW-estimates can be both robust and highly efficient.  相似文献   

16.
Convergence rates, statistical efficiency and sampling costs are studied for the original and extended Swendsen–Wang methods of generating a sample path { S j , j ≥1} with equilibrium distribution π , with r distinct elements, on a finite state space X of size N 1. Given S j -1, each method uses auxiliary random variables to identify the subset of X from which S j is to be randomly sampled. Let πmin and πmax denote respectively the smallest and largest elements in π and let Nr denote the number of elements in π with value πmax. For a single auxiliary variable, uniform sampling from the subset and ( N 1− Nrmin+ Nr πmax≈1, our results show rapid convergence and high statistical efficiency for large πminmax or Nr / N 1 and slow convergence and poor statistical efficiency for small πminmax and Nr / N1 . Other examples provide additional insight. For extended Swendsen–Wang methods with non-uniform subset sampling, the analysis identifies the properties of a decomposition of π( x ) that favour fast convergence and high statistical efficiency. In the absence of exploitable special structure, subset sampling can be costly regardless of which of these methods is employed.  相似文献   

17.
Abstract.  In this paper, we consider a stochastic volatility model ( Y t , V t ), where the volatility (V t ) is a positive stationary Markov process. We assume that ( ln V t ) admits a stationary density f that we want to estimate. Only the price process Y t is observed at n discrete times with regular sampling interval Δ . We propose a non-parametric estimator for f obtained by a penalized projection method. Under mixing assumptions on ( V t ), we derive bounds for the quadratic risk of the estimator. Assuming that Δ=Δ n tends to 0 while the number of observations and the length of the observation time tend to infinity, we discuss the rate of convergence of the risk. Examples of models included in this framework are given.  相似文献   

18.
Bayesian selection of variables is often difficult to carry out because of the challenge in specifying prior distributions for the regression parameters for all possible models, specifying a prior distribution on the model space and computations. We address these three issues for the logistic regression model. For the first, we propose an informative prior distribution for variable selection. Several theoretical and computational properties of the prior are derived and illustrated with several examples. For the second, we propose a method for specifying an informative prior on the model space, and for the third we propose novel methods for computing the marginal distribution of the data. The new computational algorithms only require Gibbs samples from the full model to facilitate the computation of the prior and posterior model probabilities for all possible models. Several properties of the algorithms are also derived. The prior specification for the first challenge focuses on the observables in that the elicitation is based on a prior prediction y 0 for the response vector and a quantity a 0 quantifying the uncertainty in y 0. Then, y 0 and a 0 are used to specify a prior for the regression coefficients semi-automatically. Examples using real data are given to demonstrate the methodology.  相似文献   

19.
Summary.  The paper considers the double-autoregressive model y t  =  φ y t −1+ ɛ t with ɛ t  =     . Consistency and asymptotic normality of the estimated parameters are proved under the condition E  ln | φ  +√ α η t |<0, which includes the cases with | φ |=1 or | φ |>1 as well as     . It is well known that all kinds of estimators of φ in these cases are not normal when ɛ t are independent and identically distributed. Our result is novel and surprising. Two tests are proposed for testing stationarity of the model and their asymptotic distributions are shown to be a function of bivariate Brownian motions. Critical values of the tests are tabulated and some simulation results are reported. An application to the US 90-day treasury bill rate series is given.  相似文献   

20.
Abstract.  Observed cell counts in contingency tables are perceived as outliers if they have low probability under an anticipated loglinear Poisson model. New procedures for the identification of such outliers are derived using the classical maximum likelihood estimator and an estimator based on the L 1 norm.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号