首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
We use Owen's (1988, 1990) empirical likelihood method in upgraded mixture models. Two groups of independent observations are available. One is z 1, ..., z n which is observed directly from a distribution F ( z ). The other one is x 1, ..., x m which is observed indirectly from F ( z ), where the x i s have density ∫ p ( x | z ) dF ( z ) and p ( x | z ) is a conditional density function. We are interested in testing H 0: p ( x | z ) = p ( x | z ; θ ), for some specified smooth density function. A semiparametric likelihood ratio based statistic is proposed and it is shown that it converges to a chi-squared distribution. This is a simple method for doing goodness of fit tests, especially when x is a discrete variable with finitely many values. In addition, we discuss estimation of θ and F ( z ) when H 0 is true. The connection between upgraded mixture models and general estimating equations is pointed out.  相似文献   

2.
Abstract.  Suppose that X 1 ,…,  X n is a sequence of independent random vectors, identically distributed as a d -dimensional random vector X . Let     be a parameter of interest and     be some nuisance parameter. The unknown, true parameters ( μ 0 , ν 0 ) are uniquely determined by the system of equations E { g ( X , μ 0 , ν 0 )} =   0 , where g  =  ( g 1 ,…, g p + q ) is a vector of p + q functions. In this paper we develop an empirical likelihood (EL) method to do inference for the parameter μ 0 . The results in this paper are valid under very mild conditions on the vector of criterion functions g . In particular, we do not require that g 1 ,…, g p + q are smooth in μ or ν . This offers the advantage that the criterion function may involve indicators, which are encountered when considering, e.g. differences of quantiles, copulas, ROC curves, to mention just a few examples. We prove the asymptotic limit of the empirical log-likelihood ratio, and carry out a small simulation study to test the performance of the proposed EL method for small samples.  相似文献   

3.
Goodness of Fit via Non-parametric Likelihood Ratios   总被引:1,自引:0,他引:1  
Abstract.  To test if a density f is equal to a specified f 0, one knows by the Neyman–Pearson lemma the form of the optimal test at a specified alternative f 1. Any non-parametric density estimation scheme allows an estimate of f . This leads to estimated likelihood ratios. Properties are studied of tests which for the density estimation ingredient use log-linear expansions. Such expansions are either coupled with subset selectors like the Akaike information criterion and the Bayesian information criterion regimes, or use order growing with sample size. Our tests are generalized to testing the adequacy of general parametric models, and to work also in higher dimensions. The tests are related to, but are different from, the 'smooth tests' that go back to Neyman [Skandinavisk Aktuarietidsskrift 20(1937) 149] and that have been studied extensively in recent literature. Our tests are large-sample equivalent to such smooth tests under local alternative conditions, but different from the smooth tests and often better under non-local conditions.  相似文献   

4.
Estimating smooth monotone functions   总被引:1,自引:0,他引:1  
Many situations call for a smooth strictly monotone function f of arbitrary flexibility. The family of functions defined by the differential equation D  2 f  = w Df , where w is an unconstrained coefficient function comprises the strictly monotone twice differentiable functions. The solution to this equation is f = C 0 + C 1  D −1{exp( D −1 w )}, where C 0 and C 1 are arbitrary constants and D −1 is the partial integration operator. A basis for expanding w is suggested that permits explicit integration in the expression of f . In fitting data, it is also useful to regularize f by penalizing the integral of w 2 since this is a measure of the relative curvature in f . Applications are discussed to monotone nonparametric regression, to the transformation of the dependent variable in non-linear regression and to density estimation.  相似文献   

5.
Summary.  It is well known that in a sequential study the probability that the likelihood ratio for a simple alternative hypothesis H 1 versus a simple null hypothesis H 0 will ever be greater than a positive constant c will not exceed 1/ c under H 0. However, for a composite alternative hypothesis, this bound of 1/ c will no longer hold when a generalized likelihood ratio statistic is used. We consider a stepwise likelihood ratio statistic which, for each new observation, is updated by cumulatively multiplying the ratio of the conditional likelihoods for the composite alternative hypothesis evaluated at an estimate of the parameter obtained from the preceding observations versus the simple null hypothesis. We show that, under the null hypothesis, the probability that this stepwise likelihood ratio will ever be greater than c will not exceed 1/ c . In contrast, under the composite alternative hypothesis, this ratio will generally converge in probability to ∞. These results suggest that a stepwise likelihood ratio statistic can be useful in a sequential study for testing a composite alternative versus a simple null hypothesis. For illustration, we conduct two simulation studies, one for a normal response and one for an exponential response, to compare the performance of a sequential test based on a stepwise likelihood ratio statistic with a constant boundary versus some existing approaches.  相似文献   

6.
It is known that the empirical likelihood ratio can be used to construct confidence regions for smooth functions of the mean, Fréchet differentiable statistical functionals and for a class of M-functionals. In this paper, we argue that this use can be extended to the class of functionals which are smooth functions of M-functionals. In particular, we find the conditions under which the empirical log-likelihood ratio for this kind of functionals admits a χ2 approxima tion. Furthermore, we investigate, by simulation methods, the related approximation error in some contexts of practical interest.  相似文献   

7.
Let X 1, X 2, ... be a sequence of i.i.d. random variables, X i∼ F θ, θ∈Θ. Let N 1 and N 2 be two stopping rules. For a class of exponential families { F θ: θ∈Θ} we show that the experiment Y 1 = ( X 1, ..., X N1) carries more statistical information than Y 2 = ( X 1, ..., x N2) only if N 1 is stochastically larger then N 2  相似文献   

8.
Abstract.  The supremum difference between the cumulative sum diagram, and its greatest convex minorant (GCM), in case of non-parametric isotonic regression is considered. When the regression function is strictly increasing, and the design points are unequally spaced, but approximate a positive density in even a slow rate ( n −1/3), then the difference is shown to shrink in a very rapid (close to n −2/3) rate. The result is analogous to the corresponding result in case of a monotone density estimation established by Kiefer and Wolfowitz, but uses entirely different representation. The limit distribution of the GCM as a process on the unit interval is obtained when the design variables are i.i.d. with a positive density. Finally, a pointwise asymptotic normality result is proved for the smooth monotone estimator, obtained by the convolution of a kernel with the classical monotone estimator.  相似文献   

9.
Summary.  Because highly correlated data arise from many scientific fields, we investigate parameter estimation in a semiparametric regression model with diverging number of predictors that are highly correlated. For this, we first develop a distribution-weighted least squares estimator that can recover directions in the central subspace, then use the distribution-weighted least squares estimator as a seed vector and project it onto a Krylov space by partial least squares to avoid computing the inverse of the covariance of predictors. Thus, distrbution-weighted partial least squares can handle the cases with high dimensional and highly correlated predictors. Furthermore, we also suggest an iterative algorithm for obtaining a better initial value before implementing partial least squares. For theoretical investigation, we obtain strong consistency and asymptotic normality when the dimension p of predictors is of convergence rate O { n 1/2/ log ( n )} and o ( n 1/3) respectively where n is the sample size. When there are no other constraints on the covariance of predictors, the rates n 1/2 and n 1/3 are optimal. We also propose a Bayesian information criterion type of criterion to estimate the dimension of the Krylov space in the partial least squares procedure. Illustrative examples with a real data set and comprehensive simulations demonstrate that the method is robust to non-ellipticity and works well even in 'small n –large p ' problems.  相似文献   

10.
Abstract.  In this paper, we consider a stochastic volatility model ( Y t , V t ), where the volatility (V t ) is a positive stationary Markov process. We assume that ( ln V t ) admits a stationary density f that we want to estimate. Only the price process Y t is observed at n discrete times with regular sampling interval Δ . We propose a non-parametric estimator for f obtained by a penalized projection method. Under mixing assumptions on ( V t ), we derive bounds for the quadratic risk of the estimator. Assuming that Δ=Δ n tends to 0 while the number of observations and the length of the observation time tend to infinity, we discuss the rate of convergence of the risk. Examples of models included in this framework are given.  相似文献   

11.
Suppose the p -variate random vector W , partitioned into q variables W1 and p - q variables W2, follows a multivariate normal mixture distribution. If the investigator is mainly interested in estimation of the parameters of the distribution of W1, there are two possibilities: (1) use only the data on W1 for estimation, and (2) estimate the parameters of the p -variate mixture distribution, and then extract the estimates of the marginal distribution of W1. In this article we study the choice between these two possibilities mainly for the case of two mixture components with identical covariance matrices. We find the asymptotic distribution of the linear discriminant function coefficients using the work of Efron (1975 ) and O'Neill (1978 ), and give a Wald–test for redundancy of W2. A simulation study gives further insights into conditions under which W2 should be used in the analysis: in summary, the inclusion of W2 seems justified if Δ 2.1, the Mahalanobis distance between the two component distributions based on the conditional distribution of W2 given W1, is at least 2.  相似文献   

12.
The objective of this paper is to investigate exact slopes of test statistics { Tn } when the random vectors X 1, ..., Xn are distributed according to an unknown member of an exponential family { P θ; θ∈Ω. Here Ω is a parameter set. We will be concerned with the hypothesis testing problem of H 0θ∈Ω0 vs H 1: θ∉Ω0 where Ω0 is a subset of Ω. It will be shown that for an important class of problems and test statistics the exact slope of { Tn } at η in Ω−Ω0 is determined by the shortest Kullback–Leibler distance from {θ: Tn (λ(θ)) = Tn (λ(π))} to Ω0, λθ = E θ)( X ).  相似文献   

13.
Some studies of the bootstrap have assessed the effect of smoothing the estimated distribution that is resampled, a process usually known as the smoothed bootstrap. Generally, the smoothed distribution for resampling is a kernel estimate and is often rescaled to retain certain characteristics of the empirical distribution. Typically the effect of such smoothing has been measured in terms of the mean-squared error of bootstrap point estimates. The reports of these previous investigations have not been encouraging about the efficacy of smoothing. In this paper the effect of resampling a kernel-smoothed distribution is evaluated through expansions for the coverage of bootstrap percentile confidence intervals. It is shown that, under the smooth function model, proper bandwidth selection can accomplish a first-order correction for the one-sided percentile method. With the objective of reducing the coverage error the appropriate bandwidth for one-sided intervals converges at a rate of n −1/4, rather than the familiar n −1/5 for kernel density estimation. Applications of this same approach to bootstrap t and two-sided intervals yield optimal bandwidths of order n −1/2. These bandwidths depend on moments of the smooth function model and not on derivatives of the underlying density of the data. The relationship of this smoothing method to both the accelerated bias correction and the bootstrap t methods provides some insight into the connections between three quite distinct approximate confidence intervals.  相似文献   

14.
In the estimators t 3 , t 4 , t 5 of Mukerjee, Rao & Vijayan (1987), b y x and b y z are partial regression coefficients of y on x and z , respectively, based on the smaller sample. With the above interpretation of b y x and b y z in t 3 , t 4 , t 5 , all the calculations in Mukerjee at al. (1987) are correct. In this connection, we also wish to make it explicit that b x z in t 5 is an ordinary and not a partial regression coefficient. The 'corrected' MSEs of t 3 , t 4 , t 5 , as given in Ahmed (1998 Section 3) are computed assuming that our b y x and b y z are ordinary and not partial regression coefficients. Indeed, we had no intention of giving estimators using the corresponding ordinary regression coefficients which would lead to estimators inferior to those given by Kiregyera (1984). We accept responsibility for any notational confusion created by us and express regret to readers who have been confused by our notation. Finally, in consideration of the above, it may be noted that Tripathi & Ahmed's (1995) estimator t 0 , quoted also in Ahmed (1998), is no better than t 5 of Mukerjee at al. (1987).  相似文献   

15.
Abstract.  We focus on a class of non-standard problems involving non-parametric estimation of a monotone function that is characterized by n 1/3 rate of convergence of the maximum likelihood estimator, non-Gaussian limit distributions and the non-existence of     -regular estimators. We have shown elsewhere that under a null hypothesis of the type ψ ( z 0) =  θ 0 ( ψ being the monotone function of interest) in non-standard problems of the above kind, the likelihood ratio statistic has a 'universal' limit distribution that is free of the underlying parameters in the model. In this paper, we illustrate its limiting behaviour under local alternatives of the form ψ n ( z ), where ψ n (·) and ψ (·) vary in O ( n −1/3) neighbourhoods around z 0 and ψ n converges to ψ at rate n 1/3 in an appropriate metric. Apart from local alternatives, we also consider the behaviour of the likelihood ratio statistic under fixed alternatives and establish the convergence in probability of an appropriately scaled version of the same to a constant involving a Kullback–Leibler distance.  相似文献   

16.
Let X = (X1, - Xp)prime; ˜ Np (μ, Σ) where μ= (μ1, -, μp)' and Σ= diag (Σ21, -, Σ2p) are both unknown and p3. Let (ni - 2) wi2i! X2ni, independent. of wi (I ≠ j = 1, -, p). Assume that (w1, -, wp) and X are independent. Define W = diag (w1, -, wp) and ¶ X ¶2w= X'W-1Q-1W-1X where Q = diag (q1, -,n qp), qi > 0, i = 1, -, p. In this paper, the minimax estimator of Berger & Bock (1976), given by δ (X, W) = [Ip - r(X, W) ¶ X ¶-2w Q-1W-1] X, is shown to be minimax relative to the convex loss (δ - μ)'[αQ + (1 - α) Σ-1] δ - μ)/C, where C =α tr (Σ) + (1 - α)p and 0 α 1, under certain conditions on r(X, W). This generalizes the above mentioned result of Berger & Bock.  相似文献   

17.
A simple derivation of the non-central χ2 distribution is presented. This requires no advanced mathematical knowledge, and is suitable for use in elementary courses.
The non-central χ2 distribution is of great importance in statistical theory, both in its own right, and as a step in the derivation of other distributions; however, it tends to be neglected in statistical courses largely, it seems, because the standard derivations are too difficult. A recent review of derivations by Guenther (1964) shows that most require some knowledge of n -dimensional geometry or the equivalent matrix theory. The alternative is the use of generating functions, which is straightforward, apart from the inversion back to the density function. An objection to such methods is that they give no insight into the probabilistic nature of the proble.  相似文献   

18.
In this paper we consider the problem of testing for a scale change in the infinite order moving average process X j = i =0 a i j i , where j are i.i.d. r.v.s with E 1 < for some > 0. In performing the test, a cusum of squares test statistic analogous to Inclan & Tiao's (1994) statistic is considered. It is well-known from the literature that outliers affect test procedures leading to false conclusions. In order to remedy this, a cusum of squares test based on trimmed observations is considered. It is demonstrated that this test is robust against outliers, is valid for infinite variance processes as well. Simulation results are given for illustration.  相似文献   

19.
Summary.  The method of Bayesian model selection for join point regression models is developed. Given a set of K +1 join point models M 0,  M 1, …,  M K with 0, 1, …,  K join points respec-tively, the posterior distributions of the parameters and competing models M k are computed by Markov chain Monte Carlo simulations. The Bayes information criterion BIC is used to select the model M k with the smallest value of BIC as the best model. Another approach based on the Bayes factor selects the model M k with the largest posterior probability as the best model when the prior distribution of M k is discrete uniform. Both methods are applied to analyse the observed US cancer incidence rates for some selected cancer sites. The graphs of the join point models fitted to the data are produced by using the methods proposed and compared with the method of Kim and co-workers that is based on a series of permutation tests. The analyses show that the Bayes factor is sensitive to the prior specification of the variance σ 2, and that the model which is selected by BIC fits the data as well as the model that is selected by the permutation test and has the advantage of producing the posterior distribution for the join points. The Bayesian join point model and model selection method that are presented here will be integrated in the National Cancer Institute's join point software ( http://www.srab.cancer.gov/joinpoint/ ) and will be available to the public.  相似文献   

20.
Let X 1, . . ., Xn be independent identically distributed random variables with a common continuous (cumulative) distribution function (d.f.) F , and F^n the empirical d.f. (e.d.f.) based on X 1, . . ., Xn . Let G be a smooth d.f. and Gθ = G (·–θ) its translation through θ∈ R . Using a Kolmogorov-Lévy type metric ρα defined on the space of d.f.s. on R , the paper derives both null and non-null limiting distributions of √ n [ ρα ( Fn , Gθn ) – ρα ( F, Gθ )], √ n (θ n –θ) and √ nρα ( Gθ , Gθ ), where θ n and θ are the minimum ρα -distance parameters for Fn and F from G , respectively. These distributions are known explicitly in important particular cases; with some complementary Monte Carlo simulations, they help us clarify our understanding of estimation using minimum distance methods and supremum type metrics. We advocate use of the minimum distance method with supremum type metrics in cases of non-null models. The resulting functionals are Hadamard differentiable and efficient. For small scale parameters the minimum distance functionals are close to medians of the parent distributions. The optimal small scale models result in minimum distance estimators having asymptotic variances very competitive and comparable with best known robust estimators.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号