Abstract.  We present a wavelet procedure for defining confidence intervals for f(x0), where x0 is a given point and f is an unknown density from which there are independent observations. We use an undersmoothing method which is shown to be near optimal (up to a logarithmic term) in a first order sense. We propose a second order correction using the Edgeworth expansion. The adaptation with respect to the unknown regularity of f is given via a Lepskii type algorithm and has the advantage of being well located. The theoretical results are proved under weak assumptions and concern very irregular or oscillating functions. An empirical study gives some hints for choosing the constant of the threshold level. The results are very encouraging for the length of the intervals as well as for the coverage accuracy.

We develop an approach to evaluating frequentist model averaging procedures by considering them in a simple situation in which there are two nested linear regression models over which we average. We introduce a general class of model‐averaged confidence intervals, obtain exact expressions for the coverage and the scaled expected length of the intervals, and use these to compute these quantities for the model‐averaged profile likelihood (MPI) and model‐averaged tail area confidence intervals proposed by D. Fletcher and D. Turek. We show that the MPI confidence intervals can perform more poorly than the standard confidence interval used after model selection but ignoring the model selection process. The model‐averaged tail area confidence intervals perform better than the MPI and post‐model‐selection confidence intervals but, for the examples that we consider, offer little over simply using the standard confidence interval for θ under the full model, with the same nominal coverage.

In this paper we consider confidence intervals for the ratio of two population variances. We propose a confidence interval for the ratio of two variances based on the t-statistic by deriving its Edgeworth expansion and considering Hall's and Johnson's transformations. Then, we consider the coverage accuracy of suggested intervals and intervals based on the F-statistic for some distributions.

Abstract.  The paper develops empirical Bayes (EB) confidence intervals for population means with distributions belonging to the natural exponential family-quadratic variance function (NEF-QVF) family when the sample size for a particular population is moderate or large. The basis for such development is to find an interval centred around the posterior mean which meets the target coverage probability asymptotically, and then show that the difference between the coverage probabilities of the Bayes and EB intervals is negligible up to a certain order. The approach taken is Edgeworth expansion so that the sample sizes from the different populations need not be significantly large. The proposed intervals meet the target coverage probabilities asymptotically, and are easy to construct. We illustrate use of these intervals in the context of small area estimation both through real and simulated data. The proposed intervals are different from the bootstrap intervals. The latter can be applied quite generally, but the order of accuracy of these intervals in meeting the desired coverage probability is unknown.

The problem of estimating the difference between two binomial proportions is considered. Closed-form approximate confidence intervals (CIs) and a fiducial CI for the difference between proportions are proposed. The approximate CIs are simple to compute, and they perform better than the classical Wald CI in terms of coverage probabilities and precision. Numerical studies indicate that these approximate CIs can be used safely for practical applications under a simple condition. The fiducial CI is more accurate than the approximate CIs in terms of coverage probabilities. The fiducial CIs, the Newcombe CIs, and the Miettinen–Nurminen CIs are comparable in terms of coverage probabilities and precision. The interval estimation procedures are illustrated using two examples.
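
To make the comparison baselines concrete, here is a minimal sketch of two of the reference intervals named in the abstract above: the classical Wald CI and the hybrid score interval commonly attributed to Newcombe, which is built from two Wilson intervals. The function names and the example counts are hypothetical, and the closed-form and fiducial intervals proposed in the paper are not reproduced, since the abstract does not give their formulas.

```python
from math import sqrt
from statistics import NormalDist


def _z(level):
    # Two-sided standard normal critical value for the given confidence level.
    return NormalDist().inv_cdf(0.5 + level / 2)


def wald_diff_ci(x1, n1, x2, n2, level=0.95):
    """Classical Wald interval for p1 - p2."""
    z = _z(level)
    p1, p2 = x1 / n1, x2 / n2
    se = sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    d = p1 - p2
    return d - z * se, d + z * se


def wilson_ci(x, n, z):
    """Wilson score interval for a single proportion."""
    p = x / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


def newcombe_diff_ci(x1, n1, x2, n2, level=0.95):
    """Hybrid score interval for p1 - p2, combining two Wilson intervals."""
    z = _z(level)
    p1, p2 = x1 / n1, x2 / n2
    l1, u1 = wilson_ci(x1, n1, z)
    l2, u2 = wilson_ci(x2, n2, z)
    d = p1 - p2
    lower = d - sqrt((p1 - l1) ** 2 + (u2 - p2) ** 2)
    upper = d + sqrt((u1 - p1) ** 2 + (p2 - l2) ** 2)
    return lower, upper


if __name__ == "__main__":
    # Hypothetical counts: 0 successes out of 10 in both groups.
    print(wald_diff_ci(0, 10, 0, 10))      # degenerate, zero-width interval
    print(newcombe_diff_ci(0, 10, 0, 10))  # interval with positive width
```

The example with zero successes in both groups shows one reason the Wald interval is a weak baseline: its estimated standard error is zero, so the interval collapses to a single point, whereas the hybrid score interval keeps a positive width.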

Approximate confidence intervals are given for the lognormal regression problem. The error in the nominal level can be reduced to O(n^(−2)), where n is the sample size. An alternative procedure is given which avoids the non-robust assumption of lognormality. This amounts to finding a confidence interval based on M-estimates for a general smooth function of both the regression parameters and F, where the parameters are those of the general (possibly nonlinear) regression problem and F is the unknown distribution function of the residuals. The derived intervals are compared using theory, simulation and real data sets.

Abstract.  This article extends recent results [Scand. J. Statist. 28 (2001) 699] about exact non-parametric inferences based on order statistics with progressive type-II censoring. The extension lies in that non-parametric inferences are now covered where the dependence between involved order statistics cannot be circumvented. These inferences include: (a) tolerance intervals containing at least a specified proportion of the parent distribution, (b) prediction intervals containing at least a specified number of observations in a future sample, and (c) outer and/or inner confidence intervals for a quantile interval of the parent distribution. The inferences are valid for any parent distribution with continuous distribution function. The key result shows how the probability of an event involving k dependent order statistics that are observable/uncensored with progressive type-II censoring can be represented as a mixture with known weights of corresponding probabilities involving k dependent ordinary order statistics. Further applications/developments concerning exact Kolmogorov-type confidence regions are indicated.

Group testing is the process of combining individual samples and testing them as a group for the presence of an attribute. The use of such testing to estimate proportions is an important statistical tool in many applications. When samples are collected and tested in groups of different size, complications arise in the construction of exact confidence intervals. In this case, the numbers of positive groups have a multivariate distribution, and the difficulty stems from a lack of a natural ordering of the sample points. Exact two‐sided intervals such as the equal‐tail method based on maximum likelihood estimation, and those based on joint probability or likelihood ratio statistics, have been previously considered. In this paper several new estimators are developed and assessed. We show that the combined tails (or Blaker) method based on a suitable ordering statistic is the best choice in this setting. The methods are illustrated using a study involving the infection prevalence of Myxobolus cerebralis among free‐ranging fish.

Abstract. We study the coverage properties of Bayesian confidence intervals for the smooth component functions of generalized additive models (GAMs) represented using any penalized regression spline approach. The intervals are the usual generalization of the intervals first proposed by Wahba and Silverman in 1983 and 1985, respectively, to the GAM component context. We present simulation evidence showing these intervals have close to nominal ‘across‐the‐function’ frequentist coverage probabilities, except when the truth is close to a straight line/plane function. We extend the argument introduced by Nychka in 1988 for univariate smoothing splines to explain these results. The theoretical argument suggests that close to nominal coverage probabilities can be achieved, provided that heavy oversmoothing is avoided, so that the bias is not too large a proportion of the sampling variability. The theoretical results allow us to derive alternative intervals from a purely frequentist point of view, and to explain the impact that the neglect of smoothing parameter variability has on confidence interval performance. They also suggest switching the target of inference for component‐wise intervals away from smooth components in the space of the GAM identifiability constraints.

Consider a linear regression model with independent normally distributed errors. Suppose that the scalar parameter of interest is a specified linear combination of the components of the regression parameter vector. Also suppose that we have uncertain prior information that a parameter vector, consisting of specified distinct linear combinations of these components, takes a given value. Part of our evaluation of a frequentist confidence interval for the parameter of interest is the scaled expected length, defined to be the expected length of this confidence interval divided by the expected length of the standard confidence interval for this parameter, with the same confidence coefficient. We say that a confidence interval for the parameter of interest utilizes this uncertain prior information if (a) the scaled expected length of this interval is substantially less than one when the prior information is correct, (b) the maximum value of the scaled expected length is not too large and (c) this confidence interval reverts to the standard confidence interval, with the same confidence coefficient, when the data happen to strongly contradict the prior information. We present a new confidence interval for a scalar parameter of interest, with specified confidence coefficient, that utilizes this uncertain prior information. A factorial experiment with one replicate is used to illustrate the application of this new confidence interval.

It is shown how various exact non-parametric inferences based on order statistics in one or two random samples can be generalized to situations with progressive type-II censoring, which is a kind of evolutionary right censoring. Ordinary type-II right censoring is a special case of such progressive censoring. These inferences include confidence intervals for a given parent quantile, prediction intervals for a given order statistic of a future sample, and related two-sample inferences based on exceedance probabilities. The proposed inferences are valid for any parent distribution with continuous distribution function. The key result is that each observable uncensored order statistic that becomes available with progressive type-II censoring can be represented as a mixture with known weights of underlying ordinary order statistics. The importance of this mixture representation lies in that various properties of such observable order statistics can be deduced immediately from well-known properties of ordinary order statistics.
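
As background for the abstract above, the following sketch shows the well-known distribution-free confidence interval for a parent quantile in the ordinary, uncensored case. For a continuous parent distribution, the interval between the r-th and s-th order statistics of a sample of size n covers the p-quantile with probability equal to the sum of Binomial(n, p) probabilities at i = r, ..., s − 1. The function names are hypothetical, and the mixture weights needed for progressive type-II censoring are not given in the abstract, so that extension is not attempted here.

```python
from math import comb


def quantile_ci_coverage(n, r, s, p):
    """Exact coverage of [X_(r), X_(s)] for the p-quantile of a continuous
    parent distribution, based on a complete sample of size n (1 <= r < s <= n)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(r, s))


def median_ci_ranks(n, level=0.95):
    """Largest r such that [X_(r), X_(n+1-r)] covers the median with
    probability at least `level`; returns None if no such r exists."""
    best = None
    for r in range(1, n // 2 + 1):
        s = n + 1 - r
        if quantile_ci_coverage(n, r, s, 0.5) >= level:
            best = (r, s)  # coverage shrinks as r grows, so keep the last hit
    return best


if __name__ == "__main__":
    n = 10
    ranks = median_ci_ranks(n, 0.95)
    print(ranks, quantile_ci_coverage(n, *ranks, 0.5))
```

Because the coverage depends only on n, r, s and p, the interval is valid for any continuous parent distribution, which is the property the abstract carries over to the censored setting.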

Abstract. The focus of this article is on simultaneous confidence bands over a rectangular covariate region for a linear regression model with k > 1 covariates, for which only conservative or approximate confidence bands are available in the statistical literature stretching back to Working & Hotelling (J. Amer. Statist. Assoc. 24, 1929; 73–85). Formulas of simultaneous confidence levels of the hyperbolic and constant width bands are provided. These involve only a k‐dimensional integral; it is unlikely that the simultaneous confidence levels can be expressed as an integral of less than k‐dimension. These formulas allow the construction for the first time of exact hyperbolic and constant width confidence bands for at least a small k (> 1) by using numerical quadrature. Comparison between the hyperbolic and constant width bands is then addressed under both the average width and minimum volume confidence set criteria. It is observed that the constant width band can be drastically less efficient than the hyperbolic band when k > 1. Finally it is pointed out how the methods given in this article can be applied to more general regression models such as fixed‐effect or random‐effect generalized linear regression models.

For the two-sided Student t confidence interval for the mean of a normal distribution there is, for any sample size, a sufficiently large confidence level that ensures that the interval covers all the observations; there are also sufficiently small confidence levels guaranteeing, respectively, that (a) the interval does not cover all the observations and (b) the interval lies within the extreme observations. Necessary and sufficient conditions are also obtained for the width of the confidence interval to always exceed the sample range, as well as for the reverse inequality. Some implications of the results are discussed.

Some studies of the bootstrap have assessed the effect of smoothing the estimated distribution that is resampled, a process usually known as the smoothed bootstrap. Generally, the smoothed distribution for resampling is a kernel estimate and is often rescaled to retain certain characteristics of the empirical distribution. Typically the effect of such smoothing has been measured in terms of the mean-squared error of bootstrap point estimates. The reports of these previous investigations have not been encouraging about the efficacy of smoothing. In this paper the effect of resampling a kernel-smoothed distribution is evaluated through expansions for the coverage of bootstrap percentile confidence intervals. It is shown that, under the smooth function model, proper bandwidth selection can accomplish a first-order correction for the one-sided percentile method. With the objective of reducing the coverage error the appropriate bandwidth for one-sided intervals converges at a rate of n^(−1/4), rather than the familiar n^(−1/5) for kernel density estimation. Applications of this same approach to bootstrap t and two-sided intervals yield optimal bandwidths of order n^(−1/2). These bandwidths depend on moments of the smooth function model and not on derivatives of the underlying density of the data. The relationship of this smoothing method to both the accelerated bias correction and the bootstrap t methods provides some insight into the connections between three quite distinct approximate confidence intervals.
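
The resampling scheme described in the abstract above can be sketched as follows: draw from the data with replacement, add Gaussian kernel noise scaled by a bandwidth h, and read a one-sided percentile bound off the sorted bootstrap replicates. Everything specific here is an assumption made for illustration: the function name, the Gaussian kernel, the absence of any rescaling step, and the default h = s · n^(−1/4) with constant 1, which only mimics the stated rate and is not the moment-based optimal bandwidth derived in the paper.

```python
import random
from statistics import mean, stdev


def smoothed_bootstrap_upper_bound(data, stat=mean, level=0.95,
                                   bandwidth=None, n_boot=2000, seed=0):
    """One-sided upper percentile bound from a Gaussian-kernel smoothed bootstrap."""
    rng = random.Random(seed)
    n = len(data)
    # Placeholder bandwidth: follows the n^(-1/4) rate with an arbitrary constant.
    h = bandwidth if bandwidth is not None else stdev(data) * n ** -0.25
    replicates = []
    for _ in range(n_boot):
        # Resample with replacement, then perturb each draw with kernel noise.
        sample = [rng.choice(data) + h * rng.gauss(0.0, 1.0) for _ in range(n)]
        replicates.append(stat(sample))
    replicates.sort()
    return replicates[int(level * (n_boot - 1))]


if __name__ == "__main__":
    # Hypothetical data set used only to exercise the function.
    data = [float(v) for v in range(1, 21)]
    print(smoothed_bootstrap_upper_bound(data))
```

Setting `bandwidth=0.0` recovers the ordinary (unsmoothed) percentile bootstrap, which makes it easy to compare the two bounds on the same data.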

Setting confidence bounds or intervals for a parameter in a restricted parameter space is an important issue in applications and is widely discussed in the recent literature. In this article, we focus on the distributions in the exponential families, and propose general forms of the truncated Pratt interval and rp interval for the means. We take the Poisson distribution as an example to illustrate the method and compare it with the other existing intervals. Besides having theoretical merits, the proposed intervals are also shown to be competitive in simulation and real-data application studies.

It is well known that the Wilson procedure is superior to many existing procedures because it is less sensitive to p than the other procedures and is therefore less costly. The procedures proposed in this article work as well as the Wilson procedure when 0.1 ≤ p ≤ 0.9, and are even less sensitive (i.e., more robust) than the Wilson procedure when p is close to 0 or 1. Specifically, when the nominal coverage probability is 0.95, the Wilson procedure requires a sample size of 1,021 to guarantee that the coverage probabilities stay above 0.92 for any 0.001 ≤ min{p, 1 − p} < 0.01. By contrast, our procedures guarantee the same coverage probabilities but only need a sample size of 177, without increasing either the expected interval width or the standard deviation of the interval width.
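
The sensitivity of coverage to p discussed in the abstract above can be explored with a short sketch that computes the Wilson interval and its exact coverage probability by summing binomial probabilities over all outcomes whose interval contains p. The function names are hypothetical, the binomial probabilities are evaluated on the log scale to stay stable for large n, and the procedures proposed in the article are not reproduced because the abstract does not define them.

```python
from math import exp, lgamma, log, sqrt
from statistics import NormalDist


def wilson_ci(x, n, level=0.95):
    """Wilson score interval for a binomial proportion."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    p_hat = x / n
    denom = 1 + z * z / n
    centre = (p_hat + z * z / (2 * n)) / denom
    half = z * sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


def binom_pmf(x, n, p):
    """Binomial probability computed on the log scale (requires 0 < p < 1)."""
    log_pmf = (lgamma(n + 1) - lgamma(x + 1) - lgamma(n - x + 1)
               + x * log(p) + (n - x) * log(1 - p))
    return exp(log_pmf)


def exact_coverage(n, p, level=0.95):
    """Probability that the Wilson interval from X ~ Binomial(n, p) contains p."""
    total = 0.0
    for x in range(n + 1):
        lo, hi = wilson_ci(x, n, level)
        if lo <= p <= hi:
            total += binom_pmf(x, n, p)
    return total


if __name__ == "__main__":
    # Scan a few small values of p at a fixed sample size.
    for p in (0.001, 0.005, 0.009):
        print(p, exact_coverage(177, p))
```

Scanning p over a fine grid for a given n gives the minimum coverage over that range, which is the quantity behind sample size requirements of the kind quoted in the abstract.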

Suppose that we have a nonparametric regression model Y = m(X) + ε with X ∈ R^p, where X is a random design variable and is observed completely, and Y is the response variable and some Y-values are missing at random. Based on the “complete” data sets for Y after nonparametric regression imputation and inverse probability weighted imputation, two estimators of the regression function m(x0) for fixed x0 ∈ R^p are proposed. Asymptotic normality of the two estimators is established, which is used to construct normal approximation-based confidence intervals for m(x0). We also construct an empirical likelihood (EL) statistic for m(x0) whose limiting distribution is chi-squared with one degree of freedom, which is used to construct an EL confidence interval for m(x0).

