首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Investigators and epidemiologists often use statistics based on the parameters of a multinomial distribution. Two main approaches have been developed to assess the inferences of these statistics. The first one uses asymptotic formulae which are valid for large sample sizes. The second one computes the exact distribution, which performs quite well for small samples. They present some limitations for sample sizes N neither large enough to satisfy the assumption of asymptotic normality nor small enough to allow us to generate the exact distribution. We analytically computed the 1/N corrections of the asymptotic distribution for any statistics based on a multinomial law. We applied these results to the kappa statistic in 2×2 and 3×3 tables. We also compared the coverage probability obtained with the asymptotic and the corrected distributions under various hypothetical configurations of sample size and theoretical proportions. With this method, the estimate of the mean and the variance were highly improved as well as the 2.5 and the 97.5 percentiles of the distribution, allowing us to go down to sample sizes around 20, for data sets not too asymmetrical. The order of the difference between the exact and the corrected values was 1/N2 for the mean and 1/N3 for the variance.  相似文献   

2.
A two sample test of likelihood ratio type is proposed, assuming normal distribution theory, for testing the hypothesis that two samples come from identical normal populations versus the alternative that the populations are normal but vary in mean value and variance with one population having a smaller mean and smaller variance than the other. The small sample and large sample distribution of the proposed statistic are derived assuming normality. Some computations are presented which show the speed of convergence of small sample critical values to their asymptotic counterparts. Comparisons of local power of the proposed test are made with several potential competing tests. Asymptotics for the test statistic are derived when underlying distributions are not necessarily normal.  相似文献   

3.
The generalized regression (greg) predictor for the finite population total of a real variable is often employed when values of an auxiliary variable are available. Several variance estimators for it do well in large samples though bearing no optimality properties. We find a variance estimator which, under a restrictive model, has an optimality property under ‘exact’ as well as ‘asymptotic’ analysis. But this involves model parameters. Under a further restriction on the model, two model-parameter-free variance estimators are derived sharing the same ‘asymptotic’ optimality. Numerical illustrations through simulation are presented to demonstrate marginal improvements in using them rather than their predecessors. Two of the latter, though not optimal, are simpler, intuitively appealing, compete well in large samples, generally applicable and should be persisted with in practice.  相似文献   

4.
We derive the ?1-limit of trimmed sums of order statistics from location-scale distributions satisfying certain assumptions. Based on this limit, an approximation to the asymptotic variance of a Best-Asymptotic-Normal (BAN) estimator for the location parameter is developed. Associated formulae are derived for four location-scale distributions commonly used in lifetime data analysis. The approximation is analyzed via the properties of the approximating function and by comparison to the exact values for a special case. Applications are illustrated by applying the approximation to comparing location parameters and to selecting the population with the largest location parameter, using censored samples from location-scale populations.  相似文献   

5.
We study robustness properties of two types of M-estimators of scale when both location and scale parameters are unknown: (i) the scale estimator arising from simultaneous M-estimation of location and scale; and (ii) its symmetrization about the sample median. The robustness criteria considered are maximal asymptotic bias and maximal asymptotic variance when the known symmetric unimodal error distribution is subject to unknown, possibly asymmetric, £-con-tamination. Influence functions and asymptotic variance functionals are derived, and computations of asymptotic biases and variances, under the normal distribution with ε-contamination at oo, are presented for the special subclass arising from Huber's Proposal 2 and its symmetrized version. Symmetrization is seen to reduce both asymptotic bias and variance. Some complementary theoretical results are obtained, and the tradeoff between asymptotic bias and variance is discussed.  相似文献   

6.
Formulae are given for the Fisher information loss on parameters for the mean and the variance when some values of a Gaussian process are not observed. The special case of a one-parameter first-order conditional process on a rectangular lattice is considered in detail, and formulae are compared with numerical results.  相似文献   

7.
In an epidemiological study the regression slope between a response and predictor variable is underestimated when the predictor variable is measured imprecisely. Repeat measurements of the predictor in individuals in a subset of the study or in a separate study can be used to estimate a multiplicative factor to correct for this 'regression dilution bias'. In applied statistics publications various methods have been used to estimate this correction factor. Here we compare six different estimation methods and explain how they fall into two categories, namely regression and correlation-based methods. We provide new asymptotic variance formulae for the optimal correction factors in each category, when these are estimated from the repeat measurements subset alone, and show analytically and by simulation that the correlation method of choice gives uniformly lower variance. The simulations also show that, when the correction factor is not much greater than 1, this correlation method gives a correction factor which is closer to the true value than that from the best regression method on up to 80% of occasions. We also provide a variance formula for a modified correlation method which uses the standard deviation of the predictor variable in the main study; this shows further improved performance provided that the correction factor is not too extreme. A confidence interval for a corrected regression slope in an epidemiological study should reflect the imprecision of both the uncorrected slope and the estimated correction factor. We provide formulae for this and show that, particularly when the correction factor is large and the size of the subset of repeat measures is small, the effect of allowing for imprecision in the estimated correction factor can be substantial.  相似文献   

8.
This paper addresses the problem of obtaining maximum likelihood estimates for the parameters of the Pearson Type I distribution (beta distribution with unknown end points and shape parameters). Since they do not seem to have appeared in the literature, the likelihood equations and the information matrix are derived. The regularity conditions which ensure asymptotic normality and efficiency are examined, and some apparent conflicts in the literature are noted. To ensure regularity, the shape parameters must be greater than two, giving an (assymmetrical) bell-shaped distribution with high contact in the tails. A numerical investigation was carried out to explore the bias and variance of the maximum likelihood estimates and their dependence on sample size. The numerical study indicated that only for large samples (n ≥ 1000) does the bias in the estimates become small and does the Cramér-Rao bound give a good approximation for their variance. The likelihood function has a global maximum which corresponds to parameter estimates that are inadmissable. Useful parameter estimates can be obtained at a local maximum, which is sometimes difficult to locate when the sample size is small.  相似文献   

9.
The authors propose a quasi‐likelihood approach analogous to two‐way analysis of variance for the estimation of the parameters of generalized linear mixed models with two components of dispersion. They discuss both the asymptotic and small‐sample behaviour of their estimators, and illustrate their use with salamander mating data.  相似文献   

10.
This paper presents a step-stress accelerated life test for two stress variables to obtain optimal hold times under a Type-I hybrid censoring scheme. An exponentially distributed life and a cumulative exposure model are assumed. The maximum-likelihood estimates are given, from which the asymptotic variance and the Fisher information matrix are obtained. The optimal test plan is determined for each combination of stress levels by minimizing the asymptotic variance of reliability estimate at a typical operating condition. Finally, simulation results are discussed to illustrate the proposed criteria. Simulation results show that the proposed optimum plan is robust, and the initial estimates have a small effect on optimal values.  相似文献   

11.
We develop and show applications of two new test statistics for deciding if one ARIMA model provides significantly better h-step-ahead forecasts than another, as measured by the difference of approximations to their asymptotic mean square forecast errors. The two statistics differ in the variance estimates used for normalization. Both variance estimates are consistent even when the models considered are incorrect. Our main variance estimate is further distinguished by accounting for parameter estimation, while the simpler variance estimate treats parameters as fixed. Their broad consistency properties offer improvements to what are known as tests of Diebold and Mariano (1995) type, which are tests that treat parameters as fixed and use variance estimates that are generally not consistent in our context. We show how these statistics can be calculated for any pair of ARIMA models with the same differencing operator.  相似文献   

12.
We consider two estimation schemes based on penalized quasilikelihood and quasi-pseudo-likelihood in Poisson mixed models. The asymptotic bias in regression coefficients and variance components estimated by penalized quasilikelihood (PQL) is studied for small values of the variance components. We show the PQL estimators of both regression coefficients and variance components in Poisson mixed models have a smaller order of bias compared to those for binomial data. Unbiased estimating equations based on quasi-pseudo-likelihood are proposed and are shown to yield consistent estimators under some regularity conditions. The finite sample performance of these two methods is compared through a simulation study.  相似文献   

13.
This paper deals with the estimation of "the fraction of variance expiained" in one-way classification. A comparative study of two estimators for model II (random effects) is made by computing approximately their biases and mean-square errors in the balanced case. A similar study is made for model I (fixed effects) where we study one estimator and give asymptotic formulae for its bias and mean-square error.  相似文献   

14.
We show that the jackknife technique fails badly when applied to the problem of estimating the variance of a sample quantile. When viewed as a point estimator, the jackknife estimator is known to be inconsistent. We show that the ratio of the jackknife variance estimate to the true variance has an asymptotic Weibull distribution with parameters 1 and 1/2. We also show that if the jackknife variance estimate is used to Studentize the sample quantile, the asymptotic distribution of the resulting Studentized statistic is markedly nonnormal, having infinite mean. This result is in stark contrast with that obtained in simpler problems, such as that of constructing confidence intervals for a mean, where the jackknife-Studentized statistic has an asymptotic standard normal distribution.  相似文献   

15.
In a polynomial regression with measurement errors in the covariate, the latter being supposed to be normally distributed, one has (at least) three ways to estimate the unknown regression parameters: one can apply ordinary least squares (OLS) to the model without regard to the measurement error or one can correct for the measurement error, either by correcting the estimating equation (ALS) or by correcting the mean and variance functions of the dependent variable, which is done by conditioning on the observable, error ridden, counter part of the covariate (SLS). While OLS is biased, the other two estimators are consistent. Their asymptotic covariance matrices and thus their relative efficiencies can be compared to each other, in particular for the case of a small measurement error variance. In this case, it appears that ALS and SLS become almost equally efficient, even when they differ noticeably from OLS.  相似文献   

16.
A disease prevalence can be estimated by classifying subjects according to whether they have the disease. When gold-standard tests are too expensive to be applied to all subjects, partially validated data can be obtained by double-sampling in which all individuals are classified by a fallible classifier, and some of individuals are validated by the gold-standard classifier. However, it could happen in practice that such infallible classifier does not available. In this article, we consider two models in which both classifiers are fallible and propose four asymptotic test procedures for comparing disease prevalence in two groups. Corresponding sample size formulae and validated ratio given the total sample sizes are also derived and evaluated. Simulation results show that (i) Score test performs well and the corresponding sample size formula is also accurate in terms of the empirical power and size in two models; (ii) the Wald test based on the variance estimator with parameters estimated under the null hypothesis outperforms the others even under small sample sizes in Model II, and the sample size estimated by this test is also accurate; (iii) the estimated validated ratios based on all tests are accurate. The malarial data are used to illustrate the proposed methodologies.  相似文献   

17.
Liang and Zeger (1986) introduced a class of estimating equations that gives consistent estimates of regression parameters and of their asymptotic variances in the class of generalized linear models for cluster correlated data. When the independent variables or covariates in such models are subject to measurement errors, the parameter estimates obtained from these estimating equations are no longer consistent. To correct for the effect of measurement errors, an estimator with smaller asymptotic bias is constructed along the lines of Stefanski (1985), assuming that the measurement error variance is either known or estimable. The asymptotic distribution of the bias-corrected estimator and a consistent estimator of its asymptotic variance are also given. The special case of a binary logistic regression model is studied in detail. For this case, methods based on conditional scores and quasilikelihood are also extended to cluster correlated data. Results of a small simulation study on the performance of the proposed estimators and associated tests of hypotheses are reported.  相似文献   

18.
Inequality-constrained regression models have received increased attention in longitudinal analysis during recent years. Regression parameters are usually obtained from iteration algorithms. An analytical formulae of the estimators cannot be provided. Therefore, the asymptotic behavior of estimators has not been fully clarified yet. This paper presents a TS estimation (TS for short) and the asymptotic distribution of the estimators. Simulations are conducted to compare constrained TS estimation, constrained ordinary least squares (OLS) estimation and TS estimation in terms of sample bias, sample mean-square error (MSE) and sample variance of the estimators.  相似文献   

19.
Estimation in Semiparametric Marginal Shared Gamma Frailty Models   总被引:1,自引:0,他引:1  
The semiparametric marginal shared frailty models in survival analysis have the non–parametric hazard functions multiplied by a random frailty in each cluster, and the survival times conditional on frailties are assumed to be independent. In addition, the marginal hazard functions have the same form as in the usual Cox proportional hazard models. In this paper, an approach based on maximum likelihood and expectation–maximization is applied to semiparametric marginal shared gamma frailty models, where the frailties are assumed to be gamma distributed with mean 1 and variance θ. The estimates of the fixed–effect parameters and their standard errors obtained using this approach are compared in terms of both bias and efficiency with those obtained using the extended marginal approach. Similarly, the standard errors of our frailty variance estimates are found to compare favourably with those obtained using other methods. The asymptotic distribution of the frailty variance estimates is shown to be a 50–50 mixture of a point mass at zero and a truncated normal random variable on the positive axis for θ0 = 0. Simulations demonstrate that, for θ0 < 0, it is approximately an x −(100 − x )%, 0 ≤ x ≤ 50, mixture between a point mass at zero and a truncated normal random variable on the positive axis for small samples and small values of θ0; otherwise, it is approximately normal.  相似文献   

20.
This article considers a class of estimators for the location and scale parameters in the location-scale model based on ‘synthetic data’ when the observations are randomly censored on the right. The asymptotic normality of the estimators is established using counting process and martingale techniques when the censoring distribution is known and unknown, respectively. In the case when the censoring distribution is known, we show that the asymptotic variances of this class of estimators depend on the data transformation and have a lower bound which is not achievable by this class of estimators. However, in the case that the censoring distribution is unknown and estimated by the Kaplan–Meier estimator, this class of estimators has the same asymptotic variance and attains the lower bound for variance for the case of known censoring distribution. This is different from censored regression analysis, where asymptotic variances depend on the data transformation. Our method has three valuable advantages over the method of maximum likelihood estimation. First, our estimators are available in a closed form and do not require an iterative algorithm. Second, simulation studies show that our estimators being moment-based are comparable to maximum likelihood estimators and outperform them when sample size is small and censoring rate is high. Third, our estimators are more robust to model misspecification than maximum likelihood estimators. Therefore, our method can serve as a competitive alternative to the method of maximum likelihood in estimation for location-scale models with censored data. A numerical example is presented to illustrate the proposed method.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号