首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Abstract.  Wang & Wells [ J. Amer. Statist. Assoc. 95 (2000) 62] describe a non-parametric approach for checking whether the dependence structure of a random sample of censored bivariate data is appropriately modelled by a given family of Archimedean copulas. Their procedure is based on a truncated version of the Kendall process introduced by Genest & Rivest [ J. Amer. Statist. Assoc. 88 (1993) 1034] and later studied by Barbe et al . [ J. Multivariate Anal. 58 (1996) 197]. Although Wang & Wells (2000) determine the asymptotic behaviour of their truncated process, their model selection method is based exclusively on the observed value of its L 2-norm. This paper shows how to compute asymptotic p -values for various goodness-of-fit test statistics based on a non-truncated version of Kendall's process. Conditions for weak convergence are met in the most common copula models, whether Archimedean or not. The empirical behaviour of the proposed goodness-of-fit tests is studied by simulation, and power comparisons are made with a test proposed by Shih [ Biometrika 85 (1998) 189] for the gamma frailty family.  相似文献   

2.
Although the asymptotic distributions of the likelihood ratio for testing hypotheses of null variance components in linear mixed models derived by Stram and Lee [1994. Variance components testing in longitudinal mixed effects model. Biometrics 50, 1171–1177] are valid, their proof is based on the work of Self and Liang [1987. Asymptotic properties of maximum likelihood estimators and likelihood tests under nonstandard conditions. J. Amer. Statist. Assoc. 82, 605–610] which requires identically distributed random variables, an assumption not always valid in longitudinal data problems. We use the less restrictive results of Vu and Zhou [1997. Generalization of likelihood ratio tests under nonstandard conditions. Ann. Statist. 25, 897–916] to prove that the proposed mixture of chi-squared distributions is the actual asymptotic distribution of such likelihood ratios used as test statistics for null variance components in models with one or two random effects. We also consider a limited simulation study to evaluate the appropriateness of the asymptotic distribution of such likelihood ratios in moderately sized samples.  相似文献   

3.
Biao Zhang 《Statistics》2016,50(5):1173-1194
Missing covariate data occurs often in regression analysis. We study methods for estimating the regression coefficients in an assumed conditional mean function when some covariates are completely observed but other covariates are missing for some subjects. We adopt the semiparametric perspective of Robins et al. [Estimation of regression coefficients when some regressors are not always observed. J Amer Statist Assoc. 1994;89:846–866] on regression analyses with missing covariates, in which they pioneered the use of two working models, the working propensity score model and the working conditional score model. A recent approach to missing covariate data analysis is the empirical likelihood method of Qin et al. [Empirical likelihood in missing data problems. J Amer Statist Assoc. 2009;104:1492–1503], which effectively combines unbiased estimating equations. In this paper, we consider an alternative likelihood approach based on the full likelihood of the observed data. This full likelihood-based method enables us to generate estimators for the vector of the regression coefficients that are (a) asymptotically equivalent to those of Qin et al. [Empirical likelihood in missing data problems. J Amer Statist Assoc. 2009;104:1492–1503] when the working propensity score model is correctly specified, and (b) doubly robust, like the augmented inverse probability weighting (AIPW) estimators of Robins et al. [Estimation of regression coefficients when some regressors are not always observed. J Am Statist Assoc. 1994;89:846–866]. Thus, the proposed full likelihood-based estimators improve on the efficiency of the AIPW estimators when the working propensity score model is correct but the working conditional score model is possibly incorrect, and also improve on the empirical likelihood estimators of Qin, Zhang and Leung [Empirical likelihood in missing data problems. J Amer Statist Assoc. 2009;104:1492–1503] when the reverse is true, that is, the working conditional score model is correct but the working propensity score model is possibly incorrect. In addition, we consider a regression method for estimation of the regression coefficients when the working conditional score model is correctly specified; the asymptotic variance of the resulting estimator is no greater than the semiparametric variance bound characterized by the theory of Robins et al. [Estimation of regression coefficients when some regressors are not always observed. J Amer Statist Assoc. 1994;89:846–866]. Finally, we compare the finite-sample performance of various estimators in a simulation study.  相似文献   

4.
Sarjinder Singh 《Statistics》2013,47(3):566-574
In this note, a dual problem to the calibration of design weights of the Deville and Särndal [Calibration estimators in survey sampling, J. Amer. Statist. Assoc. 87 (1992), pp. 376–382] method has been considered. We conclude that the chi-squared distance between the design weights and the calibrated weights equals the square of the standardized Z-score obtained by the difference between the known population total of the auxiliary variable and its corresponding Horvitz and Thompson [A generalization of sampling without replacement from a finite universe, J. Amer. Statist. Assoc. 47 (1952), pp. 663–685] estimator divided by the sample standard deviation of the auxiliary variable to obtain the linear regression estimator in survey sampling.  相似文献   

5.
In this paper we propose test statistics for a general hypothesis concerning the adequacy of multivariate random-effects covariance structures in a multivariate growth curve model with differing numbers of random effects (Lange, N., N.M. Laird, J. Amer. Statist. Assoc. 84 (1989) 241–247). Since the exact likelihood ratio (LR) statistic for the hypothesis is complicated, it is suggested to use a modified LR statistic. An asymptotic expansion of the null distribution of the statistic is obtained. The exact LR statistic is also discussed.  相似文献   

6.
Bayesian Semiparametric Regression for Median Residual Life   总被引:3,自引:0,他引:3  
Abstract.  With survival data there is often interest not only in the survival time distribution but also in the residual survival time distribution. In fact, regression models to explain residual survival time might be desired. Building upon recent work of Kottas & Gelfand [ J. Amer. Statist. Assoc. 96 (2001) 1458], we formulate a semiparametric median residual life regression model induced by a semiparametric accelerated failure time regression model. We utilize a Bayesian approach which allows full and exact inference. Classical work essentially ignores covariates and is either based upon parametric assumptions or is limited to asymptotic inference in non-parametric settings. No regression modelling of median residual life appears to exist. The Bayesian modelling is developed through Dirichlet process mixing. The models are fitted using Gibbs sampling. Residual life inference is implemented extending the approach of Gelfand & Kottas [ J. Comput. Graph. Statist. 11 (2002) 289]. Finally, we present a fairly detailed analysis of a set of survival times with moderate censoring for patients with small cell lung cancer.  相似文献   

7.
This note discusses an extension to the score test statistics for overdispersion in Poisson and binomial regression models [Dean, C.B., 1992. Testing for overdispersion in Poisson and binomial regression models. J. Amer. Statist. Assoc. 87, 451–457]. Examples illustrate the application of the extended results.  相似文献   

8.
The Pareto distribution is found in a large number of real world situations and is also a well-known model for extreme events. In the spirit of Neyman [1937. Smooth tests for goodness of fit. Skand. Aktuarietidskr. 20, 149–199] and Thomas and Pierce [1979. Neyman's smooth goodness-of-fit test when the hypothesis is composite. J. Amer. Statist. Assoc. 74, 441–445], we propose a smooth goodness of fit test for the Pareto distribution family which is motivated by LeCam's theory of local asymptotic normality (LAN). We establish the behavior of the associated test statistic firstly under the null hypothesis that the sample follows a Pareto distribution and secondly under local alternatives using the LAN framework. Finally, simulations are provided in order to study the finite sample behavior of the test statistic.  相似文献   

9.
In this paper, we describe an overall strategy for robust estimation of multivariate location and shape, and the consequent identification of outliers and leverage points. Parts of this strategy have been described in a series of previous papers (Rocke, Ann. Statist., in press; Rocke and Woodruff, Statist. Neerlandica 47 (1993), 27–42, J. Amer. Statist. Assoc., in press; Woodruff and Rocke, J. Comput. Graphical Statist. 2 (1993), 69–95; J. Amer. Statist. Assoc. 89 (1994), 888–896) but the overall structure is presented here for the first time. After describing the first-level architecture of a class of algorithms for this problem, we review available information about possible tactics for each major step in the process. The major steps that we have found to be necessary are as follows: (1) partition the data into groups of perhaps five times the dimension; (2) for each group, search for the best available solution to a combinatorial estimator such as the Minimum Covariance Determinant (MCD) — these are the preliminary estimates; (3) for each preliminary estimate, iterate to the solution of a smooth estimator chosen for robustness and outlier resistance; and (4) choose among the final iterates based on a robust criterion, such as minimum volume. Use of this algorithm architecture can enable reliable, fast, robust estimation of heavily contaminated multivariate data in high (> 20) dimension even with large quantities of data. A computer program implementing the algorithm is available from the authors.  相似文献   

10.
Khuri (Technometrics 27 (1985) 213) and Levy and Neill (Comm. Statist. A 19 (1990) 1987) presented regression lack of fit tests for multiresponse data with replicated observations available at points in the experimental region, thereby extending the classical univariate lack of fit test given by Fisher (J. Roy. Statist. Soc. 85 (1922) 597). In this paper, multivariate tests for lack of fit in a linear multiresponse model are derived for the common circumstance in which replicated observations are not obtained. The tests are based on the union–intersection principle, and provide multiresponse extensions of the univariate tests for between- and within-cluster lack of fit introduced by Christensen (Ann. of Statist. 17 (1989) 673; J. Amer. Statist. Assoc. 86 (1991) 752). Since the properties of these tests depend on the choice of multivariate clusters of the observations, a multiresponse generalization of the maximin power clustering criterion given by Miller, Neill and Sherfey (Ann. of Statist. 26 (1998) 1411; J. Amer. Statist. Assoc. 94 (1999) 610) is also developed.  相似文献   

11.
We consider a certain (simplified) version of the density bounded class; see Lavine (J. Amer. Statist. Assoc. 86 (1991) 400–403). We find that this class has certain advantages over many other similar classes in terms of elicitation, ease of calculation and other characteristics of interest. Bounds on posterior expectations are considered. In particular, we show how certain bounds may be found in the (multidimensional) normal linear model problem.  相似文献   

12.
This work considers probability models for partitions of a set of n elements using a predictive approach, i.e., models that are specified in terms of the conditional probability of either joining an already existing cluster or forming a new one. The inherent structure can be motivated by resorting to hierarchical models of either parametric or nonparametric nature. Parametric examples include the product partition models (PPMs) and the model-based approach of Dasgupta and Raftery (J. Amer. Statist. Assoc. 93 (1998) 294), while nonparametric alternatives include the Dirichlet process, and more generally, the species sampling models (SSMs). Under exchangeability, PPMs and SSMs induce the same type of partition structure. The methods are discussed in the context of outlier detection in normal linear regression models and of (univariate) density estimation.  相似文献   

13.
Semiparametric transformation models provide flexible regression models for survival analysis, including the Cox proportional hazards and the proportional odds models as special cases. We consider the application of semiparametric transformation models in case-cohort studies, where the covariate data are observed only on cases and on a subcohort randomly sampled from the full cohort. We first propose an approximate profile likelihood approach with full-cohort data, which amounts to the pseudo-partial likelihood approach of Zucker [2005. A pseudo-partial likelihood method for semiparametric survival regression with covariate errors. J. Amer. Statist. Assoc. 100, 1264–1277]. Simulation results show that our proposal is almost as efficient as the nonparametric maximum likelihood estimator. We then extend this approach to the case-cohort design, applying the Horvitz–Thompson weighting method to the estimating equations from the approximated profile likelihood. Two levels of weights can be utilized to achieve unbiasedness and to gain efficiency. The resulting estimator has a closed-form asymptotic covariance matrix, and is found in simulations to be substantially more efficient than the estimator based on martingale estimating equations. The extension to left-truncated data will be discussed. We illustrate the proposed method on data from a cardiovascular risk factor study conducted in Taiwan.  相似文献   

14.
The local polynomial quasi-likelihood estimation has several good statistical properties such as high minimax efficiency and adaptation of edge effects. In this paper, we construct a local quasi-likelihood regression estimator for a left truncated model, and establish the asymptotic normality of the proposed estimator when the observations form a stationary and α-mixing sequence, such that the corresponding result of Fan et al. [Local polynomial kernel regression for generalized linear models and quasilikelihood functions, J. Amer. Statist. Assoc. 90 (1995), pp. 141–150] is extended from the independent and complete data to the dependent and truncated one. Finite sample behaviour of the estimator is investigated via simulations too.  相似文献   

15.
This paper considers the problem of testing equality between two independent binomial proportions. Hwang and Yang (Statist. Sinica 11 (2001) 807) apply the Neyman–Pearson fundamental lemma and the estimated truth approach to derive optimal procedures, named expected p-values. This p-value has been shown to be identical to the mid p-value in Lancaster (J. Amer. Statist. Assoc. (1961) 223) for the one-sided test. For the two-sided test, the paper proves the usual two-sided mid p-value is identical to the expected p-value in the balanced sample case.  相似文献   

16.
A stratified Warner''s randomized response model   总被引:2,自引:0,他引:2  
This paper proposes a new stratified randomized response model based on Warner's (J. Amer. Statist. Assoc. 60 (1965) 63) model that has an optimal allocation and large gain in precision. It also presents a drawback of the Hong et al. (Korean J. Appl. Statist. 7 (1994) 141) model under their proportional sampling assumption. It is shown that the proposed model is more efficient than the Hong et al. (Korean J. Appl. Statist. 7 (1994) 141) stratified randomized response model. Additionally, it is shown that the estimator based on the proposed method is more efficient than the Warner (J. Amer. Statist. Assoc. 60 (1965) 63), the Mangat and Singh (Biometrika 77 (1990) 439) and the Mangat (J. Roy. Statist. SQC. Ser. B 56 (1) (1994) 93) estimators under the conditions presented in both the case of completely truthful reporting and that of not completely truthful reporting by the respondents.  相似文献   

17.
Responses in a one-factor experiment with A; ordered treatments follow an umbrella, ordering if they consist of two piecewise monotone segments, i.e. increasing and then decreasing, or the converse. This paper proposes a non-parametric distribution-free confidence procedure for umbrella orderings, the aim being to identify the treatments that correspond to the optimal effects. It uses a method that joins the seemingly unrelated theories of U-statistics and isotonic regression. A random confidence subset of the ordered treatments is constructed, such that it contains all the unknown peaks (optimal treatments) of an umbrella ordering with any prespecified confidence level. The paper demonstrates that the proposed confidence procedure is nonparametric distribution-free and, further, that the proposed procedure naturally implies a test for umbrella alternatives. Since the proposed confidence procedure is always more informative than tests for umbrella alternatives, it should be used in their place in practice. An example illustrates the proposed procedure.  相似文献   

18.
Motivated by Chaudhuri's work [1996. On a geometric notion of quantiles for multivariate data. J. Amer. Statist. Assoc. 91, 862–872] on unconditional geometric quantiles, we explore the asymptotic properties of sample geometric conditional quantiles, defined through kernel functions, in high-dimensional spaces. We establish a Bahadur-type linear representation for the geometric conditional quantile estimator and obtain the convergence rate for the corresponding remainder term. From this, asymptotic normality including bias on the estimated geometric conditional quantile is derived. Based on these results, we propose confidence ellipsoids for multivariate conditional quantiles. The methodology is illustrated via data analysis and a Monte Carlo study.  相似文献   

19.
For measuring the accuracy of a continuous diagnostic test, the receiver operating characteristic (ROC) curve is often used. The empirical ROC curve is the most commonly used non-parametric estimator for the ROC curve. Recently, Lloyd (J. Amer. Statist. Assoc. 93(1998) 1356) proposed a kernel smoothing estimator for the ROC curve and showed his estimator has better mean square error than the empirical ROC curve estimator. However, Lloyd's estimator involves two bandwidths and has a boundary problem. In addition, his choice of bandwidths is ad hoc. In this paper we propose another kernel smoothing estimator which involves only one bandwidth and does not have the boundary problem. Furthermore, our choice of the bandwidth is asymptotically optimal.  相似文献   

20.
The generalized Kronecker sum was used by Wang and Wu (J. Amer. Statist. Assoc. 86 (1991) 450) and Dey and Midha (Statist. Probab. Lett. 28 (1996) 211; Proc. AP Akad. Sci. 5 (2001) 39) to construct mixed orthogonal arrays. We modify their methods to obtain several families of mixed orthogonal arrays. Some new arrays with run size less than 100 are found.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号