Similar literature
 Found 20 similar articles; search took 31 ms
1.
In this article, we consider the problems of testing the goodness of fit of the parametric accelerated failure time model and the Cox proportional hazards model. We consider omnibus test statistics based on residuals. The statistical distributions of the Kolmogorov, Cramér–von Mises–Smirnov, and Anderson–Darling statistics are all investigated by means of Monte Carlo simulations. Type-I, Type-II, and independent random censoring situations are all considered in this study. A Monte Carlo power study has also been carried out for these tests to distinguish between various baseline models, which reveals that the Anderson–Darling test performs better than the others.

2.
The issue of estimating usual nutrient intake distributions and prevalence of inadequate nutrient intakes is of interest in nutrition studies. Box–Cox transformations coupled with the normal distribution are usually employed for modeling nutrient intake data. When the data present a highly asymmetric distribution or include outliers, this approach may lead to implausible estimates. Additionally, it does not allow interpretation of the parameters in terms of characteristics of the original data and requires back transformation of the transformed data to the original scale. This paper proposes an alternative approach for estimating usual nutrient intake distributions and prevalence of inadequate nutrient intakes through a Box–Cox t model with random intercept. The proposed model is flexible enough for modeling highly asymmetric data even when outliers are present. Unlike the usual approach, the proposed model does not require a transformation of the data. A simulation study suggests that the Box–Cox t model with random intercept estimates the usual intake distribution satisfactorily, and that it should be preferable to the usual approach, particularly in cases of highly asymmetric heavy-tailed data. In applications to data sets on intake of 19 micronutrients, the Box–Cox t models provided a better fit than their competitors in most of the cases.

3.
This article presents methods for testing covariate effect in the Cox proportional hazards model based on Kullback–Leibler divergence and Rényi's information measure. Rényi's measure is referred to as the information divergence of order γ (γ ≠ 1) between two distributions. In the limiting case γ → 1, Rényi's measure becomes Kullback–Leibler divergence. In our case, the distributions correspond to the baseline and one possibly due to a covariate effect. Our proposed statistics are simple transformations of the parameter vector in the Cox proportional hazards model, and are compared with the Wald, likelihood ratio and score tests that are widely used in practice. Finally, the methods are illustrated using two real-life data sets.
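
The limiting relation between the two divergences mentioned in this abstract can be checked numerically. The sketch below is only an illustration for discrete distributions with strictly positive probabilities; it is not the article's Cox-model statistic, and the function names `renyi_divergence` and `kl_divergence` are hypothetical. It uses the standard definition of the order-γ divergence, log(Σ p_i^γ q_i^(1−γ)) / (γ − 1).

```python
import math


def renyi_divergence(p, q, gamma):
    """Renyi divergence of order gamma (gamma != 1) between discrete p and q.

    Assumes p and q are the same length with strictly positive entries.
    """
    if gamma == 1:
        raise ValueError("gamma must differ from 1; use kl_divergence for the limit")
    s = sum(pi ** gamma * qi ** (1 - gamma) for pi, qi in zip(p, q))
    return math.log(s) / (gamma - 1)


def kl_divergence(p, q):
    """Kullback-Leibler divergence sum(p_i * log(p_i / q_i)) for discrete p and q."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


if __name__ == "__main__":
    p = [0.5, 0.5]
    q = [0.9, 0.1]
    # As gamma approaches 1, the Renyi divergence approaches the KL divergence.
    for gamma in (2.0, 1.1, 1.001):
        print(gamma, renyi_divergence(p, q, gamma))
    print("KL", kl_divergence(p, q))
```

Running the script prints the order-γ divergence for γ moving toward 1, followed by the Kullback–Leibler value it converges to.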

4.
The authors consider the problem of simultaneous transformation and variable selection for linear regression. They propose a fully Bayesian solution to the problem, which allows averaging over all models considered including transformations of the response and predictors. The authors use the Box‐Cox family of transformations to transform the response and each predictor. To deal with the change of scale induced by the transformations, the authors propose to focus on new quantities rather than the estimated regression coefficients. These quantities, referred to as generalized regression coefficients, have a similar interpretation to the usual regression coefficients on the original scale of the data, but do not depend on the transformations. This allows probabilistic statements about the size of the effect associated with each variable, on the original scale of the data. In addition to variable and transformation selection, there is also uncertainty involved in the identification of outliers in regression. Thus, the authors also propose a more robust model to account for such outliers based on a t‐distribution with unknown degrees of freedom. Parameter estimation is carried out using an efficient Markov chain Monte Carlo algorithm, which permits moves around the space of all possible models. Using three real data sets and a simulation study, the authors show that there is considerable uncertainty about variable selection, choice of transformation, and outlier identification, and that there is an advantage in dealing with all three simultaneously. The Canadian Journal of Statistics 37: 361–380; 2009 © 2009 Statistical Society of Canada

5.
Calculating the expected values of different types of random variables is a central topic in mathematical statistics. Targeted toward students and instructors in both introductory probability and statistics courses and graduate-level measure-theoretic probability courses, this pedagogical note casts light on a general expectation formula stated in terms of distribution and survival functions of random variables and discusses its educational merits. Often consigned to an end-of-chapter exercise in mathematical statistics textbooks with minimal discussion and presented under superfluous technical assumptions, this unconventional expectation formula provides an invaluable opportunity for students to appreciate the geometric meaning of expectations, which is overlooked in most undergraduate and graduate curricula, and serves as an efficient tool for the calculation of expected values that could be much more laborious by traditional means. For students’ benefit, this formula deserves a thorough in-class treatment in conjunction with the teaching of expectations. Besides clarifying some commonly held misconceptions and showing the pedagogical value of the expectation formula, this note offers guidance for instructors on teaching the formula taking the background of the target student group into account.
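
The abstract does not display the formula itself. Presumably it is the standard identity below, which expresses the expectation of a random variable X with distribution function F through the survival function 1 − F; the exponential example and the symbol λ are added here for illustration only.

```latex
% Expectation via distribution and survival functions (valid when E|X| is finite):
\[
  \mathbb{E}[X] \;=\; \int_{0}^{\infty} \bigl(1 - F(x)\bigr)\,dx \;-\; \int_{-\infty}^{0} F(x)\,dx .
\]
% Illustration: for X exponential with rate \lambda > 0, F(x) = 1 - e^{-\lambda x} for x >= 0
% and F(x) = 0 for x < 0, so the second integral vanishes and
\[
  \mathbb{E}[X] \;=\; \int_{0}^{\infty} e^{-\lambda x}\,dx \;=\; \frac{1}{\lambda}.
\]
```

Geometrically, the first integral is the area between the graph of F and the horizontal line at height 1 to the right of the origin, and the second is the area under F to the left of the origin.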

6.
The pretest–posttest design is widely used to investigate the effect of an experimental treatment in biomedical research. The treatment effect may be assessed using analysis of variance (ANOVA) or analysis of covariance (ANCOVA). The normality assumption for parametric ANOVA and ANCOVA may be violated due to outliers and skewness of data. Nonparametric methods, robust statistics, and data transformation may be used to address the nonnormality issue. However, there is no simultaneous comparison of the four statistical approaches in terms of empirical type I error probability and statistical power. We studied 13 ANOVA and ANCOVA models based on the parametric approach, the rank and normal score-based nonparametric approach, Huber M-estimation, and Box–Cox transformation, using normal data with and without outliers and lognormal data. We found that ANCOVA models preserve the nominal significance level better and are more powerful than their ANOVA counterparts when the dependent variable and covariate are correlated. Huber M-estimation is the most liberal method. Nonparametric ANCOVA, especially ANCOVA based on normal score transformation, preserves the nominal significance level, has good statistical power, and is robust to the data distribution.

7.
The Yule–Simpson paradox notes that an association between random variables can be reversed when averaged over a background variable. Cox and Wermuth introduced the concept of distribution dependence between two random variables X and Y, and gave two dependence conditions, each of which guarantees that reversal of qualitatively similar conditional dependences cannot occur after marginalizing over the background variable. Ma, Xie and Geng studied the uniform collapsibility of distribution dependence over a background variable W, under a stronger homogeneity condition. Collapsibility ensures that associations are the same for conditional and marginal models. In this article, we use the notion of average collapsibility, which requires only that the conditional effects average over the background variable to the corresponding marginal effect, and investigate its conditions for distribution dependence and for quantile regression coefficients.

8.
In this article scan statistics for detecting a local change in variance for two-dimensional normal data are discussed. When the precise size of the rectangular window, where a local change in variance has occurred, is unknown, multiple and variable window scan statistics are proposed. A simulation study is presented to evaluate the performance of the scan statistics investigated in this article via comparison of power. A method for estimating the rectangular region, where a change in variance has occurred, and the size of the change in variance is also discussed.

9.
This paper concerns maximum likelihood estimation for the semiparametric shared gamma frailty model; that is, the Cox proportional hazards model with the hazard function multiplied by a gamma random variable with mean 1 and variance θ. A hybrid ML-EM algorithm is applied to 26 400 simulated samples of 400 to 8000 observations with Weibull hazards. The hybrid algorithm is much faster than the standard EM algorithm, faster than standard direct maximum likelihood (ML, Newton–Raphson) for large samples, and gives almost identical results to the penalised likelihood method in S-PLUS 2000. When the true value θ0 of θ is zero, the estimates of θ are asymptotically distributed as a 50–50 mixture between a point mass at zero and a normal random variable on the positive axis. When θ0 > 0, the asymptotic distribution is normal. However, for small samples, simulations suggest that the estimates of θ are approximately distributed as an x–(100 − x)% mixture, 0 ≤ x ≤ 50, between a point mass at zero and a normal random variable on the positive axis even for θ0 > 0. In light of this, p-values and confidence intervals need to be adjusted accordingly. We indicate an approximate method for carrying out the adjustment.

10.
The need to establish the independence of the sample mean and the sample variance in sampling from a normal population arises early in a course in statistics, for the result is an essential ingredient in the derivation of the Student-t distribution for statistical inference. Often this need arises before the tools, notably multivariate methods, for a rigorous proof are available. Occasionally one will find attempts to derive this result using only bivariate assumptions. A recent article in this journal, as well as some current textbooks, offers such a proof. In all cases there are serious questions about the validity of the proofs.

11.
In this paper we consider properties of the logarithmic and Tukey's lambda-type transformations of random variables that follow beta or unit-gamma distributions. Beta distributions often arise as models for random proportions, and unit-gamma distributions, although not well-known, may serve the same purpose. The latter possess many properties similar to those of beta distributions. Some transformations of random variables that follow a beta distribution are considered by Johnson (1949) and Johnson and Kotz (1970, 1973). These are used to obtain a "new" random variable that potentially approximately follows a normal distribution, so that practical analyses become possible. We study normality-related properties of the above transformations. This is done for the first time for unit-gamma distributions. Under the logarithmic transformation the beta and unit-gamma distributions become, respectively, the logarithmic F and generalized logistic distributions. The distributions of the transformed beta and unit-gamma distributions after application of Tukey's lambda-type transformations cannot be derived easily; however, we obtain the first four moments and expressions for the skewness and kurtosis of the transformed variables. Values of skewness and kurtosis for a variety of different parameter values are calculated, and in consequence, the near (or not near) normality of the transformed variables is evaluated. Comments on the use of the various transformations are provided.

12.
In this article, we discuss some properties of Rényi entropy and Rényi information of order statistics. Some bounds for Rényi entropy of order statistics are obtained. Also, we relate Rényi entropy ordering of order statistics to Rényi entropy ordering and other well-known orderings of parent random variables. Then it is proved that the Rényi information between order statistics and the parent random variable is distribution free, and it is shown that, as expected, the distance is minimum for the median.

13.
In two-phase linear regression models, it is a standard assumption that the random errors of two phases have constant variances. However, this assumption is not necessarily appropriate. This paper is devoted to the tests for variance heterogeneity in these models. We initially discuss the simultaneous test for variance heterogeneity of two phases. When the simultaneous test shows that significant heteroscedasticity occurs in the whole model, we construct two individual tests to investigate whether or not both phases or one of them have/has significant heteroscedasticity. Several score statistics and their adjustments based on Cox and Reid [D. R. Cox and N. Reid, Parameter orthogonality and approximate conditional inference. J. Roy. Statist. Soc. Ser. B 49 (1987), pp. 1–39] are obtained and illustrated with Australian onion data. The simulated powers of test statistics are investigated through Monte Carlo methods.

14.
Heavy tail distributions can be generated by applying specific non-linear transformations to a Gaussian random variable. Within this work we introduce power kurtosis transformations which are essentially determined by their generator function. Examples are the H-transformation of Tukey (1960), the K-transformation of MacGillivray and Cannon (1997) and the J-transformation of Fischer and Klein (2004). Furthermore, we derive a general condition on the generator function which guarantees that the corresponding transformation is actually tail-increasing. In this case the exponent of the power kurtosis transformation can be interpreted as a kurtosis parameter. We also prove that the transformed distributions can be ordered with respect to the partial ordering of van Zwet (1964) for symmetric distributions.

15.
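A Comparison of Two Statistics Textbooks from China and the United States and Its Implications   Total citations: 5; self-citations: 0; citations by others: 5
Gong Fengqian (龚凤乾), Statistical Research (《统计研究》), 2008, 25(2): 101-108
Abstract: Through a detailed analysis and comparison of one Chinese and one American statistics textbook, this paper examines how the descriptive and inferential statistics parts of statistics textbooks for economics and management students should be arranged, how the applied parts should reflect the use of statistical software, and how the distinctive features of foreign-language textbooks can be put to effective use. On the basis of these comparisons, it puts forward eight suggestions, including a moderate increase in the length of statistics textbooks, which are of considerable reference value for the reform and development of statistics textbooks in China.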

16.
Combining patient-level data from clinical trials can connect rare phenomena with clinical endpoints, but statistical techniques applied to a single trial may become problematic when trials are pooled. Estimating the hazard of a binary variable unevenly distributed across trials showcases a common pooled database issue. We studied how an unevenly distributed binary variable can compromise the integrity of fixed and random effects Cox proportional hazards (cph) models. We compared fixed effect and random effects cph models on a set of simulated datasets inspired by a 17-trial pooled database of patients presenting with ST segment elevation myocardial infarction (STEMI) and non-STEMI undergoing percutaneous coronary intervention. An unevenly distributed covariate can bias hazard ratio estimates, inflate standard errors, raise type I error, and reduce power. While unevenness causes problems for all cph models, random effects suffer least. Compared to fixed effect models, random effects suffer lower bias and trade inflated type I errors for improved power. Contrasting hazard rates between trials prevent accurate estimates from both fixed and random effects models.

17.
A modification of the critical values of Simes’ test is suggested in this article when the underlying test statistics are multivariate normal with a common non-negative correlation, yielding a more powerful test than the original Simes’ test. A step-up multiple testing procedure with these modified critical values, which is shown to control false discovery rate (FDR), is presented as a modification of the traditional Benjamini–Hochberg (BH) procedure. Simulations were carried out to compare this modified BH procedure with the BH and other modified BH procedures in terms of false non-discovery rate (FNR), 1–FDR–FNR and average power. The present modified BH procedure is observed to perform well compared to others when the test statistics are highly correlated and most of the hypotheses are true.
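
For orientation, the traditional BH step-up procedure that this abstract takes as its baseline can be sketched as follows. This is the standard procedure with critical values k·q/m, not the modified critical values proposed in the article, which the abstract does not give; the function name `benjamini_hochberg` and the parameter `q` (target FDR level) are illustrative choices.

```python
from typing import List


def benjamini_hochberg(p_values: List[float], q: float = 0.05) -> List[bool]:
    """Standard BH step-up procedure: return a reject flag for each hypothesis."""
    m = len(p_values)
    # Indices of the p-values in ascending order.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Step-up rule: find the largest rank k (1-based) with p_(k) <= k * q / m.
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * q / m:
            k_max = rank
    # Reject every hypothesis whose p-value is among the k_max smallest.
    reject = [False] * m
    for idx in order[:k_max]:
        reject[idx] = True
    return reject


if __name__ == "__main__":
    # The second p-value exceeds its own critical value (0.025) but is still
    # rejected because the third one meets its critical value (0.0375).
    print(benjamini_hochberg([0.01, 0.03, 0.035, 0.5], q=0.05))
```

The example in the `__main__` block shows the step-up character of the rule: a hypothesis can be rejected even when its own p-value misses its critical value, provided a larger p-value further up the ordering meets its own.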

18.
Gnanadesikan (1977) illustrates the utility of the power transformations considered by Moore and Tukey (1954), Box and Cox (1964), and Andrews, Gnanadesikan, and Warner (1971). These transformations have been used to obtain and assess both the marginal and joint normality of the underlying distributions. This paper investigates the utility of this procedure in defining homoscedastic transformations in multivariate populations.
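
As a reminder of what the power transformations cited above look like, here is a minimal sketch of the univariate Box and Cox (1964) transformation for a single positive observation. It is the textbook one-parameter form, not the multivariate procedure studied in the paper, and the function name `box_cox` is an illustrative choice.

```python
import math


def box_cox(y: float, lam: float) -> float:
    """One-parameter Box-Cox transform of a positive value y.

    Returns (y**lam - 1) / lam for lam != 0 and log(y) for lam == 0,
    the latter being the limit of the former as lam tends to 0.
    """
    if y <= 0:
        raise ValueError("the Box-Cox transform requires y > 0")
    if lam == 0:
        return math.log(y)
    return (y ** lam - 1) / lam


if __name__ == "__main__":
    # lam = 1 only shifts the data, lam = 0.5 is a shifted and scaled square
    # root, and lam = 0 is the logarithm.
    for lam in (1.0, 0.5, 0.0):
        print(lam, [round(box_cox(y, lam), 4) for y in (1.0, 4.0, 9.0)])
```

In practice the exponent is usually estimated from the data, for example by maximum likelihood, rather than fixed in advance.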

19.
Statistical distributions generated from any J- or U-shaped random variables are cumbersome to derive, if not completely indefinable, and thus are unavailable analytically because of the singularities at the tails of the basic random variable. This paper presents a computational method for providing a numerical convolution derived from a basic U-shaped random variable composed of a continuous part mixed with (or contaminated by) a discrete part at the tails. The J-shaped sampling distribution case is implied as a special case. Though the computations are based on a background normal distribution, they can be generalized to any other distribution. Such distributions will open up an area of sampling distributions of mixed random variables that are not elaborately covered in textbooks dealing with the theory of distributions.

20.
Linear mixed models were developed to handle clustered data and have been a topic of increasing interest in statistics for the past 50 years. Generally, the normality (or symmetry) of the random effects is a common assumption in linear mixed models but it may, sometimes, be unrealistic, obscuring important features of among-subjects variation. In this article, we utilize skew-normal/independent distributions as a tool for robust modeling of linear mixed models under a Bayesian paradigm. The skew-normal/independent distributions are an attractive class of asymmetric heavy-tailed distributions that includes the skew-normal, skew-t, skew-slash and skew-contaminated normal distributions as special cases, providing an appealing robust alternative to the routine use of symmetric distributions in this type of model. The methods developed are illustrated using a real data set from the Framingham cholesterol study.
