首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Testing of hypotheses under balanced ANOVA models is fairly simple and generally based on the usual ANOVA sums of squares. Difficulties may arise in special cases when these sums of squares do not form a complete sufficient statistic. There is a huge literature on this subject which was recently surveyed in Seifert's contribution to the book of Mumak (1904). But there are only a few results about unbalanced models. In such models the consideration of likelihood ratios leads to more complex sums of squares known from MINQUE theory.

Uniform optimality of testsusually reduces to local optimality. Here we prespnt a small review of methods proposed for testing of hypotheses in unbalanced models. where MINQUEI playb a major role. We discuss the use of iterated MINQUE for the construction of asymptotically optimal tests described in Humak (1984) and approximate tests based on locally uncorrelated linear combinations of MINQUE estimators by Seifert (1985), We show that the latter tests coincide with robust locally optimal invariant tests proposeci by Kariya and Sinha and Das and Sinha, if the number of variance components is two. Explicit expressions for corresponding tests are given for the unbalanced two-way cross classification random model, which covers some other models as special cases. A simulation study under lines the relevance of MINQUE for testing of hypotheses problems.  相似文献   

2.
Lynn R. LaMotte 《Statistics》2018,52(1):228-238
The between-within split of total sum of squares in one-way analysis of variance (ANOVA) is intuitively appealing and computationally simple, whether balanced or not. In the balanced two-factor setting, the same heuristic and computations apply to analyse treatment sum of squares into main effects and interaction effects sums of squares. Accomplishing the same in unbalanced settings is more difficult, requiring development of tests of general linear hypotheses. However, textbooks treat unbalanced settings with proportional subclasss numbers (psn) as essentially equivalent to balanced settings. It is shown here that, while psn permit an ANOVA-like partition of sums of squares, test statistics for main effects of the two factors generally test the wrong hypotheses when the model includes interaction effects.  相似文献   

3.
Abstract

Type III methods were introduced by SAS to address difficulties in dummy-variable models for effects of multiple factors and covariates. They are widely used in practice; they are the default method in several statistical computing packages. Type III sums of squares (SSs) are defined by a set of instructions; an explicit mathematical formulation does not seem to exist.

An explicit formulation is derived in this paper. It is used to illustrate Type III SSs and to establish their properties in the two-factor ANOVA model.  相似文献   

4.
Methods for analysing unbalanced factorial designs can be traced back to the work of Frank Yates in the 1930s . Yet, still today the question on how his methods of fitting constants (Type II) and weighted squares of means (Type III) behave when negligible or insignificant interactions exist, is still unanswered. In this paper, by means of a simulation study, Type II and Type III ANOVA results are examined for all unbalanced structures originating from a 2x3 balanced factorial design within homogeneous groups (design types) accounting for structure, number of observations lost and which cells contained the missing observations. The two level factor is further analysed to test the null hypothesis, for both Type II and Type III analyses, that the unbalanced structures within each design type provide comparable F values. These results are summarised and the conclusion shows that this work agrees with statements made by Yates Burdick and Herr and Shaw and Mitchell-Olds, but there are some results which require further investigation.  相似文献   

5.
Definition of effects and calculation of sums of squares for various tests of hypotheses in unbalanced analysis of variance has been a topic of considerable interest for at least 10 years. Conceptually, these concerns apply to balanced cases as well. It is suggested that proceeding logically from highest-order effects to lowest-order effects in a careful fashion helps to resolve the difficulties pointed out by various writers, including those concerned about completely missing cells.  相似文献   

6.
Methods for analyzing unbalanced factorial designs can be traced back to Yates (1934). Today, most major statistical programs perform, by default, unbalanced ANOVA based on Type III sums of squares (Yates's weighted squares of means). As criticized by Nelder and Lane (1995), this analysis is founded on unrealistic models—models with interactions, but without all corresponding main effects. The Type II analysis (Yates's method of fitting constants) is usually not preferred because of the underlying assumption of no interactions. This argument is, however, also founded on unrealistic models. Furthermore, by considering the power of the two methods, it is clear that Type II is preferable.  相似文献   

7.
The well-known method of unweighted sums of squares (USSs) is examined using, as an example, a random two-way classification model with interaction. In particular, a better motivation is given to the association between the harmonic mean of the cell frequencies and the USSs. Furthermore, a procedure is developed for determining the adequacy of the USSs as approximate balanced analysis of variance (ANOVA) sums of squares. This procedure is easy to apply and provides a better insight into the effects of design and model’s variance components on such an approximation. The proposed methodology can be extended to higher-order models and other types of sums of squares.  相似文献   

8.
Much research has been conducted to develop confidence Intervals on linear combinations and ratios of variance components in balanced and unbalanced random models.This paper first presents confidence intervals on functions of variance components in balanced designs.These results assume that classical analysis of variance sums of squares are independent and have exact scaled chi-squared distributions.In unbalanced designs, either one or both of these assumptions are violated, and modifications to the balanced model intervals are required.We report results of some recent work that examines various modifications for some particular unbalanced designs.  相似文献   

9.
ABSTRACT

Very often researchers plan a balanced design for cluster randomization clinical trials in conducting medical research, but unavoidable circumstances lead to unbalanced data. By adopting three or more levels of nested designs, they usually ignore the higher level of nesting and consider only two levels, this situation leads to underestimation of variance at higher levels. While calculating the sample size for three-level nested designs, in order to achieve desired power, intra-class correlation coefficients (ICCs) at individual level as well as higher levels need to be considered and must be provided along with respective standard errors. In the present paper, the standard errors of analysis of variance (ANOVA) estimates of ICCs for three-level unbalanced nested design are derived. To conquer the strong appeal of distributional assumptions, balanced design, equality of variances between clusters and large sample, general expressions for standard errors of ICCs which can be deployed in unbalanced cluster randomization trials are postulated. The expressions are evaluated on real data as well as highly unbalanced simulated data.  相似文献   

10.
ABSTRACT

The purposes of this paper are to abstract from a number of articles variance component estimation procedures which can be used for completely random balanced incomplete block designs, to develop an iterated least squares (ITLS) computing algorithm for calculating maximum likelihood estimates, and to compare these procedures by use of simulated experiments. Based on the simulated experiments, the estimated mean square errors of the ITLS estimates are generally less than*those for previously proposed analysis of variance and symmetric sums estimators.  相似文献   

11.
In the complete balanced model for the analysis of variance, the equivalence of sums of squares and quadratic forms is seen to imply well-fitting patterns involving Kronecker products of identity matrices and scalar multiples of matrices with all elements equal to 1. The questions of symmetry, idempotency, and orthogonality so central to this topic are answered by simple multiplications; ranks are determined from simple traces. The associations between the forms of the two-factor model are presented here in a way that is accessible to first-year students and makes generalizations to higher order models transparent. The lack of patterns in incomplete or unbalanced models is noted. Additional steps in design and analysis are suggested in the references.  相似文献   

12.
The intra-cluster correlation is insisted on nested error regression model that, in practice, is rarely known. This article demonstrates the size in generalized least squares (GLS) F-test using Fuller–Battese transformation and modification F-test. For the balanced case, the former using strictly positive, analysis of covariance (ANCOVA) and analysis of variance (ANOVA) estimators of intra-cluster correlation can control the size for moderate intra-cluster correlations. For small intra-cluster correlation, they perform well when the numbers of cluster are large. The latter using the ANOVA estimator performs well except for small numbers of cluster. When intra-cluster correlation is large, it cannot control the size. For the unbalanced case, the GLS F-test using the Fuller–Battese transformation and the modification F-test using the strictly positive, the ANCOVA and the ANOVA estimators maintain the significance level for small total sample size and small intra-cluster correlations when there is a large variation in cluster sizes, but they perform well in controlling the size for large total sample size and small different variation in cluster sizes. Besides, Henderson’s method 3 estimator maintains the significance level for a few situations.  相似文献   

13.

Variance components in factorial designs with balanced data are commonly estimated by equating mean squares to expected mean squares. For unbalanced data, the usual extensions of this approach are the Henderson methods, which require formulas that are rather involved. Alternatively, maximum likelihood estimation based on normality has been proposed. Although the algorithm for maximum likelihood is computationally complex, programs exist in some statistical packages. This article introduces a simpler method, that of creating a balanced data set by resampling from the original one. Revised formulas for expected mean squares are presented for the two-way case; they are easily generalized to larger factorial designs. The results of a number of simulation studies indicate that, in certain types of designs, the proposed method has performance advantages over both the Henderson Method I and maximum likelihood estimators.  相似文献   

14.
This paper is an overview of a unified framework for analyzing designed experiments with univariate or multivariate responses. Both categorical and continuous design variables are considered. To handle unbalanced data, we introduce the so-called Type II* sums of squares. This means that the results are independent of the scale chosen for continuous design variables. Furthermore, it does not matter whether two-level variables are coded as categorical or continuous. Overall testing of all responses is done by 50-50 MANOVA, which handles several highly correlated responses. Univariate p-values for each response are adjusted by using rotation testing. To illustrate multivariate effects, mean values and mean predictions are illustrated in a principal component score plot or directly as curves. For the unbalanced cases, we introduce a new variant of adjusted means, which are independent to the coding of two-level variables. The methodology is exemplified by case studies from cheese and fish pudding production.  相似文献   

15.
Min Wang  Xiaoqian Sun 《Statistics》2013,47(5):1104-1115
In practical situations, most experimental designs often yield unbalanced data which have different numbers of observations per unit because of cost constraints, missing data, etc. In this paper, we consider the Bayesian approach to hypothesis testing or model selection under the one-way unbalanced fixed-effects analysis-of-variance (ANOVA) model. We adopt Zellner's g-prior with the beta-prime distribution for g, which results in an explicit closed-form expression of the Bayes factor without integral representation. Furthermore, we investigate the model selection consistency of the Bayes factor under three different asymptotic scenarios: either the number of units goes to infinity, the number of observations per unit goes to infinity, or both go to infinity. The results presented extend some existing ones of the Bayes factor for the balanced ANOVA models in the literature.  相似文献   

16.
Missing data, and the bias they can cause, are an almost ever‐present concern in clinical trials. The last observation carried forward (LOCF) approach has been frequently utilized to handle missing data in clinical trials, and is often specified in conjunction with analysis of variance (LOCF ANOVA) for the primary analysis. Considerable advances in statistical methodology, and in our ability to implement these methods, have been made in recent years. Likelihood‐based, mixed‐effects model approaches implemented under the missing at random (MAR) framework are now easy to implement, and are commonly used to analyse clinical trial data. Furthermore, such approaches are more robust to the biases from missing data, and provide better control of Type I and Type II errors than LOCF ANOVA. Empirical research and analytic proof have demonstrated that the behaviour of LOCF is uncertain, and in many situations it has not been conservative. Using LOCF as a composite measure of safety, tolerability and efficacy can lead to erroneous conclusions regarding the effectiveness of a drug. This approach also violates the fundamental basis of statistics as it involves testing an outcome that is not a physical parameter of the population, but rather a quantity that can be influenced by investigator behaviour, trial design, etc. Practice should shift away from using LOCF ANOVA as the primary analysis and focus on likelihood‐based, mixed‐effects model approaches developed under the MAR framework, with missing not at random methods used to assess robustness of the primary analysis. Copyright © 2004 John Wiley & Sons, Ltd.  相似文献   

17.
Staggered nested experimental designs are the most popular class of unbalanced nested designs in practical fields. The most important features of the staggered nested design are that it has a very simple open-ended structure and each sum of squares in the analysis of variance has almost the same degrees of freedom. Based on the features, a class of unbalanced nested designs that is a generalization of the staggered nested design is proposed in this paper. Formulae for the estimation of variance components and their sums are provided. Comparing the variances of the estimators to the staggered nested designs, it is found that some of the generalized staggered nested designs are more efficient than the traditional staggered nested design in estimating some of the variance components and their sums. An example is provided for illustration.  相似文献   

18.
Exact sampling distributions of sums of squares in the unbalanced one-way random model are obtained under heterogeneous error variances. These distributions are used to investigate the effect of heteroscedasticity and unbalancedness on the probability of negative estimate of the group variance component. The computed results reveal that heteroscedasticity affects the probability of negative estimate in all situations of group sizes. Further, the probability decreases with heterogeneity of error variances for balanced situations and increases with variability among group size for equal error variances case.  相似文献   

19.
Serial P-values     
When a collection of hypotheses is to be tested it is necessary to maintain a bound on the simultaneous Type I error rate. Serial P-values are used to define a serial test that does provide such a bound. Moreover, serial P-values are meaningful in the context of multiple tests, with or without the ‘rejection-confirmation’ decisions. The method is particularly suited to the analysis of unbalanced data, especially contingency tables.  相似文献   

20.
Approximate t-tests of single degree of freedom hypotheses in generalized least squares analyses (GLS) of mixed linear models using restricted maximum likelihood (REML) estimates of variance components have been previously developed by Giesbrecht and Burns (GB), and by Jeske and Harville (JH), using method of moment approximations for the degrees of freedom (df) for the tstatistics. This paper proposes approximate Fstatistics for tests of multiple df hypotheses using one-moment and two-moment approximations which may be viewed as extensions of the GB and JH methods. The paper focuses specifically on tests of hypotheses concerning the main-plot treatment factor in split-plot experiments with missing data. Simulation results indicate usually satisfactory control of Type I error rates.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号