首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
A supersaturated design is a factorial design in which the number of effects to be estimated is greater than the available number of experimental runs. It is used in many experiments for screening purposes, i.e., for studying a large number of factors and then identifying the active ones. The goal with such a design is to identify just a few of the factors under consideration, that have dominant effects and to do this at minimum cost. While most of the literature on supersaturated designs has focused on the construction of designs and their optimality, the data analysis of such designs remains still at an early stage. In this paper, we incorporate the parameter model complexity into the supersaturated design analysis process, by assuming generalized linear models for a Bernoulli response, for analyzing main effects designs and discovering simultaneously the effects that are significant.  相似文献   

2.
Most methods for describing the relationship among random variables require specific probability distributions and some assumptions concerning random variables. Mutual information, based on entropy to measure the dependency among random variables, does not need any specific distribution and assumptions. Redundancy, which is an analogous version of mutual information, is also proposed as a method. In this paper, the concepts of redundancy and mutual information are explored as applied to multi-dimensional categorical data. We found that mutual information and redundancy for categorical data can be expressed as a function of the generalized likelihood ratio statistic under several kinds of independent log-linear models. As a consequence, mutual information and redundancy can also be used to analyze contingency tables stochastically. Whereas the generalized likelihood ratio statistic to test the goodness-of-fit of the log-linear models is sensitive to the sample size, the redundancy for categorical data does not depend on sample size but depends on its cell probabilities.  相似文献   

3.
P.J. Huber 《Statistics》2013,47(1):41-53
Recently, cumulative residual entropy (CRE) has been found to be a new measure of information that parallels Shannon's entropy (see Rao et al. [Cumulative residual entropy: A new measure of information, IEEE Trans. Inform. Theory. 50(6) (2004), pp. 1220–1228] and Asadi and Zohrevand [On the dynamic cumulative residual entropy, J. Stat. Plann. Inference 137 (2007), pp. 1931–1941]). Motivated by this finding, in this paper, we introduce a generalized measure of it, namely cumulative residual Renyi's entropy, and study its properties. We also examine it in relation to some applied problems such as weighted and equilibrium models. Finally, we extend this measure into the bivariate set-up and prove certain characterizing relationships to identify different bivariate lifetime models.  相似文献   

4.
In the literature of information theory, the concept of generalized entropy has been proposed and the length-based shift dependent information measure has been studied. In this paper, the concept of weighted generalized entropy has been introduced. The properties of weighted generalized residual entropy and weighted generalized past entropy are also discussed.  相似文献   

5.
Supersaturated designs are factorial designs in which the number of potential effects is greater than the run size. They are commonly used in screening experiments, with the aim of identifying the dominant active factors with low cost. However, an important research field, which is poorly developed, is the analysis of such designs with non-normal response. In this article, we develop a variable selection strategy, through the modification of the PageRank algorithm, which is commonly used in the Google search engine for ranking Webpages. The proposed method incorporates an appropriate information theoretical measure into this algorithm and as a result, it can be efficiently used for factor screening. A noteworthy advantage of this procedure is that it allows the use of supersaturated designs for analyzing discrete data and therefore a generalized linear model is assumed. As it is depicted via a thorough simulation study, in which the Type I and Type II error rates are computed for a wide range of underlying models and designs, the presented approach can be considered quite advantageous and effective.  相似文献   

6.
Recently, many supersaturated designs have been proposed. A supersaturated design is a fractional factorial design in which the number of factors is greater than the number of experimental runs. The main thrust of the previous studies has been to generate more columns while avoiding large values of squared inner products among all design columns. These designs would be appropriate if the probability for each factor being active is uniformly distributed. When factors can be partitioned into two groups, namely, with high and low probabilities of each factor being active, it is desirable to maintain orthogonality among columns to be assigned to the factors in the high-probability group. We discuss a supersaturated design including an orthogonal base which is suitable for this common situation. Mathematical results on the existence of the supersaturated designs are shown, and the construction of supersaturated designs is presented. We next discuss some properties of the proposed supersaturated designs based on the squared inner products.  相似文献   

7.
Abstract

Recently, the notion of cumulative residual Rényi’s entropy has been proposed in the literature as a measure of information that parallels Rényi’s entropy. Motivated by this, here we introduce a generalized measure of it, namely cumulative residual inaccuracy of order α. We study the proposed measure for conditionally specified models of two components having possibly different ages called generalized conditional cumulative residual inaccuracy measure. Several properties of generalized conditional cumulative residual inaccuracy measure including the effect of monotone transformation are investigated. Further, we provide some bounds on using the usual stochastic order and characterize some bivariate distributions using the concept of conditional proportional hazard rate model.  相似文献   

8.
By incorporating informative and/or historical knowledge of the unknown parameters, Bayesian experimental design under the decision-theory framework can combine all the information available to the experimenter so that a better design may be achieved. Bayesian optimal designs for generalized linear regression models, especially for the Poisson regression model, is of interest in this article. In addition, lack of an efficient computational method in dealing with the Bayesian design leads to development of a hybrid computational method that consists of the combination of a rough global optima search and a more precise local optima search. This approach can efficiently search for the optimal design for multi-variable generalized linear models. Furthermore, the equivalence theorem is used to verify whether the design is optimal or not.  相似文献   

9.
This paper presents generalized theorems on the optimality of supersaturated designs in terms of low dependency over all pairs of column vectors. Some mixed-level supersaturated designs are constructed using a method based on these theorems. An index is proposed for measuring the efficiency of supersaturated design and applied to evaluate the constructed mixed-level supersaturated designs.  相似文献   

10.
Major sources of information for the estimation of the size of the fish stocks and the rate of their exploitation are samples from which the age composition of catches may be determined. However, the age composition in the catches often varies as a result of several factors. Stratification of the sampling is desirable, because it leads to better estimates of the age composition, and the corresponding variances and covariances. The analysis is impeded by the fact that the response is ordered categorical. This paper introduces an easily applicable method to analyze such data. The method combines continuation-ratio logits and the theory for generalized linear mixed models. Continuation-ratio logits are designed for ordered multinomial response and have the feature that the associated log-likelihood splits into separate terms for each category levels. Thus, generalized linear mixed models can be applied separately to each level of the logits. The method is illustrated by the analysis of age-composition data collected from the Danish sandeel fishery in the North Sea in 1993. The significance of possible sources of variation is evaluated, and formulae for estimating the proportions of each age group and their variance-covariance matrix are derived.  相似文献   

11.
Proportion differences are often used to estimate and test treatment effects in clinical trials with binary outcomes. In order to adjust for other covariates or intra-subject correlation among repeated measures, logistic regression or longitudinal data analysis models such as generalized estimating equation or generalized linear mixed models may be used for the analyses. However, these analysis models are often based on the logit link which results in parameter estimates and comparisons in the log-odds ratio scale rather than in the proportion difference scale. A two-step method is proposed in the literature to approximate the calculation of confidence intervals for the proportion difference using a concept of effective sample sizes. However, the performance of this two-step method has not been investigated in their paper. On this note, we examine the properties of the two-step method and propose an adjustment to the effective sample size formula based on Bayesian information theory. Simulations are conducted to evaluate the performance and to show that the modified effective sample size improves the coverage property of the confidence intervals.  相似文献   

12.
Abstract

In this paper, we consider weighted extensions of generalized cumulative residual entropy and its dynamic(residual) version. Our results include linear transformations, stochastic ordering, bounds, aging class properties and some relationships with other reliability concepts. We also define the conditional weighted generalized cumulative residual entropy and discuss some properties of its. For these concepts, we obtain some characterization results under some assumptions. Finally, we provide an estimator of the new information measure using empirical approach. In addition, we study large sample properties of this estimator.  相似文献   

13.
Tsallis entropy is a generalized form of entropy and tends to be Shannon entropy when q → 1. Using Tsallis entropy, an alternative estimation methodology (generalized maximum Tsallis entropy) is introduced and used to estimate the parameters in a linear regression model when the basic data are ill-conditioned. We describe the generalized maximum Tsallis entropy and for q = 2 we call that GMET2 estimator. We apply the GMET2 estimator for estimating the linear regression model Y = Xβ + e where the design matrix X is subject to severe multicollinearity. We compared the GMET2, generalized maximum entropy (GME), ordinary least-square (OLS), and inequality restricted least-square (IRLS) estimators on the analyzed dataset on Portland cement.  相似文献   

14.
A particular influence measure for restricted regression models is reviewed in this paper. We give em- phasis on establishing regularity conditions to apply the proposed influence measure in restricted gen- eralized linear models. The development of conditional residuals is also discussed. In particular, a sim- ulation study was conducted in order to compare the distributions of the proposed residuals for various generalized linear models. Finally, an application is given.  相似文献   

15.
16.
Generalized linear models are commonly used to analyze categorical data such as binary, count, and ordinal outcomes. Adjusting for important prognostic factors or baseline covariates in generalized linear models may improve the estimation efficiency. The model‐based mean for a treatment group produced by most software packages estimates the response at the mean covariate, not the mean response for this treatment group for the studied population. Although this is not an issue for linear models, the model‐based group mean estimates in generalized linear models could be seriously biased for the true group means. We propose a new method to estimate the group mean consistently with the corresponding variance estimation. Simulation showed the proposed method produces an unbiased estimator for the group means and provided the correct coverage probability. The proposed method was applied to analyze hypoglycemia data from clinical trials in diabetes. Copyright © 2014 John Wiley & Sons, Ltd.  相似文献   

17.
Regression models are often used to make predictions. All the information needed is contained in the predictive distribution. However, this cannot be evaluated explicitly for most generalized linear models. We construct two approximations to this distribution and demonstrate their use on two sets of survival data, corresponding to the outcome of patients admitted to intensive care units and the survival times of leukaemia patients.Regression models are often used to make predictions. All the information needed is contained in the predictive distribution. However, this cannot be evaluated explicitly for most generalized linear models. We construct two approximations to this distribution and demonstrate their use on two sets of survival data, corresponding to the outcome of patients admitted to intensive care units and the survival times of leukaemia patients.Regression models are often used to make predictions. All the information needed is contained in the predictive distribution. However, this cannot be evaluated explicitly for most generalized linear models. We construct two approximations to this distribution and demonstrate their use on two sets of survival data, corresponding to the outcome of patients admitted to intensive care units and the survival times of leukaemia patients.Regression models are often used to make predictions. All the information needed is contained in the predictive distribution. However, this cannot be evaluated explicitly for most generalized linear models. We construct two approximations to this distribution and demonstrate their use on two sets of survival data, corresponding to the outcome of patients admitted to intensive care units and the survival times of leukaemia patients.  相似文献   

18.
Regression calibration is a simple method for estimating regression models when covariate data are missing for some study subjects. It consists in replacing an unobserved covariate by an estimator of its conditional expectation given available covariates. Regression calibration has recently been investigated in various regression models such as the linear, generalized linear, and proportional hazards models. The aim of this paper is to investigate the appropriateness of this method for estimating the stratified Cox regression model with missing values of the covariate defining the strata. Despite its practical relevance, this problem has not yet been discussed in the literature. Asymptotic distribution theory is developed for the regression calibration estimator in this setting. A simulation study is also conducted to investigate the properties of this estimator.  相似文献   

19.
In this paper, we suggest an extension of the cumulative residual entropy (CRE) and call it generalized cumulative entropy. The proposed entropy not only retains attributes of the existing uncertainty measures but also possesses the absolute homogeneous property with unbounded support, which the CRE does not have. We demonstrate its mathematical properties including the entropy of order statistics and the principle of maximum general cumulative entropy. We also introduce the cumulative ratio information as a measure of discrepancy between two distributions and examine its application to a goodness-of-fit test of the logistic distribution. Simulation study shows that the test statistics based on the cumulative ratio information have comparable statistical power with competing test statistics.  相似文献   

20.
In this paper, we consider a new mixture of varying coefficient models, in which each mixture component follows a varying coefficient model and the mixing proportions and dispersion parameters are also allowed to be unknown smooth functions. We systematically study the identifiability, estimation and inference for the new mixture model. The proposed new mixture model is rather general, encompassing many mixture models as its special cases such as mixtures of linear regression models, mixtures of generalized linear models, mixtures of partially linear models and mixtures of generalized additive models, some of which are new mixture models by themselves and have not been investigated before. The new mixture of varying coefficient model is shown to be identifiable under mild conditions. We develop a local likelihood procedure and a modified expectation–maximization algorithm for the estimation of the unknown non‐parametric functions. Asymptotic normality is established for the proposed estimator. A generalized likelihood ratio test is further developed for testing whether some of the unknown functions are constants. We derive the asymptotic distribution of the proposed generalized likelihood ratio test statistics and prove that the Wilks phenomenon holds. The proposed methodology is illustrated by Monte Carlo simulations and an analysis of a CO2‐GDP data set.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号