1.

Selection of Binary Variables and Classification by Boosting

Junyong Park, Jayson D. Wilbur, Jayanta K. Ghosh, Cindy H. Nakatsu, Corinne Ackerman. Communications in Statistics - Simulation and Computation, 2013, 42(4): 855-869

We adopt boosting for classification and selection of high-dimensional binary variables, for which classical methods based on normality and a nonsingular sample dispersion matrix are inapplicable. Boosting seems particularly well suited to binary variables. We present three methods, two of which combine boosting with the relatively classical variable selection methods developed in Wilbur et al. (2002). Our primary interest is variable selection in classification, with a small misclassification error used to validate the proposed variable selection method. Two of the new methods perform uniformly better than Wilbur et al. (2002) in one set of simulated examples and three real-life examples.

2.

Multivariate Generalized Distributions of Order k

Communications in Statistics - Theory and Methods, 2013, 42(9): 1725-1735

Abstract

The study of multivariate distributions of order k, two of which are the multivariate negative binomial of order k and the multinomial of the same order, was introduced in Philippou et al. (Philippou, A. N., Antzoulakos, D. L., Tripsiannis, G. A. (1988). Multivariate distributions of order k. Statistics and Probability Letters 7(3):207-216) and Philippou et al. (Philippou, A. N., Antzoulakos, D. L., Tripsiannis, G. A. (1990). Multivariate distributions of order k, part II. Statistics and Probability Letters 10(1):29-35). Recently, an order k (or cluster) generalized negative binomial distribution and a multivariate negative binomial distribution were derived in Sen and Jain (Sen, K., Jain, R. (1996). Cluster generalized negative binomial distribution. In: Borthakur, A. C., et al., Eds.; Probability Models and Statistics: A. J. Medhi Festschrift on the Occasion of his 70th Birthday. New Age International Publishers: New Delhi, 227-241) and Sen and Jain (Sen, K., Jain, R. (1997). A multivariate generalized Polya-Eggenberger probability model-first passage approach. Communications in Statistics - Theory and Methods 26:871-884), respectively. In this paper, all four distributions are generalized to a multivariate generalized negative binomial distribution of order k by means of an appropriate sampling scheme and a first passage event. This new distribution includes as special cases several known and new multivariate distributions of order k, and gives rise in the limit to multivariate generalized logarithmic, Poisson, and Borel-Tanner distributions of the same order. Applications are indicated.

3.

On a Generalization of Bivariate Cauchy Distribution

A. Jamalizadeh, N. Balakrishnan. Communications in Statistics - Theory and Methods, 2013, 42(4): 469-474

This paper addresses a generalization of the bivariate Cauchy distribution discussed by Fang et al. (1990), derived from a trivariate normal distribution with a general correlation matrix. We obtain explicit expressions for the joint distribution function and joint density function, and show that they reduce in a special case to the corresponding expressions of Fang et al. (1990). Finally, we show that this generalized distribution is useful in determining the orthant probability of the bivariate skew-normal distribution of Azzalini and Dalla Valle (1996).

4.

Tolerance Factors in Multiple and Multivariate Linear Regressions

K. Krishnamoorthy, Sumona Mondal. Communications in Statistics - Simulation and Computation, 2013, 42(3): 546-559

In this article, an improved method of computing tolerance factors for constructing tolerance regions in a multivariate linear regression model is proposed. The method is based on simulation and on a chi-square approximation to the distribution of a linear function of noncentral chi-square variables. The merits of the proposed approach and of the usual simulation method considered in Lee and Mathew (2004) are evaluated using Monte Carlo simulation. The study indicates that the proposed approach is stable and accurate even for small samples, and better than available methods. For constructing two-sided tolerance intervals in multiple linear regression, coverage-level-adjusted one-sided tolerance factors are shown to be better than available approximate tolerance factors. In many cases, the results based on the coverage-level-adjusted one-sided tolerance factors are as good as those based on the exact two-sided tolerance factors.

5.

Comparison of Frailty Models for Acute Leukemia Data under Gompertz Baseline Distribution

David D. Hanagal, Richa Sharma. Communications in Statistics - Theory and Methods, 2013, 42(7): 1338-1350

In this article, we consider two shared frailty regression models with a Gompertz baseline distribution. The frailty distribution is most often assumed to be gamma. To compare results with the gamma frailty model, we also consider the inverse Gaussian shared frailty model. We apply these two models to a real-life bivariate survival data set of acute leukemia remission times (Freireich et al., 1963). The analysis is performed using Markov chain Monte Carlo methods. The models are compared using Bayesian model selection criteria, and a well-fitting model is suggested for the acute leukemia data.

6.

Marginal Correlation from Logit- and Probit-Beta-Normal Models for Hierarchical Binary Data

Tony Vangeneugden, Geert Molenberghs, Geert Verbeke, Clarice G.B. Demétrio. Communications in Statistics - Theory and Methods, 2014, 43(19): 4164-4178

In hierarchical data settings, be they of a longitudinal, spatial, multi-level, clustered, or otherwise repeated nature, the association between repeated measurements often attracts at least part of the scientific interest. Quantifying the association frequently takes the form of a correlation function, including but not limited to intraclass correlation. Vangeneugden et al. (2010) derived approximate correlation functions for longitudinal sequences of general data type, Gaussian and non-Gaussian, based on generalized linear mixed-effects models. Here, we consider the extended model family proposed by Molenberghs et al. (2010). This family flexibly accommodates data hierarchies, intra-sequence correlation, and overdispersion. The family allows for closed-form means, variance functions, and correlation functions for a variety of outcome types and link functions. Unfortunately, for binary data with a logit link, closed forms cannot be obtained. This is in contrast with the probit link, for which such closed forms can be derived. We therefore concentrate on the probit case. It is of interest not only in its own right, but also as an instrument to approximate the logit case, thanks to the well-known probit-logit ‘conversion.’ Next to the general situation, some important special cases, such as exchangeable clustered outcomes, receive attention because they produce insightful expressions. The closed-form expressions are contrasted with the generic approximate expressions of Vangeneugden et al. (2010) and with approximations derived for the so-called logistic-beta-normal combined model. A simulation study explores the performance of the proposed method. Data from a schizophrenia trial are analyzed and correlation functions derived.
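The probit-logit "conversion" mentioned above can be illustrated numerically. The sketch below is not taken from the article: it assumes the commonly quoted scaling constant 16√3/(15π), which the abstract does not name, and the function names are hypothetical. It only checks how closely a rescaled probit curve tracks the logistic curve.

```python
import math

# Assumed scaling constant (about 0.588); the abstract does not state
# which constant the authors use for the probit-logit conversion.
C = 16 * math.sqrt(3) / (15 * math.pi)

def expit(x: float) -> float:
    """Logistic CDF, i.e. the inverse of the logit link."""
    return 1.0 / (1.0 + math.exp(-x))

def probit_cdf(x: float) -> float:
    """Standard normal CDF, i.e. the inverse of the probit link."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def max_conversion_error(lo: float = -8.0, hi: float = 8.0, steps: int = 1601) -> float:
    """Largest gap between expit(x) and probit_cdf(C * x) on a regular grid."""
    grid = [lo + (hi - lo) * i / (steps - 1) for i in range(steps)]
    return max(abs(expit(x) - probit_cdf(C * x)) for x in grid)
```

Under this assumed constant the two curves stay within about one percentage point of each other on the grid, which is why a probit-based closed form can serve as an approximation for the logit case.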

7.

Rank Tests for Two-Sample Problems Based on Multiple Type-II Censored Data

M. S. Chikkagoudar, B. S. Biradar. Communications in Statistics - Theory and Methods, 2013, 42(18): 3203-3221

In this article, we study the effect of censoring on the asymptotic efficiency of two-sample rank tests based on multiple Type-II censored data. Since the score generating functions associated with these test statistics have a finite number of jump discontinuities, we use a slightly modified version of a theorem of Dupac and Hajek (1969) to obtain their asymptotic distributions under fixed alternatives. This modified version, which leads to a simpler centering constant, was proved by Dupac (1970) in the light of results of Hoeffding (1968), an earlier version of Hoeffding (1973). Hence, we obtain the Pitman asymptotic relative efficiencies (AREs) of these rank tests relative to the corresponding tests based on complete samples. The AREs are computed for some well-known rank tests for two-sample location and scale problems, when the combined ordered samples from different underlying distributions are censored using triple and lower-order Type-II censoring schemes. The effect of all these censoring schemes on the AREs of the different tests is examined numerically. It is found that censoring yields a gain in efficiency in many of the cases considered here. This suggests that in such cases it is possible to improve the efficiency of rank tests by discarding suitable portions of the data.

8.

Population Attributable Fraction as a Measure of Heritability in Dichotomous Twin Data

Viswanathan Ramakrishnan. Communications in Statistics - Simulation and Computation, 2013, 42(3): 405-418

In many genetic analyses of dichotomous twin data, odds ratios have been used to test hypotheses on heritability and shared common environment effects of a given disease (Lichtenstein et al., 2000; Ahlbom et al., 1997; Ramakrishnan et al., 1992). However, estimates of these two effects have not been dealt with in the literature. In epidemiology, the attributable fraction (AF), a function of the odds ratio and the prevalence of the risk factor, has been used to describe the contribution of a risk factor to a disease in a given population (Leviton, 1973). In this article, we adapt the AF to quantify the heritability and the shared common environment. Twin data on cancer, gallstone disease, and phobia are used to illustrate the applicability of the AF estimate as a measure of heritability.
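As a rough illustration of how an attributable fraction depends on an odds ratio and a prevalence, the sketch below uses the textbook Levin-type formula with the odds ratio standing in for the relative risk (a rare-disease assumption). The abstract does not give the article's twin-data adaptation, so this is only the generic epidemiological form, and the function name is hypothetical.

```python
def attributable_fraction(prevalence: float, odds_ratio: float) -> float:
    """Levin-type attributable fraction: p(OR - 1) / (1 + p(OR - 1)).

    Assumes the odds ratio approximates the relative risk; this is the
    generic textbook form, not the twin-specific estimator of the article.
    """
    if not 0.0 <= prevalence <= 1.0:
        raise ValueError("prevalence must lie in [0, 1]")
    if odds_ratio <= 0.0:
        raise ValueError("odds ratio must be positive")
    excess = prevalence * (odds_ratio - 1.0)
    return excess / (1.0 + excess)
```

For instance, a risk factor present in half the population with an odds ratio of 3 gives an attributable fraction of 0.5, while an odds ratio of 1 gives 0 regardless of prevalence.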

9.

Limiting Spectral Distribution for Large Sample Covariance Matrices with m-Dependent Elements

Jun Hui. Communications in Statistics - Theory and Methods, 2013, 42(6): 935-941

This article establishes the limiting spectral distribution of large sample covariance matrices with m-dependent random variables under the second moment condition by verifying the condition of Theorem 1.1 in Bai and Zhou (2008).

10.

Joint Moment Generating Functions of Nonadjacent Dual Generalized Order Statistics from Reflected Generalized Pareto Distributions

A. Ahmad Abd El-Baset. Communications in Statistics - Theory and Methods, 2013, 42(15): 2762-2772

In this article, a class of reflected generalized Pareto distributions (cf. Burkschat et al., 2003) is considered. Recurrence relations for joint moment generating functions of higher nonadjacent dual generalized order statistics based on a random sample drawn from the considered class are derived. Higher joint moments of nonadjacent dual generalized order statistics (with reversed ordered order statistics and lower k-records as special cases) are obtained. Recurrence relations for single and product moment generating functions and moments of higher nonadjacent dual generalized order statistics are derived. Some results on higher moments of nonadjacent generalized order statistics from generalized Pareto distributions (cf. Johnson et al., 1995) are obtained by using a relation connecting higher moments of generalized order statistics and their duals.

11.

Bootstrap approach to test the homogeneity of order restricted mean vectors when the covariance matrices are unknown

Abouzar Bazyari. Communications in Statistics - Simulation and Computation, 2017, 46(9): 7194-7209

We consider testing homogeneity of multivariate normal mean vectors under an order restriction when the covariance matrices are unknown, arbitrary positive definite, and unequal. This testing problem has been studied to some extent, for example by Kulatunga and Sasabuchi (1984) when the covariance matrices are known, and by Sasabuchi et al. (2003) and Sasabuchi (2007) when the covariance matrices are unknown but common. In this paper, a test statistic is proposed. Because the main advantage of a bootstrap test is that it avoids deriving the complex null distribution analytically, a bootstrap version of the test is derived. Since the proposed test statistic is location invariant, the bootstrap p-value can be defined logically, and steps for estimating it are presented. Our numerical studies via Monte Carlo simulation show that the proposed bootstrap test correctly controls the type I error rate. The power of the test for some p-dimensional normal distributions is computed by Monte Carlo simulation. The null distribution of the test statistic is also estimated using kernel density estimation. Finally, the bootstrap test is illustrated using a real data set.

12.

Is Each NPMLE of a Continuous Bivariate Distribution Function with Singly Right-Censored Data Really Inconsistent?

Qiqing Yu, Chingfu Sen, Jinlong Huang, Chinsan Lee. Communications in Statistics - Theory and Methods, 2013, 42(5): 844-862

We consider non-parametric estimation of a continuous cdf of a random vector (X₁, X₂). With bivariate right-censored (RC) data, it is stated in van der Laan (1996, p. 598¹⁰, Ann. Statist.), Quale et al. (2006, JASA), etc. that “it is well known that the NPMLE for continuous data is inconsistent (Tsai et al. (1986)).” The claim is based on a result in Tsai et al. (1986, p. 1352, Ann. Statist.) that if X₁ is right censored but X₂ is not, then common ways of defining an NPMLE lead to inconsistency. If X₁ is right censored and X₂ is type I right-censored (which includes the case in Tsai et al.), we present a consistent NPMLE. The result corrects a common misinterpretation of Tsai's example (Tsai et al., 1986, Ann. Statist.).

13.

A Bottom-Up Dynamic Model of Portfolio Credit Risk with Stochastic Intensities and Random Recoveries

Tomasz R. Bielecki, Areski Cousin, Stéphane Crépey, Alexander Herbertsson. Communications in Statistics - Theory and Methods, 2014, 43(7): 1362-1389

In Bielecki et al. (2014a), the authors introduced a Markov copula model of portfolio credit risk in which pricing and hedging can be done in a sound theoretical and practical way. Further theoretical background and practical details are developed in Bielecki et al. (2014b,c), where the numerical illustrations assumed deterministic intensities and constant recoveries. In the present paper, we show how to incorporate stochastic default intensities and random recoveries in the bottom-up modeling framework of Bielecki et al. (2014a) while preserving numerical tractability. These two features are of primary importance for applications like CVA computations on credit derivatives (Assefa et al., 2011; Bielecki et al., 2012), as CVA is sensitive to the stochastic nature of credit spreads, and random recoveries make it possible to achieve satisfactory calibration even for “badly behaved” data sets. This article is thus a complement to Bielecki et al. (2014a), Bielecki et al. (2014b), and Bielecki et al. (2014c).

14.

Some Fundamental Properties of a Multivariate von Mises Distribution

Kanti V. Mardia. Communications in Statistics - Theory and Methods, 2014, 43(6): 1132-1144

In application areas like bioinformatics, multivariate distributions on angles are encountered which show significant clustering. One approach to statistical modeling of such situations is to use mixtures of unimodal distributions. In the literature (Mardia et al., 2012), the multivariate von Mises distribution, also known as the multivariate sine distribution, has been suggested for components of such models, but work in the area has been hampered by the fact that no good criteria for the von Mises distribution to be unimodal were available. In this article, we study the question of when a multivariate von Mises distribution is unimodal. We give sufficient criteria for this to be the case and show examples of distributions with multiple modes when these criteria are violated. In addition, we propose a method to generate samples from the von Mises distribution in the case of high concentration.

15.

Measures of predictor sensitivity for order-insensitive partitioning of multiple correlation

Sammy Zahran, Michael A. Long, Kenneth J. Berry. Journal of Applied Statistics, 2012, 39(1): 39-51

Lindeman et al. [12] provide a unique solution to the relative importance of correlated predictors in multiple regression by averaging squared semi-partial correlations obtained for each predictor across all p! orderings. In this paper, we propose a series of predictor sensitivity statistics that complement the variance decomposition procedure advanced by Lindeman et al. [12]. First, we detail the logic of averaging over orderings as a technique of variance partitioning. Second, we assess predictors by conditional dominance analysis, a qualitative procedure designed to overcome defects in the Lindeman et al. [12] variance decomposition solution. Third, we introduce a suite of indices to assess the sensitivity of a predictor to model specification, advancing a series of sensitivity-adjusted contribution statistics that allow for more definite quantification of predictor relevance. Fourth, we describe the analytic efficiency of our proposed technique against the Budescu conditional dominance solution to the uneven contribution of predictors across all p! orderings.
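The averaging-over-orderings idea can be sketched in a few lines. The code below is an illustrative reading of the procedure attributed to Lindeman et al., not the authors' implementation: it works from a predictor correlation matrix `rxx` and predictor-response correlations `rxy` (both hypothetical inputs), computes R² for each subset, and averages each predictor's incremental R² (its squared semi-partial correlation at the point of entry) over all p! orderings. The sensitivity statistics proposed in the article are not reproduced here.

```python
from itertools import permutations

def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def r_squared(subset, rxx, rxy):
    """R^2 of the response on the predictors in `subset`, from correlations."""
    if not subset:
        return 0.0
    a = [[rxx[i][j] for j in subset] for i in subset]
    b = [rxy[i] for i in subset]
    beta = solve(a, b)  # standardized regression coefficients
    return sum(bi * ri for bi, ri in zip(beta, b))

def lmg_shares(rxx, rxy):
    """Average each predictor's incremental R^2 over all p! entry orders."""
    p = len(rxy)
    shares = [0.0] * p
    orders = list(permutations(range(p)))
    for order in orders:
        entered = []
        for k in order:
            before = r_squared(entered, rxx, rxy)
            entered.append(k)
            shares[k] += r_squared(entered, rxx, rxy) - before
    return [s / len(orders) for s in shares]
```

Because the increments telescope within every ordering, the shares always sum to the full-model R², and for uncorrelated predictors each share reduces to the squared correlation with the response. Enumerating all p! orderings is only practical for a small number of predictors.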

16.

Bootstrap methods for multivariate hypothesis testing

Łukasz Smaga. Communications in Statistics - Simulation and Computation, 2017, 46(10): 7654-7667

Nonparametric and parametric bootstrap methods for multivariate hypothesis testing are developed. They are used to approximate the null distribution of the test statistics proposed by Duchesne and Francq (2015), resulting in bootstrap testing procedures. For the problem of testing the mean vector of a multivariate distribution, the asymptotic validity of the bootstrap methods is proved. The finite-sample performance of the new solutions is demonstrated by means of Monte Carlo simulation studies. These indicate that, for small sample sizes, the bootstrap tests have better finite-sample properties than the asymptotic tests considered by Duchesne and Francq (2015).
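A nonparametric bootstrap test for a mean vector can be sketched as follows. This is a generic illustration only: the statistic used here is a simple scaled squared distance between the sample mean and the hypothesized mean, chosen for brevity, and is not one of the generalized-inverse statistics of Duchesne and Francq (2015). The function names, the default number of resamples, and the seed are all assumptions.

```python
import random

def mean_vector(rows):
    """Column-wise mean of a list of equal-length observation vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def statistic(rows, mu0):
    """n times the squared Euclidean distance between the sample mean and mu0."""
    xbar = mean_vector(rows)
    return len(rows) * sum((m - m0) ** 2 for m, m0 in zip(xbar, mu0))

def bootstrap_p_value(rows, mu0, n_boot=999, seed=0):
    """Nonparametric bootstrap p-value for H0: mean vector equals mu0."""
    rng = random.Random(seed)
    observed = statistic(rows, mu0)
    xbar = mean_vector(rows)
    # Recentre the data so that the null hypothesis holds for the resamples.
    centred = [[x - m + m0 for x, m, m0 in zip(row, xbar, mu0)] for row in rows]
    exceed = 0
    for _ in range(n_boot):
        resample = [rng.choice(centred) for _ in rows]
        if statistic(resample, mu0) >= observed:
            exceed += 1
    return (exceed + 1) / (n_boot + 1)
```

Recentring before resampling is what makes the resampled statistics approximate the null distribution; resampling the raw data would instead approximate the distribution under whatever mean the data happen to have.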

17.

Multi-Level Factorial Designs with Minimum Numbers of Level Changes

Communications in Statistics - Theory and Methods, 2013, 42(5): 875-885

The order of experimental runs in a fractional factorial experiment is essential when the cost of level changes in factors is considered. The generalized foldover scheme given by Coster and Cheng [1] gives an optimal order of experimental runs in an experiment with specified defining contrasts. An experiment can be specified by a design requirement such as resolution or estimation of some interactions. To meet such a requirement, we can find several sets of defining contrasts. Applying the generalized foldover scheme to these sets of defining contrasts, we obtain designs with different numbers of level changes and then the design with the minimum number of level changes. The difficulty is to find all the sets of defining contrasts. An alternative approach is investigated by Cheng, Martin, and Tang [2] for two-level fractional factorial experiments. In this paper, we investigate experiments with all factors at s levels.

18.

A Comparison of Procedures for Controlling the False Discovery Rate in the Presence of Small Variance Genes: A Simulation Study

Dan Lin, Ziv Shkedy, Tomasz Burzykowski, Willem Talloen, Luc Bijnens. Communications in Statistics - Simulation and Computation, 2013, 42(10): 2111-2122

The Significance Analysis of Microarrays (SAM; Tusher et al., 2001) method is widely used for analyzing gene expression data while controlling the FDR by means of a resampling-based procedure in the microarray setting. One of the main components of the SAM procedure is the adjustment of the test statistic. The introduction of the fudge factor into the test statistic aims at deflating large values of the test statistic that arise from a small standard error of gene expression. Lin et al. (2008) pointed out that, in the presence of small variance genes, the fudge factor does not effectively improve the power and the control of the FDR compared with the SAM procedure without the fudge factor. Motivated by the simulation results presented in Lin et al. (2008), in this article we extend our study to compare several methods for choosing the fudge factor in the modified t-type test statistics, and use simulation studies to investigate the power and the control of the FDR of the considered methods.
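To make the role of the fudge factor concrete, the sketch below computes a SAM-type statistic for a single gene in a two-group comparison: the mean difference divided by the pooled standard error plus a constant `s0`. The function name and the example value of `s0` are assumptions; how `s0` should be chosen is exactly what the article compares, and no particular choice rule is implemented here.

```python
import math
from statistics import mean

def sam_statistic(group1, group2, s0=0.0):
    """Modified t-type statistic: mean difference over (standard error + s0).

    With s0 = 0 this is the ordinary pooled-variance two-sample t statistic.
    """
    n1, n2 = len(group1), len(group2)
    m1, m2 = mean(group1), mean(group2)
    ss = sum((x - m1) ** 2 for x in group1) + sum((x - m2) ** 2 for x in group2)
    s = math.sqrt((1.0 / n1 + 1.0 / n2) * ss / (n1 + n2 - 2))
    return (m1 - m2) / (s + s0)
```

For a gene with very small variance, for example `sam_statistic([1.00, 1.01, 0.99], [0.95, 0.96, 0.94])`, the unadjusted statistic is large despite a tiny mean difference, and a positive `s0` shrinks it toward zero.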

19.

Robust Tests for the Significance of Orthant Restricted Mean Vector

Communications in Statistics - Theory and Methods, 2013, 42(8-9): 1789-1810

Mudholkar and Srivastava [1] adapted Mudholkar and Subbaiah's [2] modified stepwise procedure, using trimmed means in place of means together with appropriate studentization, to construct robust tests for the significance of a mean vector. They concluded that the robust alternatives provide excellent type I error control and a substantial gain in power over Hotelling's T² test for heavy-tailed populations, without significant loss of power when the population is normal. In this paper, we adapt the modified stepwise approach to construct simple tests for the significance of the orthant-constrained mean vector of a p-variate normal population with unknown covariance matrix, and also to construct robust tests without assuming normality. The simple normal theory tests have exact type I error, whereas the robust tests provide reasonable type I error control and a substantial power advantage over Perlman's [3] likelihood ratio test.

20.

Entropy-Based Moment Selection in the Presence of Weak Identification

Alastair R. Hall, Atsushi Inoue, Changmock Shin. Econometric Reviews, 2013, 32(4-6): 398-427

Hall et al. (2007) propose a method for moment selection based on an information criterion that is a function of the entropy of the limiting distribution of the Generalized Method of Moments (GMM) estimator. They establish the consistency of the method subject to certain conditions that include the identification of the parameter vector by at least one of the moment conditions being considered. In this article, we examine the limiting behavior of this moment selection method when the parameter vector is weakly identified by all the moment conditions being considered. It is shown that the selected moment condition is random and hence not consistent in any meaningful sense. As a result, we propose a two-step procedure for moment selection in which identification is first tested using a statistic proposed by Stock and Yogo (2003); only if this statistic indicates identification does the researcher proceed to the second step, in which the aforementioned information criterion is used to select moments. The properties of this two-step procedure are contrasted with those of strategies based on either using all available moments or using the information criterion without the identification pre-test. The performances of these strategies are compared via an evaluation of the finite-sample behavior of various methods for inference about the parameter vector. The inference methods considered are based on the Wald statistic, Anderson and Rubin's (1949) statistic, Kleibergen's (2002) K statistic, and combinations thereof in which the choice is based on the outcome of the test for weak identification.