首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 93 毫秒
1.
Li et al. (2011 Li, B., Artemiou, A., Li, L. (2011). Principal support vector machine for linear and nonlinear sufficient dimension reduction. Ann. Stat. 39:31823210.[Crossref], [Web of Science ®] [Google Scholar]) presented the novel idea of using support vector machines (SVMs) to perform sufficient dimension reduction. In this work, we investigate the potential improvement in recovering the dimension reduction subspace when one changes the SVM algorithm to treat imbalance based on several proposals in the machine learning literature. We find out that in most situations, treating the imbalanced nature of the slices will help improve the estimation. Our results are verified through simulation and real data applications.  相似文献   

2.
In this paper a methodology for the delineation of local labour markets (LLMs) using evolutionary algorithms is proposed. This procedure, based on that in Flórez-Revuelta et al. [13 F. Flórez-Revuelta, J.M. Casado-Díaz, and L. Martínez-Bernabeu, An evolutionary approach to the delineation of functional areas base on travel-to-work flows, Int. J. Autom. Comput. 5(1) (2008), pp. 1021. doi: 10.1007/s11633-008-0010-6[Crossref] [Google Scholar],14 F. Flórez-Revuelta, J.M. Casado-Díaz, L. Martínez-Bernabeu, and R. Gómez-Hernández, A memetic algorithm for the delineation of local labour markets, in Parallel Problem Solving from Nature X, Vol. 5199, Lecture Notes in Computer Science, G. Rudolph, T.H. Jansen, S.M. Lucas, C. Poloni, and N. Beume, eds., Springer, Berlin, 2008, pp. 1011–1020. [Google Scholar]], introduces three modifications. First, initial groups of municipalities with a minimum size requirement are built using the travel time between them. Second, a not fully random initiation algorithm is proposed. And third, as a final stage of the procedure, a contiguity step is implemented. These modifications significantly decrease the computational times of the algorithm (up to a 99%) without any deterioration of the quality of the solutions. The optimization algorithm may give a set of potential solutions with very similar values with respect to the objective function what would lead to different partitions, both in terms of number of markets and their composition. In order to capture their common aspects an algorithm based on a cluster partitioning of k-means type is presented. This stage of the procedure also provides a ranking of LLMs foci useful for planners and administrations in decision-making processes on issues related to labour activities. Finally, to evaluate the performance of the algorithm a toy example with artificial data is analysed. The full methodology is illustrated through a real commuting data set of the region of Aragón (Spain).  相似文献   

3.
In this paper, we propose an asymmetric class of bivariate copulas. This class is obtained through limiting properties of the extended copula introduced by Bekrizadeh, et al. (2015 Bekrizadeh, H., Parham, G. A., Zadkarami, M. R. (2015). Extending some classes of copulas; Applications. Ph.D. Thesis, University of Shahid Chamran, Ahvaz. [Google Scholar]), and includes some of known copulas. Some general formulas for well-known association measures and concepts of dependence of the proposed model are obtained. This paper highlights the usefulness of this new bivariate copula for modeling the interested variables whose marginal distribution effect on joint distribution isn't identical. We apply some subfamilies of this new class to model a dataset of medical science to show the superiority of presented model in comparison with the known copulas. These results will be investigated using simulation.  相似文献   

4.
We adopt boosting for classification and selection of high-dimensional binary variables for which classical methods based on normality and non singular sample dispersion are inapplicable. Boosting seems particularly well suited for binary variables. We present three methods of which two combine boosting with the relatively classical variable selection methods developed in Wilbur et al. (2002 Wilbur , J. D. , Ghosh , J. K. , Nakatsu , C. H. , Brouder , S. M. , Doerge , R. W. ( 2002 ). Variable selection in high-dimensional multivariate binary data with application to the analysis of microbial community DNA fingerprints . Biometrics 58 : 378386 . [Google Scholar]). Our primary interest is variable selection in classification with small misclassification error being used as validation of proposed method for variable selection. Two of the new methods perform uniformly better than Wilbur et al. (2002 Wilbur , J. D. , Ghosh , J. K. , Nakatsu , C. H. , Brouder , S. M. , Doerge , R. W. ( 2002 ). Variable selection in high-dimensional multivariate binary data with application to the analysis of microbial community DNA fingerprints . Biometrics 58 : 378386 . [Google Scholar]) in one set of simulated and three real life examples.  相似文献   

5.
This article considers estimation of Panel Vector Autoregressive Models of order 1 (PVAR(1)) with focus on fixed T consistent estimation methods in First Differences (FD) with additional strictly exogenous regressors. Additional results for the Panel FD ordinary least squares (OLS) estimator and the FDLS type estimator of Han and Phillips (2010 Han, C., Phillips, P. C. B. (2010). Gmm estimation for dynamic panels with fixed effects and strong instruments at unity. Econometric Theory 26:119151.[Crossref], [Web of Science ®] [Google Scholar]) are provided. Furthermore, we simplify the analysis of Binder et al. (2005 Binder, M., Hsiao, C., Pesaran, M. H. (2005). Estimation and inference in short panel vector autoregressions with unit root and cointegration. Econometric Theory 21:795837.[Crossref], [Web of Science ®] [Google Scholar]) by providing additional analytical results and extend the original model by taking into account possible cross-sectional heteroscedasticity and presence of strictly exogenous regressors. We show that in the three wave panel the log-likelihood function of the unrestricted Transformed Maximum Likelihood (TML) estimator might violate the global identification assumption. The finite-sample performance of the analyzed methods is investigated in a Monte Carlo study.  相似文献   

6.
New drug discovery in the pediatrics has dramatically improved survival, but with long- term adverse events. This motivates the examination of adverse outcomes such as long-term toxicity in a phase IV trial. An ideal approach to monitor long-term toxicity is to systematically follow the survivors, which is generally not feasible. Instead, cross-sectional surveys are conducted in Hudson et al. (2007 Hudson , M. M. , Rai , S. N. , Nunez , C. , Merchant , T. E. , Marina , N. M. , Zalamea , N. , Cox , C. , Phipps , S. , Pompeu , R. , Rosenthal , D. ( 2007 ). Noninvasive evaluation of late anthracycline cardiac toxicity in childhood cancer survivors . J. Clin. Oncol. 25 : 36353643 .[Crossref], [PubMed], [Web of Science ®] [Google Scholar]), with one of the objectives to estimate the cumulative incidence rates along with specific interest in fixed-term (5 or 10 year) rates. We present inference procedures based on current status data to our motivating example with very interesting findings.  相似文献   

7.
This article provides an Edgeworth expansion for the distribution of the log-likelihood derivative LLD of the parameter of a time series generated by a linear regression model with Gaussian, stationary, and long-memory errors. Under some sets of conditions on the regression coefficients, the spectral density function, and the parameter values, an Edgeworth expansion of the density as well as the distribution function of a vector of centered and normalized derivatives of the plug-in log-likelihood PLL function of arbitrarily large order is established. This is done by extending the results of Lieberman et al. (2003 Lieberman , O. , Rousseau , J. , Zucker , D. M. ( 2003 ). Valid edgeworth expansions for the maximum likelihood estimator of the parameter of a stationary. gaussian, strongly dependent processes. it Ann. Statist. 31:586–612 . [Google Scholar]), who provided an Edgeworth expansion for the Gaussian stationary long-memory case, to our present model, which is a linear regression process with stationary Gaussian long-memory errors.  相似文献   

8.
In this work, we propose the construction of a chi-squared goodness-of-fit test in censored data case, for Bertholon model which can analyse various competing risks of failure or death. This test is based on a modification of the Nikulin-Rao-Robson (NRR) statistic proposed by Bagdonavicius and Nikulin (2011a Bagdonavicius, V., Nikulin, M. (2011a). Chi-squared tests for general composite hypotheses from censored samples. Comptes Rendus Mathématiques: Series I 349(3–4):219223. [Google Scholar], 2011b Bagdonavicius, V., Nikulin, M. (2011b). Chi-squared goodness-of-fit test for right censored data. International Journal of Applied Mathematics and Statistics 24:3050. [Google Scholar]) for censored data. We applied this test to numerical examples from simulated samples and real data.  相似文献   

9.
Based on the recursions in Huffer (1988 Huffer, F. (1988). Divided differences and the joint distribution of linear combinations of spacings. Journal of Applied Probability 25:346354. [Google Scholar]) and Huffer and Lin (2001 Huffer, F. W., Lin, C. T. (2001). Computing the joint distribution of general linear combinations of spacings or exponential variates. Statistica Sinica 11:11411157. [Google Scholar]), we present a two-stage algorithm and two specialized methods for evaluating the probabilities involving linear combination of spacings of special forms. The two-stage algorithm combines the advantages of marking algorithm in Huffer and Lin (1997 Huffer, F. W., Lin, C. T. (1997). Computing the exact distribution of the extremes of sums of consecutive spacings. Computational Statistics and Data Analysis 26:117132. [Google Scholar]) and general algorithm in Huffer and Lin (2001 Huffer, F. W., Lin, C. T. (2001). Computing the joint distribution of general linear combinations of spacings or exponential variates. Statistica Sinica 11:11411157. [Google Scholar]). The proposed methods can analytically derive the exact expressions for some specific problems, and efficiently handle problems such as the distribution of the circular scan statistic and multiple coverage probabilities.  相似文献   

10.
Accelerated failure time models are useful in survival data analysis, but such models have received little attention in the context of measurement error. In this paper we discuss an accelerated failure time model for bivariate survival data with covariates subject to measurement error. In particular, methods based on the marginal and joint models are considered. Consistency and efficiency of the resultant estimators are investigated. Simulation studies are carried out to evaluate the performance of the estimators as well as the impact of ignoring the measurement error of covariates. As an illustration we apply the proposed methods to analyze a data set arising from the Busselton Health Study (Knuiman et al., 1994 Knuiman , M. W. , Cullent , K. J. , Bulsara , M. K. , Welborn , T. A. , Hobbs , M. S. T. ( 1994 ). Mortality trends, 1965 to 1989, in Busselton, the site of repeated health surveys and interventions . Austral. J. Public Health 18 : 129135 . [CSA] [Crossref], [PubMed] [Google Scholar]).  相似文献   

11.
《统计学通讯:理论与方法》2012,41(16-17):3162-3178
In this article we use a new methodology, based on algebraic strata, to generate the class of all the orthogonal arrays of given size and strength. From this class we extract all the non isomorphic orthogonal arrays. Then, using all these non isomorphic orthogonal arrays, we suggest a method based on the inequivalent matrices permutations testing procedures Basso et al. (2004 Basso , D. , Evangelaras , H. , Koukouvinos , C. , Salmaso , L. ( 2004 ). Nonparametric testing for main effects on inequivalent designs. Proc. 7th Int. Workshop Model-Oriented Design Anal. Heeze, Netherlands, June 14–18 . [Google Scholar]) in order to obtain separate permutation tests for the effects in unreplicated mixed level fractional factorial designs. In order to validate the proposed method we perform a Monte Carlo simulation study and find out that the permutation tests appear to be a valid solution for testing effects, in particular when the usual normality assumptions cannot be justified.  相似文献   

12.
This article studies the estimation of change point in panel models. We extend Bai (2010 Bai, J. (2010). Common breaks in means and variances for panel data. Journal of Econometrics 157:7892.[Crossref], [Web of Science ®] [Google Scholar]) and Feng et al. (2009 Feng, Q., Kao, C., Lazarová, S. (2009). Estimation and Identification of Change Points in Panel Models, Working paper, Syracuse University. [Google Scholar]) to the case of stationary or nonstationary regressors and error term, and whether the change point is present or not. We prove consistency and derive the asymptotic distributions of the Ordinary Least Squares (OLS) and First Difference (FD) estimators. We find that the FD estimator is robust for all cases considered.  相似文献   

13.
We propose a new ratio type estimator for estimating the finite population mean using two auxiliary variables in stratified two-phase sampling. Expressions for bias and mean squared error of the proposed estimator are derived up to the first order of approximation. The proposed estimator is more efficient than the usual stratified sample mean estimator, traditional stratified ratio estimator and some other stratified estimators including Bahl and Tuteja (1991 Bahl, S., Tuteja, R. K. (1991). Ratio and product type exponential estimators. Information and Optimization Sciences 12:159163. [Google Scholar]), Chami et al. (2012 Chami, P. S., Singh, B., Thomas, D. (2012). A two-prameter ratio-product-ratio estimator using auxiliary information. ISRN Probability and Statistics 2012:115, doi: 10.5402/2012/103860.[Crossref] [Google Scholar]), Chand (1975 Chand, L. (1975) Some Ratio Type Estimator Based on two or more Auxiliary Variables, Ph.D. dissertation, Iowa State University, Ames, Iowa (unpublished). [Google Scholar]), Choudhury and Singh (2012 Choudhury, S., Singh, B. K. (2012). A class of chain ratio-product type estimators with two auxiliary variables under double sampling scheme. Journal of the Korean Statistical Society 41:247256. [Google Scholar]), Hamad et al. (2013 Hamad, N., Hanif, M., Haider, N. (2013). A regression type estimator with two auxiliary variables for two-phase sampling. Open Journal of Statistics, 3:7478. [Google Scholar]), Vishwakarma and Gangele (2014 Vishwakarma, G. K., Gangele, R. K. (2014). A class of chain ratio-type exponential estimators in double sampling using two auxiliary variates. Applied Mathematics and Computation 227:171175. [Google Scholar]), Sanaullah et al. (2014 Sanaullah, A., Ali, H. M., Noor ul Amin, M., Hanif, M. (2014). Generalized exponential chain ratio estimators under stratified two-phase random sampling. Applied Mathematics and Computation 226:541547. [Google Scholar]), and Chanu and Singh (2014 Chanu, W. K., Singh, B. K. (2014). Improved class of ratio-cum-product estimators of finite population mean in two phase sampling. Global Journal of Science Frontier Research: F Mathematics and Decision Sciences 14(2):114. [Google Scholar]).  相似文献   

14.
Tiku and Vaughan (1999 Tiku , M. L. , Vaughan , D. C. ( 1999 ). A Family of Short-tailed Symmetric Distributions. Technical Report, McMaster University, Canada . [Google Scholar]) introduced a short-tailed symmetric family recently. In the article, the tail properties of the short-tailed symmetric distribution are studied and the asymptotic distribution of the maximum of i.i.d. random variables obeying the short-tailed distribution is gained.  相似文献   

15.
This article considers some classes of estimators of the population median of the study variable using information on an auxiliary variable with their properties under large sample approximation. Asymptotic optimum estimator (AOE) in each class of estimators has been investigated along with the approximate mean square error formulae. It has been shown that the proposed classes of estimators are better than these considered by Gross (1980 Gross , T. S. ( 1980 ). Median estimation in sample surveys. Proc. Surv. Res. Meth. Sect. Amer. Statist. Assoc. 181–184 . [Google Scholar]), Kuk and Mak (1989 Kuk , A. Y. C. , Mak , T. K. ( 1989 ). Median estimation in the presence of auxiliary information . J. Roy. Statist. Soc. Ser. B51 : 261269 . [Google Scholar]), Singh et al. (2003a Singh , H. P. , Singh , S. , Joarder , A. H. ( 2003a ). Estimation of population median when mode of an auxiliary variable is known . J. Statist. Res. 37 ( 1 ): 5763 . [Google Scholar]), and Al and Cingi (2009 Al , S. , Cingi , H. ( 2009 ). New estimators for the population median in simple random sampling. Tenth Islamic Countries Conference on Statistical Sciences, held in New Cairo, Egypt . [Google Scholar]). An empirical study is carried out to judge the merits of the suggested class of estimators over other existing estimators.  相似文献   

16.
In hierarchical data settings, be it of a longitudinal, spatial, multi-level, clustered, or otherwise repeated nature, often the association between repeated measurements attracts at least part of the scientific interest. Quantifying the association frequently takes the form of a correlation function, including but not limited to intraclass correlation. Vangeneugden et al. (2010 Vangeneugden, T., Molenberghs, G., Laenen, A., Geys, H., Beunckens, C., Sotto, C. (2010). Marginal correlation in longitudinal binary data based on generalized linear mixed models. Communi. Stati. Theory &; Methods. 39:35423557. [Google Scholar]) derived approximate correlation functions for longitudinal sequences of general data type, Gaussian and non-Gaussian, based on generalized linear mixed-effects models. Here, we consider the extended model family proposed by Molenberghs et al. (2010 Molenberghs, G., Verbeke, G., Demétrio, C., Vieira, A. (2010). A family of generalized linear models for repeated measures with normal and conjugate random effects. Stat. Sci. 25:325347.[Crossref], [Web of Science ®] [Google Scholar]). This family flexibly accommodates data hierarchies, intra-sequence correlation, and overdispersion. The family allows for closed-form means, variance functions, and correlation function, for a variety of outcome types and link functions. Unfortunately, for binary data with logit link, closed forms cannot be obtained. This is in contrast with the probit link, for which such closed forms can be derived. It is therefore that we concentrate on the probit case. It is of interest, not only in its own right, but also as an instrument to approximate the logit case, thanks to the well-known probit-logit ‘conversion.’ Next to the general situation, some important special cases such as exchangeable clustered outcomes receive attention because they produce insightful expressions. The closed-form expressions are contrasted with the generic approximate expressions of Vangeneugden et al. (2010 Vangeneugden, T., Molenberghs, G., Laenen, A., Geys, H., Beunckens, C., Sotto, C. (2010). Marginal correlation in longitudinal binary data based on generalized linear mixed models. Communi. Stati. Theory &; Methods. 39:35423557. [Google Scholar]) and with approximations derived for the so-called logistic-beta-normal combined model. A simulation study explores performance of the method proposed. Data from a schizophrenia trial are analyzed and correlation functions derived.  相似文献   

17.
Mudholkar and Srivastava [1] Mudholkar, G. S. and Srivastava, D. K. A class of robust stepwise alternatives to Hotelling's T2tests. Submitted to the Journal of Applied Statistics 1999 [Google Scholar]adapted Mudholkar and Subbaiah's [2] Mudholkar, G. S. and Subbaiah, P. 1980. Testing significance of a mean vector–a possible alternative to Hotelling's T2. Ann. Inst. Statist. Math., 32(A): 4352.  [Google Scholar]modified stepwise procedure, using the trimmed means in place of the means and appropriate studentization, to construct robust tests for the significance of a mean vector. They concluded that the robust alternatives provide excellent type I error control, and a substantial gain in power over Hotelling's T 2test in case of heavy tailed populations without significant loss of power when the population is normal. In this paper we adapt the modified stepwise approach to construct simple tests for the significance of the orthant constrained mean vector of a p-variate normal population with unknown covariance matrix, and also for constructing robust tests without assuming normality. The simple normal theory tests have exact type I error, whereas the robust tests provide a reasonably type I error control and substantial power advantage over Perlman's [3] Perlman, M. D. 1969. One-sided testing problems in multivariate analysis. Annals of Mathematical Statistics, 40: 549567. [Crossref] [Google Scholar]likelihood ratio test.  相似文献   

18.
In many genetic analyses of dichotomous twin data, odds ratios have been used to test hypotheses on heritability and shared common environment effects of a given disease (Lichtenstein et al., 2000 Lichtenstein , P. , Holm , N. , Verkasalo , P. , Iliadou , A. , Kaprio , J. , Koskenvuo , M. , Pukkala , E. , Skytthe , A. , Hemminki , K. ( 2000 ). Environmental and heritable factors in the causation of cancer . New England Journal of Medicine 343 : 7885 .[Crossref], [Web of Science ®] [Google Scholar]; Ahlbom et al., 1997 Ahlbom , A. , Lichtenstein , P. , Malmström , H. , Feychting , M. , Hemminki , K. , Pedersen , N. L. ( 1997 ). Cancer in twins: genetic and nongenetic familial risk factors . Journal of the National Cancer Institute 89 : 28793 . [Google Scholar]; Ramakrishnan et al., 1992 Ramakrishnan , V. , Goldberg , J. , Henderson , W. , Elsen , S. , True , W. , Lyons , M. , Tsuang , M. ( 1992 ). Elementary methods for the analysis of dichotomous outcomes in unselected samples of twins . Genetic Epidemiology 9 : 273287 . [Google Scholar], 4). However, estimates of these two effects have not been dealt with in the literature. In epidemiology, the attributable fraction (AF), a function of the odds ratio and the prevalence of the risk factor has been used to describe the contribution of a risk factor to a disease in a given population (Leviton, 1973 Leviton , A. ( 1973 ). Definitions of attributable risk . American Journal of Epidemiology 98 : 231 . [Google Scholar]). In this article, we adapt the AF to quantify the heritability and the shared common environment. Twin data on cancer, gallstone disease and phobia are used to illustrate the applicability of the AF estimate as a measure of heritability.  相似文献   

19.
In this article, we have evaluated the performance of different forecasters and tested association between their performances for different pairs of variables. We have used three data sets of track records of professional U.S. economic forecasters participating in the Blue Chip consensus forecasting service (the data sets contain the root mean square errors (RMSE) of different forecasters for different years). To evaluate the performance of forecasters we have covered three well-known tests, namely the usual F test (cf. Fisher (1923 Fisher, R. A., Mackenzie, M. A. (1923). Studied in crop variation II. The manurial response of different potato. Journal of Agricultural Science 13:311320. [Google Scholar])), Kruskal Wallis test (cf. Kruskal and Wallis (1952 Kruskall, W. H., Wallis, W. A. (1952). Use of ranks in one-criterion variance analysis. Journal of American Statistical Association 47:583621. [Google Scholar])), and Extension of Median test (cf. Daniel (1990 Daniel, W. W. (1990). Applied Nonparametric Statistics. Duxbury Classic Series. (2nd Ed.), Boston. [Google Scholar])). To test the association between the forecaster's performances for different pairs of variables, we have considered Gini mean correlation coefficient rg1 (cf. Yitzhaki, S., and Olkin, I. (1991 Yitzhaki, S., Olkin, I. (1991). Concentration indices and concentration curves, in K. Mosler and M. Scarsini (eds.), Stochastic Orders and Decisions under Risk, Institute of Mathematical Statistics: Lecture-Notes Monograph Series, 19, 1991, 380392. [Google Scholar]) and Yitzhaki (2003 Yitzaki, S. (2003). Gini mean difference: A superior measure of variability for non normal distribution. Metron-International Journal of Statistics, LXI:285316. [Google Scholar])), Modified rank correlation coefficient (cf. Zimmerman (1994 Zimmerman, D. W. (1994). A Note on modified rank correlation. Journal of educational and Behavioral Statistics 19:357362. [Google Scholar])) and three modifications of Spearman rank correlation coefficient. We have observed that different forecasters do not necessarily offer same average performance. Moreover, an evidence of association between two criteria does not always lead us reaching at the same decision. The outcomes of the study may help the practitioners in selecting the best forecaster(s) for policymaking purposes.  相似文献   

20.
When a sufficient correlation between the study variable and the auxiliary variable exists, the ranks of the auxiliary variable are also correlated with the study variable, and thus, these ranks can be used as an effective tool in increasing the precision of an estimator. In this paper, we propose a new improved estimator of the finite population mean that incorporates the supplementary information in forms of: (i) the auxiliary variable and (ii) ranks of the auxiliary variable. Mathematical expressions for the bias and the mean-squared error of the proposed estimator are derived under the first order of approximation. The theoretical and empirical studies reveal that the proposed estimator always performs better than the usual mean, ratio, product, exponential-ratio and -product, classical regression estimators, and Rao (1991 Rao, T.J. (1991). On certail methods of improving ration and regression estimators. Commun. Stat. Theory Methods 20(10):33253340.[Taylor &; Francis Online], [Web of Science ®] [Google Scholar]), Singh et al. (2009 Singh, R., Chauhan, P., Sawan, N., Smarandache, F. (2009). Improvement in estimating the population mean using exponential estimator in simple random sampling. Int. J. Stat. Econ. 3(A09):1318. [Google Scholar]), Shabbir and Gupta (2010 Shabbir, J., Gupta, S. (2010). On estimating finite population mean in simple and stratified random sampling. Commun. Stat. Theory Methods 40(2):199212.[Taylor &; Francis Online], [Web of Science ®] [Google Scholar]), Grover and Kaur (2011 Grover, L.K., Kaur, P. (2011). An improved estimator of the finite population mean in simple random sampling. Model Assisted Stat. Appl. 6(1):4755. [Google Scholar], 2014) estimators.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号