共查询到20条相似文献,搜索用时 93 毫秒
1.
Li et al. (2011) presented the novel idea of using support vector machines (SVMs) to perform sufficient dimension reduction. In this work, we investigate the potential improvement in recovering the dimension reduction subspace when one changes the SVM algorithm to treat imbalance based on several proposals in the machine learning literature. We find out that in most situations, treating the imbalanced nature of the slices will help improve the estimation. Our results are verified through simulation and real data applications. 相似文献
2.
M. Pilar Alonso Asunción Beamonte Manuel Salvador 《Journal of applied statistics》2015,42(5):1043-1063
In this paper a methodology for the delineation of local labour markets (LLMs) using evolutionary algorithms is proposed. This procedure, based on that in Flórez-Revuelta et al. [13,14], introduces three modifications. First, initial groups of municipalities with a minimum size requirement are built using the travel time between them. Second, a not fully random initiation algorithm is proposed. And third, as a final stage of the procedure, a contiguity step is implemented. These modifications significantly decrease the computational times of the algorithm (up to a 99%) without any deterioration of the quality of the solutions. The optimization algorithm may give a set of potential solutions with very similar values with respect to the objective function what would lead to different partitions, both in terms of number of markets and their composition. In order to capture their common aspects an algorithm based on a cluster partitioning of k-means type is presented. This stage of the procedure also provides a ranking of LLMs foci useful for planners and administrations in decision-making processes on issues related to labour activities. Finally, to evaluate the performance of the algorithm a toy example with artificial data is analysed. The full methodology is illustrated through a real commuting data set of the region of Aragón (Spain). 相似文献
3.
In this paper, we propose an asymmetric class of bivariate copulas. This class is obtained through limiting properties of the extended copula introduced by Bekrizadeh, et al. (2015), and includes some of known copulas. Some general formulas for well-known association measures and concepts of dependence of the proposed model are obtained. This paper highlights the usefulness of this new bivariate copula for modeling the interested variables whose marginal distribution effect on joint distribution isn't identical. We apply some subfamilies of this new class to model a dataset of medical science to show the superiority of presented model in comparison with the known copulas. These results will be investigated using simulation. 相似文献
4.
Junyong Park Jayson D. Wilbur Jayanta K. Ghosh Cindy H. Nakatsu Corinne Ackerman 《统计学通讯:模拟与计算》2013,42(4):855-869
We adopt boosting for classification and selection of high-dimensional binary variables for which classical methods based on normality and non singular sample dispersion are inapplicable. Boosting seems particularly well suited for binary variables. We present three methods of which two combine boosting with the relatively classical variable selection methods developed in Wilbur et al. (2002). Our primary interest is variable selection in classification with small misclassification error being used as validation of proposed method for variable selection. Two of the new methods perform uniformly better than Wilbur et al. (2002) in one set of simulated and three real life examples. 相似文献
5.
Artūras Juodis 《Econometric Reviews》2018,37(6):650-693
This article considers estimation of Panel Vector Autoregressive Models of order 1 (PVAR(1)) with focus on fixed T consistent estimation methods in First Differences (FD) with additional strictly exogenous regressors. Additional results for the Panel FD ordinary least squares (OLS) estimator and the FDLS type estimator of Han and Phillips (2010) are provided. Furthermore, we simplify the analysis of Binder et al. (2005) by providing additional analytical results and extend the original model by taking into account possible cross-sectional heteroscedasticity and presence of strictly exogenous regressors. We show that in the three wave panel the log-likelihood function of the unrestricted Transformed Maximum Likelihood (TML) estimator might violate the global identification assumption. The finite-sample performance of the analyzed methods is investigated in a Monte Carlo study. 相似文献
6.
Shesh N. Rai Jianmin Pan Xiaobin Yuan Jianguo Sun Melissa M. Hudson Deo K. Srivastava 《统计学通讯:理论与方法》2013,42(17):3117-3133
New drug discovery in the pediatrics has dramatically improved survival, but with long- term adverse events. This motivates the examination of adverse outcomes such as long-term toxicity in a phase IV trial. An ideal approach to monitor long-term toxicity is to systematically follow the survivors, which is generally not feasible. Instead, cross-sectional surveys are conducted in Hudson et al. (2007), with one of the objectives to estimate the cumulative incidence rates along with specific interest in fixed-term (5 or 10 year) rates. We present inference procedures based on current status data to our motivating example with very interesting findings. 相似文献
7.
Mosisa Aga 《统计学通讯:理论与方法》2013,42(4):663-673
This article provides an Edgeworth expansion for the distribution of the log-likelihood derivative LLD of the parameter of a time series generated by a linear regression model with Gaussian, stationary, and long-memory errors. Under some sets of conditions on the regression coefficients, the spectral density function, and the parameter values, an Edgeworth expansion of the density as well as the distribution function of a vector of centered and normalized derivatives of the plug-in log-likelihood PLL function of arbitrarily large order is established. This is done by extending the results of Lieberman et al. (2003), who provided an Edgeworth expansion for the Gaussian stationary long-memory case, to our present model, which is a linear regression process with stationary Gaussian long-memory errors. 相似文献
8.
In this work, we propose the construction of a chi-squared goodness-of-fit test in censored data case, for Bertholon model which can analyse various competing risks of failure or death. This test is based on a modification of the Nikulin-Rao-Robson (NRR) statistic proposed by Bagdonavicius and Nikulin (2011a, 2011b) for censored data. We applied this test to numerical examples from simulated samples and real data. 相似文献
9.
Based on the recursions in Huffer (1988) and Huffer and Lin (2001), we present a two-stage algorithm and two specialized methods for evaluating the probabilities involving linear combination of spacings of special forms. The two-stage algorithm combines the advantages of marking algorithm in Huffer and Lin (1997) and general algorithm in Huffer and Lin (2001). The proposed methods can analytically derive the exact expressions for some specific problems, and efficiently handle problems such as the distribution of the circular scan statistic and multiple coverage probabilities. 相似文献
10.
Accelerated failure time models are useful in survival data analysis, but such models have received little attention in the context of measurement error. In this paper we discuss an accelerated failure time model for bivariate survival data with covariates subject to measurement error. In particular, methods based on the marginal and joint models are considered. Consistency and efficiency of the resultant estimators are investigated. Simulation studies are carried out to evaluate the performance of the estimators as well as the impact of ignoring the measurement error of covariates. As an illustration we apply the proposed methods to analyze a data set arising from the Busselton Health Study (Knuiman et al., 1994). 相似文献
11.
《统计学通讯:理论与方法》2012,41(16-17):3162-3178
In this article we use a new methodology, based on algebraic strata, to generate the class of all the orthogonal arrays of given size and strength. From this class we extract all the non isomorphic orthogonal arrays. Then, using all these non isomorphic orthogonal arrays, we suggest a method based on the inequivalent matrices permutations testing procedures Basso et al. (2004) in order to obtain separate permutation tests for the effects in unreplicated mixed level fractional factorial designs. In order to validate the proposed method we perform a Monte Carlo simulation study and find out that the permutation tests appear to be a valid solution for testing effects, in particular when the usual normality assumptions cannot be justified. 相似文献
12.
This article studies the estimation of change point in panel models. We extend Bai (2010) and Feng et al. (2009) to the case of stationary or nonstationary regressors and error term, and whether the change point is present or not. We prove consistency and derive the asymptotic distributions of the Ordinary Least Squares (OLS) and First Difference (FD) estimators. We find that the FD estimator is robust for all cases considered. 相似文献
13.
We propose a new ratio type estimator for estimating the finite population mean using two auxiliary variables in stratified two-phase sampling. Expressions for bias and mean squared error of the proposed estimator are derived up to the first order of approximation. The proposed estimator is more efficient than the usual stratified sample mean estimator, traditional stratified ratio estimator and some other stratified estimators including Bahl and Tuteja (1991), Chami et al. (2012), Chand (1975), Choudhury and Singh (2012), Hamad et al. (2013), Vishwakarma and Gangele (2014), Sanaullah et al. (2014), and Chanu and Singh (2014). 相似文献
14.
15.
Housila P. Singh 《统计学通讯:理论与方法》2013,42(23):4222-4238
This article considers some classes of estimators of the population median of the study variable using information on an auxiliary variable with their properties under large sample approximation. Asymptotic optimum estimator (AOE) in each class of estimators has been investigated along with the approximate mean square error formulae. It has been shown that the proposed classes of estimators are better than these considered by Gross (1980), Kuk and Mak (1989), Singh et al. (2003a), and Al and Cingi (2009). An empirical study is carried out to judge the merits of the suggested class of estimators over other existing estimators. 相似文献
16.
Tony Vangeneugden Geert Molenberghs Geert Verbeke Clarice G.B. Demétrio 《统计学通讯:理论与方法》2014,43(19):4164-4178
In hierarchical data settings, be it of a longitudinal, spatial, multi-level, clustered, or otherwise repeated nature, often the association between repeated measurements attracts at least part of the scientific interest. Quantifying the association frequently takes the form of a correlation function, including but not limited to intraclass correlation. Vangeneugden et al. (2010) derived approximate correlation functions for longitudinal sequences of general data type, Gaussian and non-Gaussian, based on generalized linear mixed-effects models. Here, we consider the extended model family proposed by Molenberghs et al. (2010). This family flexibly accommodates data hierarchies, intra-sequence correlation, and overdispersion. The family allows for closed-form means, variance functions, and correlation function, for a variety of outcome types and link functions. Unfortunately, for binary data with logit link, closed forms cannot be obtained. This is in contrast with the probit link, for which such closed forms can be derived. It is therefore that we concentrate on the probit case. It is of interest, not only in its own right, but also as an instrument to approximate the logit case, thanks to the well-known probit-logit ‘conversion.’ Next to the general situation, some important special cases such as exchangeable clustered outcomes receive attention because they produce insightful expressions. The closed-form expressions are contrasted with the generic approximate expressions of Vangeneugden et al. (2010) and with approximations derived for the so-called logistic-beta-normal combined model. A simulation study explores performance of the method proposed. Data from a schizophrenia trial are analyzed and correlation functions derived. 相似文献
17.
《统计学通讯:理论与方法》2013,42(8-9):1789-1810
Mudholkar and Srivastava [1]adapted Mudholkar and Subbaiah's [2]modified stepwise procedure, using the trimmed means in place of the means and appropriate studentization, to construct robust tests for the significance of a mean vector. They concluded that the robust alternatives provide excellent type I error control, and a substantial gain in power over Hotelling's T 2test in case of heavy tailed populations without significant loss of power when the population is normal. In this paper we adapt the modified stepwise approach to construct simple tests for the significance of the orthant constrained mean vector of a p-variate normal population with unknown covariance matrix, and also for constructing robust tests without assuming normality. The simple normal theory tests have exact type I error, whereas the robust tests provide a reasonably type I error control and substantial power advantage over Perlman's [3]likelihood ratio test. 相似文献
18.
Viswanathan Ramakrishnan 《统计学通讯:模拟与计算》2013,42(3):405-418
In many genetic analyses of dichotomous twin data, odds ratios have been used to test hypotheses on heritability and shared common environment effects of a given disease (Lichtenstein et al., 2000; Ahlbom et al., 1997; Ramakrishnan et al., 1992, 4). However, estimates of these two effects have not been dealt with in the literature. In epidemiology, the attributable fraction (AF), a function of the odds ratio and the prevalence of the risk factor has been used to describe the contribution of a risk factor to a disease in a given population (Leviton, 1973). In this article, we adapt the AF to quantify the heritability and the shared common environment. Twin data on cancer, gallstone disease and phobia are used to illustrate the applicability of the AF estimate as a measure of heritability. 相似文献
19.
In this article, we have evaluated the performance of different forecasters and tested association between their performances for different pairs of variables. We have used three data sets of track records of professional U.S. economic forecasters participating in the Blue Chip consensus forecasting service (the data sets contain the root mean square errors (RMSE) of different forecasters for different years). To evaluate the performance of forecasters we have covered three well-known tests, namely the usual F test (cf. Fisher (1923)), Kruskal Wallis test (cf. Kruskal and Wallis (1952)), and Extension of Median test (cf. Daniel (1990)). To test the association between the forecaster's performances for different pairs of variables, we have considered Gini mean correlation coefficient rg1 (cf. Yitzhaki, S., and Olkin, I. (1991) and Yitzhaki (2003)), Modified rank correlation coefficient (cf. Zimmerman (1994)) and three modifications of Spearman rank correlation coefficient. We have observed that different forecasters do not necessarily offer same average performance. Moreover, an evidence of association between two criteria does not always lead us reaching at the same decision. The outcomes of the study may help the practitioners in selecting the best forecaster(s) for policymaking purposes. 相似文献
20.
When a sufficient correlation between the study variable and the auxiliary variable exists, the ranks of the auxiliary variable are also correlated with the study variable, and thus, these ranks can be used as an effective tool in increasing the precision of an estimator. In this paper, we propose a new improved estimator of the finite population mean that incorporates the supplementary information in forms of: (i) the auxiliary variable and (ii) ranks of the auxiliary variable. Mathematical expressions for the bias and the mean-squared error of the proposed estimator are derived under the first order of approximation. The theoretical and empirical studies reveal that the proposed estimator always performs better than the usual mean, ratio, product, exponential-ratio and -product, classical regression estimators, and Rao (1991), Singh et al. (2009), Shabbir and Gupta (2010), Grover and Kaur (2011, 2014) estimators. 相似文献