20 similar articles found (search time: 31 ms)
1.
This paper presents a new variable-weight method, called the singular value decomposition (SVD) approach, for Kohonen competitive learning (KCL) algorithms, based on the concept of Varshavsky et al. [18]. By integrating the weighted fuzzy c-means (FCM) algorithm with KCL, we propose a weighted fuzzy KCL (WFKCL) algorithm. The goal of the proposed WFKCL algorithm is to reduce the clustering error rate when the data contain some noise variables. Compared with k-means, FCM and KCL with existing variable-weight methods, the proposed WFKCL algorithm with the proposed SVD weight method provides better clustering performance based on the error rate criterion. Furthermore, the complexity of the proposed SVD approach is lower than that of the methods of Pal et al. [17], Wang et al. [19] and Hung et al. [9].
2.
Analysis of discrete lifetime data under middle-censoring and in the presence of covariates (cited by 1: 0 self-citations, 1 by others)
S. Rao Jammalamadaka 《Journal of applied statistics》2015,42(4):905-913
‘Middle censoring’ is a very general censoring scheme where the actual value of an observation in the data becomes unobservable if it falls inside a random interval (L, R) and includes both left and right censoring. In this paper, we consider discrete lifetime data that follow a geometric distribution that is subject to middle censoring. Two major innovations in this paper, compared to the earlier work of Davarzani and Parsian [3], include (i) an extension and generalization to the case where covariates are present along with the data and (ii) an alternate approach and proofs which exploit the simple relationship between the geometric and the exponential distributions, so that the theory is more in line with the work of Iyer et al. [6]. It is also demonstrated that this kind of discretization of lifetimes gives results that are close to the original data involving exponential lifetimes. Maximum likelihood estimation of the parameters is studied for this middle-censoring scheme with covariates, and their large-sample distributions are discussed. Simulation results indicate how well the proposed estimation methods work, and an illustrative example using time-to-pregnancy data from Baird and Wilcox [1] is included.
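The relationship between the geometric and the exponential distributions mentioned in this abstract can be illustrated with a minimal sketch. It assumes an exponential lifetime T with rate `lam` that is discretized by rounding up to the next integer; this convention, the rate value and the function names are assumptions for illustration and are not taken from the paper. Under that assumption, P(k-1 < T <= k) equals the geometric probability (1-p)^(k-1) p with p = 1 - exp(-lam).

```python
import math

def discretized_exponential_pmf(k, lam):
    """P(k-1 < T <= k) for T ~ Exponential(rate=lam), k = 1, 2, ..."""
    return math.exp(-lam * (k - 1)) - math.exp(-lam * k)

def geometric_pmf(k, p):
    """P(N = k) = (1 - p)**(k - 1) * p for k = 1, 2, ..."""
    return (1.0 - p) ** (k - 1) * p

lam = 0.3                    # hypothetical rate, chosen only for illustration
p = 1.0 - math.exp(-lam)     # matching geometric success probability
for k in range(1, 6):
    # The two columns agree up to floating-point rounding.
    print(k, discretized_exponential_pmf(k, lam), geometric_pmf(k, p))
```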
3.
Hisham Hilow 《Journal of applied statistics》2014,41(4):802-816
Time trend resistant fractional factorial experiments have often been based on regular fractionated designs, for which several algorithms exist for sequencing the runs in a minimum number of factor-level changes (i.e. minimum cost) such that main effects and/or two-factor interactions are orthogonal to, and free from aliasing with, the time trend that may be present in the sequentially generated responses. On the other hand, only one algorithm exists for sequencing the runs of the more economical non-regular fractional factorial experiments, namely that of Angelopoulos et al. [1]. This research studies sequential factorial experimentation under non-regular fractionated designs and constructs a catalog of 8 minimum cost linear trend-free 12-run designs (of resolution III) in 4 up to 11 two-level factors by applying the interactions-main effects assignment technique of Cheng and Jacroux [3] to the standard 12-run Plackett–Burman design, where factor-level changes between runs are minimal and where main effects are orthogonal to the linear time trend. These eight 12-run designs are non-orthogonal but are more economical than the linear trend-free designs of Angelopoulos et al. [1], since they can accommodate a larger number of two-level factors in a smaller number of experimental runs. These non-regular designs are also more economical than many regular trend-free designs. The following will be provided for each proposed systematic design:
(1) The run order in minimum number of factor-level changes.
(2) The total number of factor-level changes between the 12 runs (i.e. the cost).
(3) The closed-form least-squares contrast estimates for all main effects as well as their closed-form variance–covariance structure.
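The two run-order properties named in this abstract, the cost (number of factor-level changes) and orthogonality of a main effect to a linear time trend, can be checked with a small sketch. The 4-run design below is hypothetical and is not one of the 12-run designs from the paper; the function names are likewise assumptions.

```python
def count_level_changes(design):
    """Total number of factor-level changes between consecutive runs (the cost)."""
    return sum(
        1
        for prev, curr in zip(design, design[1:])
        for a, b in zip(prev, curr)
        if a != b
    )

def is_linear_trend_free(design, factor, tol=1e-12):
    """True if the +/-1 column of `factor` is orthogonal to the centred linear trend."""
    n = len(design)
    mean_t = (n + 1) / 2  # mean of the run times 1, 2, ..., n
    dot = sum(run[factor] * (t - mean_t) for t, run in enumerate(design, start=1))
    return abs(dot) < tol

# Hypothetical run order: 4 runs, 2 two-level factors coded as +1 / -1.
runs = [(+1, +1), (-1, +1), (-1, -1), (+1, -1)]
print(count_level_changes(runs))
print(is_linear_trend_free(runs, 0), is_linear_trend_free(runs, 1))
```

In this toy run order the first factor is linear trend-free while the second is not, which shows why the order of the runs matters and not only the set of runs.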
4.
A generalization of Chauvenet's test (see Bol'shev, L. N. 1969. On tests for rejecting outlying observations. Trudy In-ta prikladnoi Mat. Tblissi Gosudart. univ. 2:159–177. (In Russian); Voinov, V. G., Nikulin, M. N. 1996. Unbiased Estimators and Their Applications. Vol. 2. Kluwer Academic Publishers.) suitable for application to the problem of detecting r outliers in a univariate data set is proposed. In the exponential case, Chauvenet's test can be used. Various modifications of this test were considered by Bol'shev, Ibrakimov and Khalfina (Ibrakimov, I. A., Khalfina 1978. Some asymptotic results concerning the Chauvenet test. Ter. Veroyatnost. i Primenen. 23(3):593–597.), and Greenwood and Nikulin (Greenwood, Nikulin, P. E. 1996. A Guide to Chi-Squared Testing. New York: John Wiley and Sons, Inc.), depending on the choice of the estimation method used: MLE or MVUE. Procedures for testing one outlier in the exponential model have been investigated by a number of authors, including Chikkagoudar and Kunchur (Chikkagoudar, M. S., Kunchur, S. H. 1983. Distribution of test statistics for multiple outliers in exponential samples. Comm. Stat. Theory. and Meth. 12:2127–2142.), Lewis and Fieller (Lewis, T., Fieller, N. R. J. 1979. A recursive algorithm for null distribution for outliers: I. Gamma samples. Technometrics 21:371–376.), Likes (Likes, J. 1966. Distribution of Dixon's statistics in the case of an exponential population. Metrika 11:46–54.) and Kabe (Kabe, D. G. 1970. Testing outliers from an exponential population. Metrika 15:15–18.); only two types of statistics for testing multiple outliers exist. The first is Dixon's, while the second is based on the ratio of the sum of the observations suspected to be outliers to the sum of all observations of the sample. In fact, most of these authors have considered the general case of the gamma model, and the results for the exponential model are given as a special case.
The object of the present communication is to focus on alternative models, namely slippage alternatives (see Barnett, Vic., Toby Lewis 1978. Outliers in Statistical Data. New York: John Wiley and Sons, Inc.) in exponential samples. We propose a statistic different from the well-known Dixon's statistic Dr to test for multiple outliers. The distribution of the test based on this new statistic under slippage alternatives is obtained, and hence tables of critical values are given for various n (the size of the sample) and r (the number of outliers). The power of the new test is also calculated and compared with the power of Dixon's statistic (Chikkagoudar, M. S., Kunchur, S. H. 1983. Distribution of test statistics for multiple outliers in exponential samples. Comm. Stat. Theory. and Meth. 12:2127–2142.). Notice that the power of the test based on the new statistic is greater than that of the test based on Dixon's statistic.
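As a rough illustration of the second existing type of statistic described in this abstract (the ratio of the sum of the suspected outliers to the sum of all observations), the sketch below assumes that the suspected outliers are the r largest observations. That choice, the function name and the sample values are assumptions; the sketch does not implement the new statistic proposed in the paper, and no critical values are given here.

```python
def outlier_sum_ratio(sample, r):
    """Ratio of the sum of the r largest observations to the sum of all observations."""
    if not 0 < r < len(sample):
        raise ValueError("r must satisfy 0 < r < len(sample)")
    ordered = sorted(sample)
    return sum(ordered[-r:]) / sum(ordered)

# Hypothetical positive sample with two suspiciously large values.
data = [0.4, 1.1, 0.7, 2.3, 0.2, 9.5, 12.0]
print(outlier_sum_ratio(data, 2))
```

A value close to 1 means that the r suspected observations account for most of the total, which is the informal evidence such a test formalizes by comparison with tabulated critical values.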
5.
This article proposes an asymptotic expansion for the Studentized linear discriminant function using two-step monotone missing samples under multivariate normality. Asymptotic expansions related to the discriminant function have previously been obtained for complete data under multivariate normality. The result derived by Anderson (1973) plays an important role in deciding the cut-off point that controls the probabilities of misclassification. This article extends the result of Anderson (1973) to the case of two-step monotone missing samples under multivariate normality. Finally, numerical evaluations by Monte Carlo simulation are presented.
6.
《Communications in Statistics - Theory and Methods》2013,42(4):785-786
The concept of generalized order statistics was introduced by Kamps (1995) to unify several concepts that have been used in statistics, such as order statistics, record values, and sequential order statistics. Estimates of the parameters of the Burr type XII distribution are obtained based on generalized order statistics. The maximum likelihood and Bayes methods of estimation are used for this purpose. The Bayes estimates are derived by using the approximation form of Lindley (1980). Estimates based on upper records from the Burr model are obtained and compared by means of a Monte Carlo simulation study. Our results are specialized to the results of AL-Hussaini and Jaheen (1992), which are based on ordinary order statistics.
7.
《Communications in Statistics - Theory and Methods》2012,41(13-14):2445-2455
In this article, the problem of estimating the individual weights of three objects using a chemical balance weighing design is considered. We use the criterion of D-optimality. We assume that the covariance matrix of errors is that of a first-order autoregressive process. Such problems were discussed in Li and Yang (2005) and also in Yeh and Lo Huang (2005). We present some results on D-optimal designs in a certain class of designs with the design matrix X ∈ M_{n×3}(±1) such that each column of X has at least one 1 and one −1.
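The D-criterion in this setting can be sketched as follows. The sketch assumes the error covariance is proportional to the AR(1) correlation matrix S with entries rho^|i-j| and that the D-criterion is det(X^T S^{-1} X); the exact parametrization used in the paper may differ, and the 4 x 3 design and the values of `rho` are hypothetical. The closed-form tridiagonal inverse of S is a standard result for AR(1) correlation matrices.

```python
def ar1_inverse(n, rho):
    """Closed-form inverse of the n x n matrix S[i][j] = rho**abs(i - j), for n >= 2."""
    c = 1.0 / (1.0 - rho * rho)
    inv = [[0.0] * n for _ in range(n)]
    for i in range(n):
        inv[i][i] = c * (1.0 if i in (0, n - 1) else 1.0 + rho * rho)
        if i + 1 < n:
            inv[i][i + 1] = inv[i + 1][i] = -rho * c
    return inv

def information_matrix(X, rho):
    """M = X^T S^{-1} X for a design matrix X given as a list of rows."""
    n, p = len(X), len(X[0])
    s_inv = ar1_inverse(n, rho)
    sx = [[sum(s_inv[i][k] * X[k][j] for k in range(n)) for j in range(p)]
          for i in range(n)]
    return [[sum(X[k][a] * sx[k][b] for k in range(n)) for b in range(p)]
            for a in range(p)]

def det3(m):
    """Determinant of a 3 x 3 matrix by cofactor expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

# Hypothetical 4 x 3 design with entries +/-1; every column contains both 1 and -1.
X = [[1, 1, 1], [1, -1, -1], [-1, 1, -1], [-1, -1, 1]]
for rho in (0.0, 0.3, -0.3):
    print(rho, det3(information_matrix(X, rho)))
```

Comparing the printed determinants across candidate designs for a fixed `rho` is the basic operation behind a D-optimality search; a larger determinant is better.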
8.
Repeated measurement designs are widely used in medicine, pharmacology, animal sciences, and psychology. In this paper the works of Iqbal and Tahir (2009) and Iqbal, Tahir, and Ghazali (2010) are generalized for the construction of circular-balanced and circular strongly balanced repeated measurements designs through the method of cyclic shifts for three periods.
9.
I. Ardoino E. M. Biganzoli C. Bajdik P. J. Lisboa P. Boracchi F. Ambrogi 《Journal of applied statistics》2012,39(7):1409-1421
In cancer research, study of the hazard function provides useful insights into disease dynamics, as it describes the way in which the (conditional) probability of death changes with time. The widely utilized Cox proportional hazard model uses a stepwise nonparametric estimator for the baseline hazard function, and therefore has limited utility. The use of parametric models and/or other approaches that enable direct estimation of the hazard function is often invoked. A recent work by Cox et al. [6] has stimulated the use of the flexible parametric model based on the Generalized Gamma (GG) distribution, supported by the development of optimization software. The GG distribution allows estimation of different hazard shapes in a single framework. We use the GG model to investigate the shape of the hazard function in early breast cancer patients. The flexible approach based on a piecewise exponential model and the nonparametric additive hazards model are also considered.
10.
N. K. Mandal 《Communications in Statistics - Theory and Methods》2013,42(10):1565-1575
In a mixture experiment the measured response is assumed to depend only on the relative proportions of the ingredients or components present in the mixture. Scheffe (1958, 1963) first systematically considered this problem and introduced different models and designs suitable for such situations. Optimum designs for the estimation of parameters of different mixture models are available in the literature. The problem of estimating the optimum proportion of mixture components is of great practical importance. Pal and Mandal (2006, 2007) attempted to find a solution to this problem by adopting a pseudo-Bayesian approach and using the trace criterion. Subsequently, Pal and Mandal (2008) solved the problem using the minimax criterion. In this article, the deficiency criterion due to Chatterjee and Mandal (1981) has been used as a measure for comparing the performance of competing designs.
11.
Wen-Liang Hung Shou-Jen Chang-Chien Miin-Shen Yang 《Journal of applied statistics》2015,42(10):2220-2232
This paper proposes an intuitive clustering algorithm capable of automatically self-organizing data groups based on the original data structure. Comparisons between the proposed algorithm and the EM [1] and spherical k-means [7] algorithms are given. These numerical results show the effectiveness of the proposed algorithm, using the correct classification rate and the adjusted Rand index as evaluation criteria [5,6]. In 1995, Mayor and Queloz announced the detection of the first extrasolar planet (exoplanet) around a Sun-like star. Since then, observational efforts of astronomers have led to the detection of more than 1000 exoplanets. These discoveries may provide important information for understanding the formation and evolution of planetary systems. The proposed clustering algorithm is therefore used to study the data gathered on exoplanets. Two main implications are also suggested: (1) there are three major clusters, which correspond to the exoplanets in the regimes of disc, ongoing tidal and tidal interactions, respectively, and (2) the stellar metallicity does not play a key role in exoplanet migration.
12.
Classification and regression trees have been useful in medical research for constructing algorithms for disease diagnosis or prognostic prediction. Jin et al. [7] developed a robust and cost-saving tree (RACT) algorithm with application in classification of hip fracture risk after 5-year follow-up based on the data from the Study of Osteoporotic Fractures (SOF). Although conventional recursive partitioning algorithms have been well developed, they still have some limitations. Binary splits may generate a big tree with many layers, but trinary splits may produce too many nodes. In this paper, we propose a classification approach combining trinary splits and binary splits to generate a trinary–binary tree. A new non-inferiority test of entropy is used to select the binary or trinary splits. We apply the modified method in SOF to construct a trinary–binary classification rule for predicting risk of osteoporotic hip fracture. Our new classification tree has good statistical utility: it is statistically non-inferior to the optimum binary tree and the RACT based on the testing sample and is also cost-saving. It may be useful in clinical applications: femoral neck bone mineral density, age, height loss and weight gain since age 25 can identify subjects with elevated 5-year hip fracture risk without loss of statistical efficiency.
13.
Suppose that some information is available for a particular factor. The experimenter may apply the foldover technique to isolate the factor and the two-factor interactions involving this factor. In fact, in some situations this can be done by the method of semi-folding. We discuss this property in detail in this article. Furthermore, we use a computer search to find the corresponding optimal semi-folding designs for the 2^(k−p) designs tabulated in Chen et al. (1993).
14.
We consider non-parametric estimation of a continuous cdf of a random vector (X_1, X_2). With bivariate right-censored (RC) data, it is stated in van der Laan (1996, p. 59810, Ann. Statist.), Quale et al. (2006, JASA) etc. that “it is well known that the NPMLE for continuous data is inconsistent (Tsai et al. (1986)).” The claim is based on a result in Tsai et al. (1986, p.1352, Ann. Statist.) that if X_1 is right censored but X_2 is not, then common ways of defining an NPMLE lead to inconsistency. If X_1 is right censored and X_2 is type I right-censored (which includes the case in Tsai et al.), we present a consistent NPMLE. The result corrects a common misinterpretation of Tsai's example (Tsai et al., 1986, Ann. Statist.).
15.
In this article, the frequency polygon studied by Scott (1985) is investigated as a nonparametric estimator for negatively associated samples. Using a Bernstein-type inequality, we establish the uniform strong consistency of the estimator and obtain the corresponding rate under some mild conditions.
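For readers unfamiliar with the estimator, a frequency polygon can be sketched as the linear interpolation of histogram heights at the bin midpoints. The bin width `h`, the bin `origin`, the function name and the data below are assumptions chosen for illustration; the sketch does not address negative association or the consistency rate studied in the article.

```python
import math

def frequency_polygon(data, h, origin=0.0):
    """Return f(x), the linear interpolation of histogram heights at bin midpoints."""
    n = len(data)
    counts = {}
    for x in data:
        k = math.floor((x - origin) / h)  # bin k covers [origin + k*h, origin + (k+1)*h)
        counts[k] = counts.get(k, 0) + 1

    def height(k):
        # Histogram density estimate in bin k (zero for empty bins).
        return counts.get(k, 0) / (n * h)

    def f(x):
        # Find the bin midpoints that bracket x and interpolate between them.
        u = (x - origin) / h - 0.5
        k = math.floor(u)
        w = u - k  # weight on the right-hand midpoint, in [0, 1)
        return (1.0 - w) * height(k) + w * height(k + 1)

    return f

# Hypothetical sample and bin width.
sample = [0.1, 0.4, 0.6, 1.2, 1.7, 2.5]
f = frequency_polygon(sample, h=1.0)
for x in (0.5, 1.0, 1.5, 3.5):
    print(x, f(x))
```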
16.
The problem of finding D-optimal designs in the presence of a number of covariates has been considered in the one-way set-up. This is an extension of Dey and Mukerjee (2006) in the sense that for fixed replication numbers of each treatment, an alternative upper bound to the determinant of the information matrix has been found through completely symmetric C-matrices for the regression coefficients; this upper bound includes the upper bound given in Dey and Mukerjee (2006) obtained through diagonal C-matrices. Because of the fact that a smaller class of C-matrices was used at the intermediate stage where the replication numbers were fixed, ultimately some optimal designs remained unidentified there. These designs have been identified here and thereby the conjecture made in Dey and Mukerjee (2006) has been settled.
17.
Hadi Alizadeh Noughabi 《Journal of applied statistics》2015,42(9):1973-1983
The logistic distribution has been used to model growth curves in survival analysis and biological studies. In this article, we propose a goodness-of-fit test for the logistic distribution based on the empirical likelihood ratio. The test is constructed based on the methodology introduced by Vexler and Gurevich [17]. In order to compute the test statistic, parameters of the distribution are estimated by the method of maximum likelihood. Power comparisons of the proposed test with some known competing tests are carried out via simulations. Finally, an illustrative example is presented and analyzed.
18.
Austin L. Hand John A. Scott James D. Stamey Dean M. Young 《Journal of applied statistics》2016,43(9):1625-1635
Adaptive clinical trial designs can often improve drug-study efficiency by utilizing data obtained during the course of the trial. We present a novel Bayesian two-stage adaptive design for Phase II clinical trials with Poisson-distributed outcomes that allows for person-observation-time adjustments for early termination due to either futility or efficacy. Our design is motivated by the adaptive trial from [9], which uses binomial data. Although many frequentist and Bayesian two-stage adaptive designs for count data have been proposed in the literature, many designs do not allow for person-time adjustments after the first stage. This restriction limits flexibility in the study design. However, our proposed design allows for such flexibility by basing the second-stage person-time on the first-stage observed-count data. We demonstrate the implementation of our Bayesian predictive adaptive two-stage design using a hypothetical Phase II trial of Immune Globulin (Intravenous).
19.
The objective of this paper is to study U-type designs for Bayesian nonparametric response surface prediction under correlated errors. The asymptotic Bayes criterion is developed in terms of the asymptotic approach of Mitchell et al. (1994) for a more general covariance kernel proposed by Chatterjee and Qin (2011). A relationship between the asymptotic Bayes criterion and other criteria, such as orthogonality and aberration, is then developed. A lower bound for the criterion is also obtained, and numerical results show that this lower bound is tight. The established results generalize those of Yue et al. (2011) from the symmetrical case to asymmetrical U-type designs.
20.
《Communications in Statistics - Theory and Methods》2012,41(13-14):2356-2366
In this article, the optimal design problem in a fixed effects interference model with left-neighbor effects is studied. It is known (Druilhet, 1999) that circular neighbor balanced designs (CNBDs) are universally optimal in such a model. We prove the universal optimality of circular weakly neighbor balanced designs (CWNBDs), which require a smaller number of blocks than CNBDs. CWNBDs with the number of blocks smaller than the number of treatments belong to the class of partially neighbor balanced designs (PNBDs) defined by Wilkinson et al. (1983). We give a construction method for some CWNBDs, with examples.