期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Weighting variables in K-means clustering

Myung-Hoe Huh 《Journal of applied statistics》2009,36(1):67-78

The aim of this study is to assign weights w ₁, …, w _m to m clustering variables Z ₁, …, Z _m, so that k groups were uncovered to reveal more meaningful within-group coherence. We propose a new criterion to be minimized, which is the sum of the weighted within-cluster sums of squares and the penalty for the heterogeneity in variable weights w ₁, …, w _m. We will present the computing algorithm for such k-means clustering, a working procedure to determine a suitable value of penalty constant and numerical examples, among which one is simulated and the other two are real. 相似文献

2.

An automatic clustering algorithm for probability density functions

《Journal of Statistical Computation and Simulation》2012,82(15):3047-3063

We propose an intuitive and computationally simple algorithm for clustering the probability density functions (pdfs). A data-driven learning mechanism is incorporated in the algorithm in order to determine the suitable widths of the clusters. The clustering results prove that the proposed algorithm is able to automatically group the pdfs and provide the optimal cluster number without any a priori information. The performance study also shows that the proposed algorithm is more efficient than existing ones. In addition, the clustering can serve as the intermediate compression tool in content-based multimedia retrieval that we apply the proposed algorithm to categorize a subset of COREL image database. And the clustering results indicate that the proposed algorithm performs well in colour image categorization. 相似文献

3.

An unsupervised, ensemble clustering algorithm: A new approach for classification of X-ray sources

S.M. Hojnacki G. Micela S.M. LaLonde E.D. Feigelson J.H. Kastner 《Statistical Methodology》2008,5(4):350-360

A large volume of CCD X-ray spectra is being generated by the Chandra X-ray Observatory (Chandra) and XMM-Newton. Automated spectral analysis and classification methods can aid in sorting, characterizing, and classifying this large volume of CCD X-ray spectra in a non-parametric fashion, complementary to current parametric model fits. We have developed an algorithm that uses multivariate statistical techniques, including an ensemble clustering method, applied for the first time for X-ray spectral classification. The algorithm uses spectral data to group similar discrete sources of X-ray emission by placing the X-ray sources in a three-dimensional spectral sequence and then grouping the ordered sources into clusters based on their spectra. This new method can handle large quantities of data and operate independently of the requirement of spectral source models and a priori knowledge concerning the nature of the sources (i.e., young stars, interacting binaries, active galactic nuclei). We apply the method to Chandra imaging spectroscopy of the young stellar clusters in the Orion Nebula Cluster and the NGC 1333 star formation region. 相似文献

4.

An LM-type test for idiosyncratic unit roots in the exact factor model with integrated factors

Xingwu Zhou 《统计学通讯:理论与方法》2017,46(10):4888-4914

We consider an exact factor model with integrated factors and propose an LM-type test for unit roots in the idiosyncratic component. We show that, for a fixed number of panel individuals (N) and when the number of time points (T) tends to infinity, the limiting distribution of the LM-type statistic is a weighted sum of independent Chi-square variables with one degree of freedom, and when T tends to infinity followed by N tending to infinity, the limiting distribution is standard normal. The results should contribute to the challenging task of deriving likelihood-based unit-root tests in dynamic factor models. 相似文献

5.

An iterative sparse algorithm for the penalized maximum likelihood estimator in mixed effects model

Won Son Jong Soo Lee Kyeong Eun Lee Johan Lim 《Journal of the Korean Statistical Society》2018,47(4):482-490

In this paper, we propose a new iterative sparse algorithm (ISA) to compute the maximum likelihood estimator (MLE) or penalized MLE of the mixed effects model. The sparse approximation based on the arrow-head (A-H) matrix is one solution which is popularly used in practice. The A-H method provides an easy computation of the inverse of the Hessian matrix and is computationally efficient. However, it often has non-negligible error in approximating the inverse of the Hessian matrix and in the estimation. Unlike the A-H method, in the ISA, the sparse approximation is applied “iteratively” to reduce the approximation error at each Newton Raphson step. The advantages of the ISA over the exact and A-H method are illustrated using several synthetic and real examples. 相似文献

6.

Geometric programming for optimal allocation of integrated samples in quality control

Miles Davis Rudolph E. Schwartz 《统计学通讯:理论与方法》2013,42(11):3235-3254

We apply geometric programming, developed by Duffin, Peterson Zener (1967), to the optimal allocation of stratified samples. As an introduction, we show how geometric programming is used to allocate samples according to Neyman (1934), using the data of Cornell (1947) and following the exposition of Cochran (1953).

Then we use geometric programming to allocate an integrated sample introduced by Schwartz (1978) for more efficient sampling of three U. S. Federal welfare quality control systems, Aid to Families with Dependent Children, Food Stamps and Medicaid.

We develop methods for setting up the allocation problem, interpreting it as a geometric programming primal problem, transforming it to the corresponding dual problem, solving that, and finding the sample sizes required in the allocation problem. We show that the integrated sample saves sampling costs. 相似文献