首页 | 本学科首页   官方微博 | 高级检索  
     


Model-based clustering for social networks
Authors:Mark S. Handcock   Adrian E. Raftery   Jeremy M. Tantrum
Affiliation:University of Washington, Seattle, USA; Microsoft adCenter Labs, Redmond, USA
Abstract:Summary.  Network models are widely used to represent relations between interacting units or actors. Network data often exhibit transitivity, meaning that two actors that have ties to a third actor are more likely to be tied than actors that do not, homophily by attributes of the actors or dyads, and clustering. Interest often focuses on finding clusters of actors or ties, and the number of groups in the data is typically unknown. We propose a new model, the latent position cluster model , under which the probability of a tie between two actors depends on the distance between them in an unobserved Euclidean 'social space', and the actors' locations in the latent social space arise from a mixture of distributions, each corresponding to a cluster. We propose two estimation methods: a two-stage maximum likelihood method and a fully Bayesian method that uses Markov chain Monte Carlo sampling. The former is quicker and simpler, but the latter performs better. We also propose a Bayesian way of determining the number of clusters that are present by using approximate conditional Bayes factors. Our model represents transitivity, homophily by attributes and clustering simultaneously and does not require the number of clusters to be known. The model makes it easy to simulate realistic networks with clustering, which are potentially useful as inputs to models of more complex systems of which the network is part, such as epidemic models of infectious disease. We apply the model to two networks of social relations. A free software package in the R statistical language, latentnet, is available to analyse data by using the model.
Keywords:Bayes factor    Dyad    Latent space    Markov chain Monte Carlo methods    Mixture model    Transitivity
正在获取引用信息,请稍候...
相似文献(共20条):
[1]、García-Escudero,Luis Angel,Mayo-Iscar,Agustín,Riani, Marco.Model-based clustering with determinant-and-shape constraint[J].Statistics and Computing,2020,30(5):1363-1380.
[2]、Dimitris Karlis,Anais Santourian.Model-based clustering with non-elliptically contoured distributions[J].Statistics and Computing,2009,19(1):73-83.
[3]、Ioannis Kosmidis,Dimitris Karlis.Model-based clustering using copulas with applications[J].Statistics and Computing,2016,26(5):1079-1099.
[4]、Fop,Michael,Murphy,Thomas Brendan,Scrucca, Luca.Model-based clustering with sparse covariance matrices[J].Statistics and Computing,2019,29(4):791-819.
[5]、Gertraud Malsiner-Walli,Sylvia Frühwirth-Schnatter,Bettina Grün.Model-based clustering based on sparse finite Gaussian mixtures[J].Statistics and Computing,2016,26(1-2):303-324.
[6]、Sharon X. Lee,Geoffrey J. McLachlan.Model-based clustering and classification with non-normal mixture distributions[J].Statistical Methods and Applications,2013,22(4):427-454.
[7]、Francesco, Lagona,Marco, Picone.Model-based clustering of multivariate skew data with circular components and missing values[J].Journal of applied statistics,2012,39(5):927-945.
[8]、Christophe Biernacki,Julien Jacques.Model-based clustering of multivariate ordinal data relying on a stochastic binary search algorithm[J].Statistics and Computing,2016,26(5):929-943.
[9]、Jeffrey L. Andrews,Paul D. McNicholas.Model-based clustering, classification, and discriminant analysis via mixtures of multivariate t-distributions[J].Statistics and Computing,2012,22(5):1021-1029.
[10]、Model-based clustering,classification, and discriminant analysis of data with mixed type[J].Journal of statistical planning and inference
[11]、Danyang Huang,Hansheng Wang.Spatial autoregression with repeated measurements for social networks[J].统计学通讯:理论与方法,2018,47(15):3715-3727.
[12]、Marco Corneli,Pierre Latouche,Fabrice Rossi.Multiple change points detection and clustering in dynamic networks[J].Statistics and Computing,2018,28(5):989-1007.
[13]、Md.?Abul?HasnatEmail authorModel-based hierarchical clustering with Bregman divergences and Fishers mixture model: application to depth image analysis[J].Statistics and Computing,2016,26(4):861-880.
[14]、P. J. Diggle,J. A. Tawn,& R. A. Moyeed.Model-based geostatistics[J].Journal of the Royal Statistical Society. Series C, Applied statistics,1998,47(3):299-350.
[15]、Model-based confidence bands for survival functions[J].Journal of statistical planning and inference
[16]、C. Bouveyron,P. Latouche,R. Zreik.The stochastic topic block model for the clustering of vertices in networks with textual edges[J].Statistics and Computing,2018,28(1):11-31.
[17]、Nathalie Peyrard,Régis Sabbadin,Daniel Spring,Barry Brook,Ralph Mac Nally.Model-based adaptive spatial sampling for occurrence map construction[J].Statistics and Computing,2013,23(1):29-42.
[18]、Yang,Xiaoyi,Niezink,Nynke M. D.,Nugent,Rebecca.Learning social networks from text data using covariate information[J].Statistical Methods and Applications,2021,30(5):1399-1423.
[19]、Cosine similarity-based clustering and dynamic reputation trust aware key generation scheme for trusted communication on social networking[J].Journal of Statistical Computation and Simulation
[20]、William A., Link,Evan G., Cooch,Emmanuelle, Cam.Model-based estimation of individual fitness[J].Journal of applied statistics,2002,29(1-4):207-224.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号