Multi-source propagation aware network clustering

Network cluster analysis is of great importance as it is closely related to diverse applications, such as social community detection, biological module identification, and document segmentation. Aiming to effectively uncover clusters in the network data, a number of computational approaches, which u...

Full description

Saved in:
Bibliographic Details
Main Authors: He, Tiantian, Ong, Yew-Soon, Hu, Pengwei
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2021
Subjects:
Online Access:https://hdl.handle.net/10356/148397
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-148397
record_format dspace
spelling sg-ntu-dr.10356-1483972021-05-27T07:43:49Z Multi-source propagation aware network clustering He, Tiantian Ong, Yew-Soon Hu, Pengwei School of Computer Science and Engineering Data Science and Artificial Intelligence Research Centre Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Document and text processing Network Cluster Analysis Matrix Factorization Network cluster analysis is of great importance as it is closely related to diverse applications, such as social community detection, biological module identification, and document segmentation. Aiming to effectively uncover clusters in the network data, a number of computational approaches, which utilize network topology, single vector of vertex features, or both the aforementioned, have been proposed. However, most prevalent approaches are incapable of dealing with those contemporary network data whose vertices are characterized by features collected from multiple sources. To address this challenge, in this paper, we propose a novel framework, dubbed Multi-Source Propagation Aware Network Clustering (MSPANC) for uncovering clusters in network data possessing multiple sources of vertex features. Different from most previous approaches, MSPANC is able to infer the cluster preference for each vertex utilizing both network topology and multi-source vertex features. To improve the practical significance of the discovered clusters, the learning of cluster membership is also involved into the modeling of the maximization of intra-cluster propagation regarding multi-source features. We propose a unified objective function for MSPANC to perform the clustering task and derive an alternative manner of learning algorithm for model optimization. Besides, we theoretically prove the convergence of the algorithm for optimizing MSPANC. The proposed model has been tested on five real-world datasets, including social, biological and document networks, and has been compared with several competitive baselines. The remarkable experimental results validate the effectiveness of MSPANC. AI Singapore Accepted version This research is supported by the National Research Foundation Singapore under its AI Singapore Programme [Award Number: AISG-RP- 2018-004], the National Natural Science Foundation of China under Grant 61802317, and Data Science and Artifical Intelligence Research Center, Nanyang Technological University. 2021-05-27T07:26:38Z 2021-05-27T07:26:38Z 2021 Journal Article He, T., Ong, Y. & Hu, P. (2021). Multi-source propagation aware network clustering. Neurocomputing, 453, 119-130. https://dx.doi.org/10.1016/j.neucom.2021.04.064 0925-2312 https://hdl.handle.net/10356/148397 10.1016/j.neucom.2021.04.064 453 119 130 en AISG-RP-2018-004 Neurocomputing © 2021 Elsevier B.V. All rights reserved. This paper was published in Neurocomputing and is made available with permission of Elsevier B.V. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Network Cluster Analysis
Matrix Factorization
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Network Cluster Analysis
Matrix Factorization
He, Tiantian
Ong, Yew-Soon
Hu, Pengwei
Multi-source propagation aware network clustering
description Network cluster analysis is of great importance as it is closely related to diverse applications, such as social community detection, biological module identification, and document segmentation. Aiming to effectively uncover clusters in the network data, a number of computational approaches, which utilize network topology, single vector of vertex features, or both the aforementioned, have been proposed. However, most prevalent approaches are incapable of dealing with those contemporary network data whose vertices are characterized by features collected from multiple sources. To address this challenge, in this paper, we propose a novel framework, dubbed Multi-Source Propagation Aware Network Clustering (MSPANC) for uncovering clusters in network data possessing multiple sources of vertex features. Different from most previous approaches, MSPANC is able to infer the cluster preference for each vertex utilizing both network topology and multi-source vertex features. To improve the practical significance of the discovered clusters, the learning of cluster membership is also involved into the modeling of the maximization of intra-cluster propagation regarding multi-source features. We propose a unified objective function for MSPANC to perform the clustering task and derive an alternative manner of learning algorithm for model optimization. Besides, we theoretically prove the convergence of the algorithm for optimizing MSPANC. The proposed model has been tested on five real-world datasets, including social, biological and document networks, and has been compared with several competitive baselines. The remarkable experimental results validate the effectiveness of MSPANC.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
He, Tiantian
Ong, Yew-Soon
Hu, Pengwei
format Article
author He, Tiantian
Ong, Yew-Soon
Hu, Pengwei
author_sort He, Tiantian
title Multi-source propagation aware network clustering
title_short Multi-source propagation aware network clustering
title_full Multi-source propagation aware network clustering
title_fullStr Multi-source propagation aware network clustering
title_full_unstemmed Multi-source propagation aware network clustering
title_sort multi-source propagation aware network clustering
publishDate 2021
url https://hdl.handle.net/10356/148397
_version_ 1701270504609939456