Dynamic label propagation in social networks

Label propagation has been studied for many years, starting from a set of nodes with labels and then propagating to those without labels. In social networks, building complete user profiles like interests and affiliations contributes to the systems like link prediction, personalized feeding, etc. Si...

Full description

Saved in:
Bibliographic Details
Main Authors: DU, Juan, ZHU, Feida, LIM, Ee Peng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2013
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/1733
https://ink.library.smu.edu.sg/context/sis_research/article/2732/viewcontent/chp_3A10.1007_2F978_3_642_37450_0_14.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-2732
record_format dspace
spelling sg-smu-ink.sis_research-27322018-06-18T06:02:15Z Dynamic label propagation in social networks DU, Juan ZHU, Feida LIM, Ee Peng Label propagation has been studied for many years, starting from a set of nodes with labels and then propagating to those without labels. In social networks, building complete user profiles like interests and affiliations contributes to the systems like link prediction, personalized feeding, etc. Since the labels for each user are mostly not filled, we often employ some people to label these users. And therefore, the cost of human labeling is high if the data set is large. To reduce the expense, we need to select the optimal data set for labeling, which produces the best propagation result. In this paper, we proposed two algorithms for the selection of the optimal data set for labeling, which is the greedy and greedyMax algorithms according to different user input. We select the data set according to two scenarios, which are 1) finding top-K nodes for labeling and then propagating as much nodes as possible, and 2) finding a minimal set of nodes for labeling and then propagating the whole network with at least one label. Furthermore, we analyze the network structure that affects the selection and propagation results. Our algorithms are suitable for most propagation algorithms. In the experiment part, we evaluate our algorithms based on 500 networks extracted from the film-actor table in freebase according to the two different scenarios. The performance including input percentage, time cost, precision and f1-score were present in the results. And from the results, the greedyMax could achieve higher performance with a balance of precision and time cost than the greedy algorithm. In addition, our algorithm could be adaptive to the user input in a quick response. 2013-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1733 info:doi/10.1007/978-3-642-37450-0_14 https://ink.library.smu.edu.sg/context/sis_research/article/2732/viewcontent/chp_3A10.1007_2F978_3_642_37450_0_14.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing Social Media
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
Social Media
spellingShingle Databases and Information Systems
Numerical Analysis and Scientific Computing
Social Media
DU, Juan
ZHU, Feida
LIM, Ee Peng
Dynamic label propagation in social networks
description Label propagation has been studied for many years, starting from a set of nodes with labels and then propagating to those without labels. In social networks, building complete user profiles like interests and affiliations contributes to the systems like link prediction, personalized feeding, etc. Since the labels for each user are mostly not filled, we often employ some people to label these users. And therefore, the cost of human labeling is high if the data set is large. To reduce the expense, we need to select the optimal data set for labeling, which produces the best propagation result. In this paper, we proposed two algorithms for the selection of the optimal data set for labeling, which is the greedy and greedyMax algorithms according to different user input. We select the data set according to two scenarios, which are 1) finding top-K nodes for labeling and then propagating as much nodes as possible, and 2) finding a minimal set of nodes for labeling and then propagating the whole network with at least one label. Furthermore, we analyze the network structure that affects the selection and propagation results. Our algorithms are suitable for most propagation algorithms. In the experiment part, we evaluate our algorithms based on 500 networks extracted from the film-actor table in freebase according to the two different scenarios. The performance including input percentage, time cost, precision and f1-score were present in the results. And from the results, the greedyMax could achieve higher performance with a balance of precision and time cost than the greedy algorithm. In addition, our algorithm could be adaptive to the user input in a quick response.
format text
author DU, Juan
ZHU, Feida
LIM, Ee Peng
author_facet DU, Juan
ZHU, Feida
LIM, Ee Peng
author_sort DU, Juan
title Dynamic label propagation in social networks
title_short Dynamic label propagation in social networks
title_full Dynamic label propagation in social networks
title_fullStr Dynamic label propagation in social networks
title_full_unstemmed Dynamic label propagation in social networks
title_sort dynamic label propagation in social networks
publisher Institutional Knowledge at Singapore Management University
publishDate 2013
url https://ink.library.smu.edu.sg/sis_research/1733
https://ink.library.smu.edu.sg/context/sis_research/article/2732/viewcontent/chp_3A10.1007_2F978_3_642_37450_0_14.pdf
_version_ 1770571484828270592