SLR: A scalable latent role model for attribute completion and tie prediction in social networks

Social networks are an important class of networks that span a wide variety of media, ranging from social websites such as Facebook and Google Plus, citation networks of academic papers and patents, caller networks in telecommunications, and hyperlinked document collections such as Wikipedia — to na...

Full description

Saved in:
Bibliographic Details
Main Authors: LIAO, Lizi, HO, Qirong, JIANG, Jing, LIM, Ee-peng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7576
https://ink.library.smu.edu.sg/cgi/viewcontent.cgi?article=8579&context=sis_research
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8579
record_format dspace
spelling sg-smu-ink.sis_research-85792022-12-12T08:10:34Z SLR: A scalable latent role model for attribute completion and tie prediction in social networks LIAO, Lizi HO, Qirong JIANG, Jing LIM, Ee-peng Social networks are an important class of networks that span a wide variety of media, ranging from social websites such as Facebook and Google Plus, citation networks of academic papers and patents, caller networks in telecommunications, and hyperlinked document collections such as Wikipedia — to name a few. Many of these social networks now exceed millions of users or actors, each of which may be associated with rich attribute data such as user profiles in social websites and caller networks, or subject classifications in document collections and citation networks. Such attribute data is often incomplete for a number of reasons — for example, users may be unwilling to spend the effort to complete their profiles, while in the case of document collections, there may be insufficient human labor to accurately classify all documents. At the same time, the tie or link information in these networks may also be incomplete — in social websites, users may simply be unaware of potential acquaintances, while in citation networks, authors may be unaware of appropriate literature that should be referenced. Completing and predicting these missing attributes and ties is important to a spectrum of applications, such as recommendation, personalized search, and targeted advertising, yet large social networks can pose a scalability challenge to existing algorithms designed for this task. Towards this end, we propose an integrative probabilistic model, SLR, that captures both attribute and tie information simultaneously, and can be used for attribute completion and tie prediction, in order to enable the above mentioned applications. A key innovation in our model is the use of triangle motifs to represent ties in the network, in order to scale to networks with millions of nodes and beyond. Experiments on real world datasets show that SLR significantly improves the accuracy of attribute prediction and tie prediction compared to well-known methods, and our distributed, multi-machine implementation easily scales up to millions of users. In addition to fast and accurate attribute and tie prediction, we also demonstrate how SLR can identify the attributes most responsible for homophily within the network, thus revealing which attributes drive network tie formation. 2016-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7576 https://ink.library.smu.edu.sg/cgi/viewcontent.cgi?article=8579&context=sis_research http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Predictive models Probabilistic logic Scalability Algorithm design and analysis Facebook Patents Digital Communications and Networking
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Predictive models
Probabilistic logic
Scalability
Algorithm design and analysis
Facebook
Patents
Digital Communications and Networking
spellingShingle Predictive models
Probabilistic logic
Scalability
Algorithm design and analysis
Facebook
Patents
Digital Communications and Networking
LIAO, Lizi
HO, Qirong
JIANG, Jing
LIM, Ee-peng
SLR: A scalable latent role model for attribute completion and tie prediction in social networks
description Social networks are an important class of networks that span a wide variety of media, ranging from social websites such as Facebook and Google Plus, citation networks of academic papers and patents, caller networks in telecommunications, and hyperlinked document collections such as Wikipedia — to name a few. Many of these social networks now exceed millions of users or actors, each of which may be associated with rich attribute data such as user profiles in social websites and caller networks, or subject classifications in document collections and citation networks. Such attribute data is often incomplete for a number of reasons — for example, users may be unwilling to spend the effort to complete their profiles, while in the case of document collections, there may be insufficient human labor to accurately classify all documents. At the same time, the tie or link information in these networks may also be incomplete — in social websites, users may simply be unaware of potential acquaintances, while in citation networks, authors may be unaware of appropriate literature that should be referenced. Completing and predicting these missing attributes and ties is important to a spectrum of applications, such as recommendation, personalized search, and targeted advertising, yet large social networks can pose a scalability challenge to existing algorithms designed for this task. Towards this end, we propose an integrative probabilistic model, SLR, that captures both attribute and tie information simultaneously, and can be used for attribute completion and tie prediction, in order to enable the above mentioned applications. A key innovation in our model is the use of triangle motifs to represent ties in the network, in order to scale to networks with millions of nodes and beyond. Experiments on real world datasets show that SLR significantly improves the accuracy of attribute prediction and tie prediction compared to well-known methods, and our distributed, multi-machine implementation easily scales up to millions of users. In addition to fast and accurate attribute and tie prediction, we also demonstrate how SLR can identify the attributes most responsible for homophily within the network, thus revealing which attributes drive network tie formation.
format text
author LIAO, Lizi
HO, Qirong
JIANG, Jing
LIM, Ee-peng
author_facet LIAO, Lizi
HO, Qirong
JIANG, Jing
LIM, Ee-peng
author_sort LIAO, Lizi
title SLR: A scalable latent role model for attribute completion and tie prediction in social networks
title_short SLR: A scalable latent role model for attribute completion and tie prediction in social networks
title_full SLR: A scalable latent role model for attribute completion and tie prediction in social networks
title_fullStr SLR: A scalable latent role model for attribute completion and tie prediction in social networks
title_full_unstemmed SLR: A scalable latent role model for attribute completion and tie prediction in social networks
title_sort slr: a scalable latent role model for attribute completion and tie prediction in social networks
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/7576
https://ink.library.smu.edu.sg/cgi/viewcontent.cgi?article=8579&context=sis_research
_version_ 1753801200632856576