Unsupervised user identity linkage via factoid embedding
User identity linkage (UIL), the problem of matching user account across multiple online social networks (OSNs), is widely studied and important to many real-world applications. Most existing UIL solutions adopt a supervised or semisupervised approach which generally suffer from scarcity of labeled...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2018
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/4258 https://ink.library.smu.edu.sg/context/sis_research/article/5261/viewcontent/24._Dec06_2018___Unsupervised_User_Identity_Linkage_via_Factoid_Embedding__ICDM18_.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-5261 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-52612020-04-07T07:00:37Z Unsupervised user identity linkage via factoid embedding XIE, Wei MU, Xin LEE, Roy Ka Wei ZHU, Feida LIM, Ee-peng User identity linkage (UIL), the problem of matching user account across multiple online social networks (OSNs), is widely studied and important to many real-world applications. Most existing UIL solutions adopt a supervised or semisupervised approach which generally suffer from scarcity of labeled data. In this paper, we propose Factoid Embedding, a novel framework that adopts an unsupervised approach. It is designed to cope with different profile attributes, content types and network links of different OSNs. The key idea is that each piece of information about a user identity describes the real identity owner, and thus distinguishes the owner from other users. We represent such a piece of information by a factoid and model it as a triplet consisting of user identity, predicate, and an object or another user identity. By embedding these factoids, we learn the user identity latent representations and link two user identities from different OSNs if they are close to each other in the user embedding space. Our Factoid Embedding algorithm is designed such that as we learn the embedding space, each embedded factoid is “translated” into a motion in the user embedding space to bring similar user identities closer, and different user identities further apart. Extensive experiments are conducted to evaluate Factoid Embedding on two real-world OSNs data sets. The experiment results show that Factoid Embedding outperforms the state-of-the-art methods even without training data. 2018-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4258 info:doi/10.1109/ICDM.2018.00182 https://ink.library.smu.edu.sg/context/sis_research/article/5261/viewcontent/24._Dec06_2018___Unsupervised_User_Identity_Linkage_via_Factoid_Embedding__ICDM18_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University user identity linkage factoid embedding network embedding Databases and Information Systems Theory and Algorithms |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
user identity linkage factoid embedding network embedding Databases and Information Systems Theory and Algorithms |
spellingShingle |
user identity linkage factoid embedding network embedding Databases and Information Systems Theory and Algorithms XIE, Wei MU, Xin LEE, Roy Ka Wei ZHU, Feida LIM, Ee-peng Unsupervised user identity linkage via factoid embedding |
description |
User identity linkage (UIL), the problem of matching user account across multiple online social networks (OSNs), is widely studied and important to many real-world applications. Most existing UIL solutions adopt a supervised or semisupervised approach which generally suffer from scarcity of labeled data. In this paper, we propose Factoid Embedding, a novel framework that adopts an unsupervised approach. It is designed to cope with different profile attributes, content types and network links of different OSNs. The key idea is that each piece of information about a user identity describes the real identity owner, and thus distinguishes the owner from other users. We represent such a piece of information by a factoid and model it as a triplet consisting of user identity, predicate, and an object or another user identity. By embedding these factoids, we learn the user identity latent representations and link two user identities from different OSNs if they are close to each other in the user embedding space. Our Factoid Embedding algorithm is designed such that as we learn the embedding space, each embedded factoid is “translated” into a motion in the user embedding space to bring similar user identities closer, and different user identities further apart. Extensive experiments are conducted to evaluate Factoid Embedding on two real-world OSNs data sets. The experiment results show that Factoid Embedding outperforms the state-of-the-art methods even without training data. |
format |
text |
author |
XIE, Wei MU, Xin LEE, Roy Ka Wei ZHU, Feida LIM, Ee-peng |
author_facet |
XIE, Wei MU, Xin LEE, Roy Ka Wei ZHU, Feida LIM, Ee-peng |
author_sort |
XIE, Wei |
title |
Unsupervised user identity linkage via factoid embedding |
title_short |
Unsupervised user identity linkage via factoid embedding |
title_full |
Unsupervised user identity linkage via factoid embedding |
title_fullStr |
Unsupervised user identity linkage via factoid embedding |
title_full_unstemmed |
Unsupervised user identity linkage via factoid embedding |
title_sort |
unsupervised user identity linkage via factoid embedding |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2018 |
url |
https://ink.library.smu.edu.sg/sis_research/4258 https://ink.library.smu.edu.sg/context/sis_research/article/5261/viewcontent/24._Dec06_2018___Unsupervised_User_Identity_Linkage_via_Factoid_Embedding__ICDM18_.pdf |
_version_ |
1770574547416776704 |