A Probabilistic Graphical Model for Topic and Preference Discovery on Social Media

Many web applications today thrive on offering services for large-scale multimedia data, e.g., Flickr for photos and YouTube for videos. However, these data, while rich in content, are usually sparse in textual descriptive information. For example, a video clip is often associated with only a few ta...

Full description

Saved in:
Bibliographic Details
Main Authors: LIU, Lu, ZHU, Feida, ZHANG, Lei, YANG, Shiqiang
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2012
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3209
https://ink.library.smu.edu.sg/context/sis_research/article/4210/viewcontent/ProbabilisticGraphicalModelTopic_2012.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4210
record_format dspace
spelling sg-smu-ink.sis_research-42102020-01-15T15:00:38Z A Probabilistic Graphical Model for Topic and Preference Discovery on Social Media LIU, Lu ZHU, Feida ZHANG, Lei YANG, Shiqiang Many web applications today thrive on offering services for large-scale multimedia data, e.g., Flickr for photos and YouTube for videos. However, these data, while rich in content, are usually sparse in textual descriptive information. For example, a video clip is often associated with only a few tags. Moreover, the textual descriptions are often overly specific to the video content. Such characteristics make it very challenging to discover topics at a satisfactory granularity on this kind of data. In this paper, we propose a generative probabilistic model named Preference-Topic Model (PTM) to introduce the dimension of user preferences to enhance the insufficient textual information. PTM is a unified framework to combine the tasks of user preference discovery and document topic mining together. Through modeling user-document interactions, PTM cannot only discover topics and preferences simultaneously, but also enable them to inform and benefit each other in a unified framework. As a result, PTM can extract better topics and preferences from sparse data. The experimental results on real-life video application data show that PTM is superior to LDA in discovering informative topics and preferences in terms of clustering-based evaluations. Furthermore, the experimental results on DBLP data demonstrate that PTM is a general model which can be applied to other kinds of user–document interactions. 2012-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3209 info:doi/10.1016/j.neucom.2011.05.039 https://ink.library.smu.edu.sg/context/sis_research/article/4210/viewcontent/ProbabilisticGraphicalModelTopic_2012.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Social media mining Topic model Preference discovery Databases and Information Systems Digital Communications and Networking Social Media
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Social media mining
Topic model
Preference discovery
Databases and Information Systems
Digital Communications and Networking
Social Media
spellingShingle Social media mining
Topic model
Preference discovery
Databases and Information Systems
Digital Communications and Networking
Social Media
LIU, Lu
ZHU, Feida
ZHANG, Lei
YANG, Shiqiang
A Probabilistic Graphical Model for Topic and Preference Discovery on Social Media
description Many web applications today thrive on offering services for large-scale multimedia data, e.g., Flickr for photos and YouTube for videos. However, these data, while rich in content, are usually sparse in textual descriptive information. For example, a video clip is often associated with only a few tags. Moreover, the textual descriptions are often overly specific to the video content. Such characteristics make it very challenging to discover topics at a satisfactory granularity on this kind of data. In this paper, we propose a generative probabilistic model named Preference-Topic Model (PTM) to introduce the dimension of user preferences to enhance the insufficient textual information. PTM is a unified framework to combine the tasks of user preference discovery and document topic mining together. Through modeling user-document interactions, PTM cannot only discover topics and preferences simultaneously, but also enable them to inform and benefit each other in a unified framework. As a result, PTM can extract better topics and preferences from sparse data. The experimental results on real-life video application data show that PTM is superior to LDA in discovering informative topics and preferences in terms of clustering-based evaluations. Furthermore, the experimental results on DBLP data demonstrate that PTM is a general model which can be applied to other kinds of user–document interactions.
format text
author LIU, Lu
ZHU, Feida
ZHANG, Lei
YANG, Shiqiang
author_facet LIU, Lu
ZHU, Feida
ZHANG, Lei
YANG, Shiqiang
author_sort LIU, Lu
title A Probabilistic Graphical Model for Topic and Preference Discovery on Social Media
title_short A Probabilistic Graphical Model for Topic and Preference Discovery on Social Media
title_full A Probabilistic Graphical Model for Topic and Preference Discovery on Social Media
title_fullStr A Probabilistic Graphical Model for Topic and Preference Discovery on Social Media
title_full_unstemmed A Probabilistic Graphical Model for Topic and Preference Discovery on Social Media
title_sort probabilistic graphical model for topic and preference discovery on social media
publisher Institutional Knowledge at Singapore Management University
publishDate 2012
url https://ink.library.smu.edu.sg/sis_research/3209
https://ink.library.smu.edu.sg/context/sis_research/article/4210/viewcontent/ProbabilisticGraphicalModelTopic_2012.pdf
_version_ 1770572978650611712