CompareLDA: A topic model for document comparison
A number of real-world applications require comparison of entities based on their textual representations. In this work, we develop a topic model supervised by pairwise comparisons of documents. Such a model seeks to yield topics that help to differentiate entities along some dimension of interest,...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2019
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/4698 https://ink.library.smu.edu.sg/context/sis_research/article/5701/viewcontent/aaai19b.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-5701 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-57012020-01-09T07:14:53Z CompareLDA: A topic model for document comparison TKACHENKO, Maksim LAUW, Hady Wirawan A number of real-world applications require comparison of entities based on their textual representations. In this work, we develop a topic model supervised by pairwise comparisons of documents. Such a model seeks to yield topics that help to differentiate entities along some dimension of interest, which may vary from one application to another. While previous supervised topic models consider document labels in an independent and pointwise manner, our proposed Comparative Latent Dirichlet Allocation (CompareLDA) learns predictive topic distributions that comply with the pairwise comparison observations. To fit the model, we derive a maximum likelihood estimation method via augmented variational approximation algorithm. Evaluation on several public datasets underscores the strengths of CompareLDA in modelling document comparisons. 2019-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4698 info:doi/10.1609/aaai.v33i01.33017112 https://ink.library.smu.edu.sg/context/sis_research/article/5701/viewcontent/aaai19b.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University topic model document comparison Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
topic model document comparison Databases and Information Systems |
spellingShingle |
topic model document comparison Databases and Information Systems TKACHENKO, Maksim LAUW, Hady Wirawan CompareLDA: A topic model for document comparison |
description |
A number of real-world applications require comparison of entities based on their textual representations. In this work, we develop a topic model supervised by pairwise comparisons of documents. Such a model seeks to yield topics that help to differentiate entities along some dimension of interest, which may vary from one application to another. While previous supervised topic models consider document labels in an independent and pointwise manner, our proposed Comparative Latent Dirichlet Allocation (CompareLDA) learns predictive topic distributions that comply with the pairwise comparison observations. To fit the model, we derive a maximum likelihood estimation method via augmented variational approximation algorithm. Evaluation on several public datasets underscores the strengths of CompareLDA in modelling document comparisons. |
format |
text |
author |
TKACHENKO, Maksim LAUW, Hady Wirawan |
author_facet |
TKACHENKO, Maksim LAUW, Hady Wirawan |
author_sort |
TKACHENKO, Maksim |
title |
CompareLDA: A topic model for document comparison |
title_short |
CompareLDA: A topic model for document comparison |
title_full |
CompareLDA: A topic model for document comparison |
title_fullStr |
CompareLDA: A topic model for document comparison |
title_full_unstemmed |
CompareLDA: A topic model for document comparison |
title_sort |
comparelda: a topic model for document comparison |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2019 |
url |
https://ink.library.smu.edu.sg/sis_research/4698 https://ink.library.smu.edu.sg/context/sis_research/article/5701/viewcontent/aaai19b.pdf |
_version_ |
1770574982576865280 |