CompareLDA: A topic model for document comparison

A number of real-world applications require comparison of entities based on their textual representations. In this work, we develop a topic model supervised by pairwise comparisons of documents. Such a model seeks to yield topics that help to differentiate entities along some dimension of interest,...

Full description

Saved in:
Bibliographic Details
Main Authors: TKACHENKO, Maksim, LAUW, Hady Wirawan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4698
https://ink.library.smu.edu.sg/context/sis_research/article/5701/viewcontent/aaai19b.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5701
record_format dspace
spelling sg-smu-ink.sis_research-57012020-01-09T07:14:53Z CompareLDA: A topic model for document comparison TKACHENKO, Maksim LAUW, Hady Wirawan A number of real-world applications require comparison of entities based on their textual representations. In this work, we develop a topic model supervised by pairwise comparisons of documents. Such a model seeks to yield topics that help to differentiate entities along some dimension of interest, which may vary from one application to another. While previous supervised topic models consider document labels in an independent and pointwise manner, our proposed Comparative Latent Dirichlet Allocation (CompareLDA) learns predictive topic distributions that comply with the pairwise comparison observations. To fit the model, we derive a maximum likelihood estimation method via augmented variational approximation algorithm. Evaluation on several public datasets underscores the strengths of CompareLDA in modelling document comparisons. 2019-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4698 info:doi/10.1609/aaai.v33i01.33017112 https://ink.library.smu.edu.sg/context/sis_research/article/5701/viewcontent/aaai19b.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University topic model document comparison Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic topic model
document comparison
Databases and Information Systems
spellingShingle topic model
document comparison
Databases and Information Systems
TKACHENKO, Maksim
LAUW, Hady Wirawan
CompareLDA: A topic model for document comparison
description A number of real-world applications require comparison of entities based on their textual representations. In this work, we develop a topic model supervised by pairwise comparisons of documents. Such a model seeks to yield topics that help to differentiate entities along some dimension of interest, which may vary from one application to another. While previous supervised topic models consider document labels in an independent and pointwise manner, our proposed Comparative Latent Dirichlet Allocation (CompareLDA) learns predictive topic distributions that comply with the pairwise comparison observations. To fit the model, we derive a maximum likelihood estimation method via augmented variational approximation algorithm. Evaluation on several public datasets underscores the strengths of CompareLDA in modelling document comparisons.
format text
author TKACHENKO, Maksim
LAUW, Hady Wirawan
author_facet TKACHENKO, Maksim
LAUW, Hady Wirawan
author_sort TKACHENKO, Maksim
title CompareLDA: A topic model for document comparison
title_short CompareLDA: A topic model for document comparison
title_full CompareLDA: A topic model for document comparison
title_fullStr CompareLDA: A topic model for document comparison
title_full_unstemmed CompareLDA: A topic model for document comparison
title_sort comparelda: a topic model for document comparison
publisher Institutional Knowledge at Singapore Management University
publishDate 2019
url https://ink.library.smu.edu.sg/sis_research/4698
https://ink.library.smu.edu.sg/context/sis_research/article/5701/viewcontent/aaai19b.pdf
_version_ 1770574982576865280