Topic modeling for multi-aspect listwise comparison
As a well-established probabilistic method, topic models seek to uncover latent semantics from plain text. In addition to having textual content, we observe that documents are usually compared in listwise rankings based on their content. For instance, world-wide countries are compared in an internat...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2021
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6432 https://ink.library.smu.edu.sg/context/sis_research/article/7435/viewcontent/cikm21.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7435 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-74352021-12-14T05:47:15Z Topic modeling for multi-aspect listwise comparison ZHANG, Delvin Ce LAUW, Hady W. As a well-established probabilistic method, topic models seek to uncover latent semantics from plain text. In addition to having textual content, we observe that documents are usually compared in listwise rankings based on their content. For instance, world-wide countries are compared in an international ranking in terms of electricity production based on their national reports. Such document comparisons constitute additional information that reveal documents' relative similarities. Incorporating them into topic modeling could yield comparative topics that help to differentiate and rank documents. Furthermore, based on different comparison criteria, the observed document comparisons usually cover multiple aspects, each expressing a distinct ranked list. For example, a country may be ranked higher in terms of electricity production, but fall behind others in terms of life expectancy or government budget. Each comparison criterion, or aspect, observes a distinct ranking. Considering such multiple aspects of comparisons based on different ranking criteria allows us to derive one set of topics that inform heterogeneous document similarities. We propose a generative topic model aimed at learning topics that are well aligned to multi-aspect listwise comparisons. Experiments on public datasets demonstrate the advantage of the proposed method in jointly modeling topics and ranked lists against baselines comprehensively. 2021-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6432 info:doi/10.1145/3459637.3482398 https://ink.library.smu.edu.sg/context/sis_research/article/7435/viewcontent/cikm21.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Generative Topic Model Text Mining Comparative Documents Databases and Information Systems Data Science |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Generative Topic Model Text Mining Comparative Documents Databases and Information Systems Data Science |
spellingShingle |
Generative Topic Model Text Mining Comparative Documents Databases and Information Systems Data Science ZHANG, Delvin Ce LAUW, Hady W. Topic modeling for multi-aspect listwise comparison |
description |
As a well-established probabilistic method, topic models seek to uncover latent semantics from plain text. In addition to having textual content, we observe that documents are usually compared in listwise rankings based on their content. For instance, world-wide countries are compared in an international ranking in terms of electricity production based on their national reports. Such document comparisons constitute additional information that reveal documents' relative similarities. Incorporating them into topic modeling could yield comparative topics that help to differentiate and rank documents. Furthermore, based on different comparison criteria, the observed document comparisons usually cover multiple aspects, each expressing a distinct ranked list. For example, a country may be ranked higher in terms of electricity production, but fall behind others in terms of life expectancy or government budget. Each comparison criterion, or aspect, observes a distinct ranking. Considering such multiple aspects of comparisons based on different ranking criteria allows us to derive one set of topics that inform heterogeneous document similarities. We propose a generative topic model aimed at learning topics that are well aligned to multi-aspect listwise comparisons. Experiments on public datasets demonstrate the advantage of the proposed method in jointly modeling topics and ranked lists against baselines comprehensively. |
format |
text |
author |
ZHANG, Delvin Ce LAUW, Hady W. |
author_facet |
ZHANG, Delvin Ce LAUW, Hady W. |
author_sort |
ZHANG, Delvin Ce |
title |
Topic modeling for multi-aspect listwise comparison |
title_short |
Topic modeling for multi-aspect listwise comparison |
title_full |
Topic modeling for multi-aspect listwise comparison |
title_fullStr |
Topic modeling for multi-aspect listwise comparison |
title_full_unstemmed |
Topic modeling for multi-aspect listwise comparison |
title_sort |
topic modeling for multi-aspect listwise comparison |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2021 |
url |
https://ink.library.smu.edu.sg/sis_research/6432 https://ink.library.smu.edu.sg/context/sis_research/article/7435/viewcontent/cikm21.pdf |
_version_ |
1770575959450189824 |