Topic modeling for multi-aspect listwise comparison

As a well-established probabilistic method, topic models seek to uncover latent semantics from plain text. In addition to having textual content, we observe that documents are usually compared in listwise rankings based on their content. For instance, world-wide countries are compared in an internat...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHANG, Delvin Ce, LAUW, Hady W.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6432
https://ink.library.smu.edu.sg/context/sis_research/article/7435/viewcontent/cikm21.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7435
record_format dspace
spelling sg-smu-ink.sis_research-74352021-12-14T05:47:15Z Topic modeling for multi-aspect listwise comparison ZHANG, Delvin Ce LAUW, Hady W. As a well-established probabilistic method, topic models seek to uncover latent semantics from plain text. In addition to having textual content, we observe that documents are usually compared in listwise rankings based on their content. For instance, world-wide countries are compared in an international ranking in terms of electricity production based on their national reports. Such document comparisons constitute additional information that reveal documents' relative similarities. Incorporating them into topic modeling could yield comparative topics that help to differentiate and rank documents. Furthermore, based on different comparison criteria, the observed document comparisons usually cover multiple aspects, each expressing a distinct ranked list. For example, a country may be ranked higher in terms of electricity production, but fall behind others in terms of life expectancy or government budget. Each comparison criterion, or aspect, observes a distinct ranking. Considering such multiple aspects of comparisons based on different ranking criteria allows us to derive one set of topics that inform heterogeneous document similarities. We propose a generative topic model aimed at learning topics that are well aligned to multi-aspect listwise comparisons. Experiments on public datasets demonstrate the advantage of the proposed method in jointly modeling topics and ranked lists against baselines comprehensively. 2021-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6432 info:doi/10.1145/3459637.3482398 https://ink.library.smu.edu.sg/context/sis_research/article/7435/viewcontent/cikm21.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Generative Topic Model Text Mining Comparative Documents Databases and Information Systems Data Science
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Generative Topic Model
Text Mining
Comparative Documents
Databases and Information Systems
Data Science
spellingShingle Generative Topic Model
Text Mining
Comparative Documents
Databases and Information Systems
Data Science
ZHANG, Delvin Ce
LAUW, Hady W.
Topic modeling for multi-aspect listwise comparison
description As a well-established probabilistic method, topic models seek to uncover latent semantics from plain text. In addition to having textual content, we observe that documents are usually compared in listwise rankings based on their content. For instance, world-wide countries are compared in an international ranking in terms of electricity production based on their national reports. Such document comparisons constitute additional information that reveal documents' relative similarities. Incorporating them into topic modeling could yield comparative topics that help to differentiate and rank documents. Furthermore, based on different comparison criteria, the observed document comparisons usually cover multiple aspects, each expressing a distinct ranked list. For example, a country may be ranked higher in terms of electricity production, but fall behind others in terms of life expectancy or government budget. Each comparison criterion, or aspect, observes a distinct ranking. Considering such multiple aspects of comparisons based on different ranking criteria allows us to derive one set of topics that inform heterogeneous document similarities. We propose a generative topic model aimed at learning topics that are well aligned to multi-aspect listwise comparisons. Experiments on public datasets demonstrate the advantage of the proposed method in jointly modeling topics and ranked lists against baselines comprehensively.
format text
author ZHANG, Delvin Ce
LAUW, Hady W.
author_facet ZHANG, Delvin Ce
LAUW, Hady W.
author_sort ZHANG, Delvin Ce
title Topic modeling for multi-aspect listwise comparison
title_short Topic modeling for multi-aspect listwise comparison
title_full Topic modeling for multi-aspect listwise comparison
title_fullStr Topic modeling for multi-aspect listwise comparison
title_full_unstemmed Topic modeling for multi-aspect listwise comparison
title_sort topic modeling for multi-aspect listwise comparison
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/6432
https://ink.library.smu.edu.sg/context/sis_research/article/7435/viewcontent/cikm21.pdf
_version_ 1770575959450189824