Who is missing? Characterizing the participation of different demographic groups in a Korean nationwide daily conversation corpus

A conversation corpus is essential to build interactive AI applications. However, the demographic information of the participants in such corpora is largely underexplored mainly due to the lack of individual data in many corpora. In this work, we analyze a Korean nationwide daily conversation corpus...

Full description

Saved in:
Bibliographic Details
Main Authors: KWAK, Haewoon, AN, Jisun, PARK, Kunwoo
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7499
https://ink.library.smu.edu.sg/context/sis_research/article/8502/viewcontent/19397_Article_Text_23410_1_2_20220531.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
Description
Summary:A conversation corpus is essential to build interactive AI applications. However, the demographic information of the participants in such corpora is largely underexplored mainly due to the lack of individual data in many corpora. In this work, we analyze a Korean nationwide daily conversation corpus constructed by the National Institute of Korean Language (NIKL) to characterize the participation of different demographic (age and sex) groups in the corpus.