Real-time comprehensive sociometrics for two-person dialogs
A real-time system is proposed to quantitatively assess speaking mannerisms and social behavior from audio recordings of two-person dialogs. Speaking mannerisms are quantitatively assessed by low-level speech metrics such as volume, rate, and pitch of speech. The social behavior is quantified by soc...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/101166 http://hdl.handle.net/10220/18315 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-101166 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1011662023-05-19T06:44:44Z Real-time comprehensive sociometrics for two-person dialogs Dauwels, Shoko Rasheed, Umer Tahir, Yasir Dauwels, Justin Thalmann, Daniel Nanyang Business School School of Electrical and Electronic Engineering International Workshop, Human Behavior Understanding (4th : 2013 : Barcelona, Spain) Institute for Media Innovation DRNTU::Social sciences::Sociology::Social behavior A real-time system is proposed to quantitatively assess speaking mannerisms and social behavior from audio recordings of two-person dialogs. Speaking mannerisms are quantitatively assessed by low-level speech metrics such as volume, rate, and pitch of speech. The social behavior is quantified by sociometrics including level of interest, agreement, and dominance. Such quantitative measures can be used to provide real-time feedback to the speakers, for instance, to alarm to speaker when the voice is too strong (speaking mannerism), or when the conversation is not proceeding well due to disagreements or numerous interruptions (social behavior). In the proposed approach, machine learning algorithms are designed to compute the sociometrics (level of interest, agreement, and dominance) in real-time from combinations of low-level speech metrics. To this end, a corpus of 150 brief two-person dialogs in English was collected. Several experts assessed the sociometrics for each of those dialogs. Next, the resulting annotated dialogs are used to train the machine learning algorithms in a supervised manner. Through this training procedure, the algorithms learn how the sociometrics depend on the low-level speech metrics, and consequently, are able to compute the sociometrics from speech recordings in an automated fashion, without further help of experts. Numerical tests through leave-one-out cross-validation indicate that the accuracy of the algorithms for inferring the sociometrics is in the range of 80-90%. In future, those reliable predictions can be the key to real-time sociofeedback, where speakers will be provided feedback in real-time about their behavior in an ongoing discussion. Such technology may be helpful in many contexts, for instance in group meetings, counseling, or executive training. Accepted version 2013-12-18T06:32:22Z 2019-12-06T20:34:29Z 2013-12-18T06:32:22Z 2019-12-06T20:34:29Z 2013 2013 Conference Paper Magnenat-Thalmann, N., Rasheed, U., Tahir, Y., Dauwels, S., Dauwels, J., & Thalmann, D. (2013). Real-time comprehensive sociometrics for two-person dialogs. Proceedings of the 4th International Workshop, Human Behavior Understanding (HBU 2013), LNCS 8212, 196-208. https://hdl.handle.net/10356/101166 http://hdl.handle.net/10220/18315 10.1007/978-3-319-02714-2_17 en © 2013 Springer International Publishing Switzerland. This is the author created version of a work that has been peer reviewed and accepted for publication by Proceedings of the 4th International Workshop, Human Behavior Understanding (HBU 2013), Springer International Publishing Switzerland. It incorporates referee’s comments but changes resulting from the publishing process, such as copyediting, structural formatting, may not be reflected in this document. The published version is available at: [http://dx.doi.org/10.1007/978-3-319-02714-2_17]. 13 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Social sciences::Sociology::Social behavior |
spellingShingle |
DRNTU::Social sciences::Sociology::Social behavior Dauwels, Shoko Rasheed, Umer Tahir, Yasir Dauwels, Justin Thalmann, Daniel Real-time comprehensive sociometrics for two-person dialogs |
description |
A real-time system is proposed to quantitatively assess speaking mannerisms and social behavior from audio recordings of two-person dialogs. Speaking mannerisms are quantitatively assessed by low-level speech metrics such as volume, rate, and pitch of speech. The social behavior is quantified by sociometrics including level of interest, agreement, and dominance. Such quantitative measures can be used to provide real-time feedback to the speakers, for instance, to alarm to speaker when the voice is too strong (speaking mannerism), or when the conversation is not proceeding well due to disagreements or numerous interruptions (social behavior). In the proposed approach, machine learning algorithms are designed to compute the sociometrics (level of interest, agreement, and dominance) in real-time from combinations of low-level speech metrics. To this end, a corpus of 150 brief two-person dialogs in English was collected. Several experts assessed the sociometrics for each of those dialogs. Next, the resulting annotated dialogs are used to train the machine learning algorithms in a supervised manner. Through this training procedure, the algorithms learn how the sociometrics depend on the low-level speech metrics, and consequently, are able to compute the sociometrics from speech recordings in an automated fashion, without further help of experts. Numerical tests through leave-one-out cross-validation indicate that the accuracy of the algorithms for inferring the sociometrics is in the range of 80-90%. In future, those reliable predictions can be the key to real-time sociofeedback, where speakers will be provided feedback in real-time about their behavior in an ongoing discussion. Such technology may be helpful in many contexts, for instance in group meetings, counseling, or executive training. |
author2 |
Nanyang Business School |
author_facet |
Nanyang Business School Dauwels, Shoko Rasheed, Umer Tahir, Yasir Dauwels, Justin Thalmann, Daniel |
format |
Conference or Workshop Item |
author |
Dauwels, Shoko Rasheed, Umer Tahir, Yasir Dauwels, Justin Thalmann, Daniel |
author_sort |
Dauwels, Shoko |
title |
Real-time comprehensive sociometrics for two-person dialogs |
title_short |
Real-time comprehensive sociometrics for two-person dialogs |
title_full |
Real-time comprehensive sociometrics for two-person dialogs |
title_fullStr |
Real-time comprehensive sociometrics for two-person dialogs |
title_full_unstemmed |
Real-time comprehensive sociometrics for two-person dialogs |
title_sort |
real-time comprehensive sociometrics for two-person dialogs |
publishDate |
2013 |
url |
https://hdl.handle.net/10356/101166 http://hdl.handle.net/10220/18315 |
_version_ |
1770565155164258304 |