Synthesizing naturalistic laughter: An exploratory study on modeling voiced laughter with speech synthesis techniques

This study focuses on the synthesis of naturalistic voiced laughter, and attempts to address the wide gap present in applications that involve synthetic agents. This gap lies in the interactions between human and these agents, which can in part be filled through the emulation and expression of paral...

全面介紹

Saved in:

書目詳細資料
Main Authors:	Cagampan, Bernadyn R., Ng, Henry O., Panuelos, Kevin Matthew C.H., Uy, Krystyn Kaizzle S.
格式:	text
語言:	English
出版:	Animo Repository 2013
主題:	Speech processing systems Speech synthesis Laughter Computer Sciences
在線閱讀:	https://animorepository.dlsu.edu.ph/etd_bachelors/10829
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!
機構:	De La Salle University
語言:	English

id	oai:animorepository.dlsu.edu.ph:etd_bachelors-11474
record_format	eprints
spelling	oai:animorepository.dlsu.edu.ph:etd_bachelors-114742022-02-04T08:04:38Z Synthesizing naturalistic laughter: An exploratory study on modeling voiced laughter with speech synthesis techniques Cagampan, Bernadyn R. Ng, Henry O. Panuelos, Kevin Matthew C.H. Uy, Krystyn Kaizzle S. This study focuses on the synthesis of naturalistic voiced laughter, and attempts to address the wide gap present in applications that involve synthetic agents. This gap lies in the interactions between human and these agents, which can in part be filled through the emulation and expression of paralinguistic sounds such as laughter. Most agents speak through a synthesized voice, but inserting a prerecord laughter sound in between sentences proved to score low in participation tests (Trouvain & Schroder, 2004), thus making for a compelling reason to pursue computer-generated laughter. This involves the analysis of a set of acoustic features including, but not limited to, pitch and MFCCs present in voiced laughter, and consequently the synthesis of laughter using concatenative diphone synthesis, articulatory synthesis and hidden Markov model-based statistical parametric synthesis techniques. With this in mind, the goal is to generate laughter that is perceived as acceptable and natural by evaluators. This can be validated through subjective evaluation tests where an evaluator determines the synthesized laughter from a set of clips. The results of this work show that while evaluators are primarily able to identify natural laughter from synthesized laughter, there is much doubt and little agreement on whether or not these clips were even truly natural or not. Aside from articulatory synthesis-which was consistently rated lowly- the concatenative diphone synthesis and statistical parametric synthesis techniques proved quite effective in synthesizing laughter that was rated to be even more naturalistic than samples from a spontaneous laughter database. Differences between male and female evaluator groups were found and identified through the use of decision tree models and are used to identify how certain features may have influenced the evaluation score. 2013-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/10829 Bachelor's Theses English Animo Repository Speech processing systems Speech synthesis Laughter Computer Sciences
institution	De La Salle University
building	De La Salle University Library
continent	Asia
country	Philippines Philippines
content_provider	De La Salle University Library
collection	DLSU Institutional Repository
language	English
topic	Speech processing systems Speech synthesis Laughter Computer Sciences
spellingShingle	Speech processing systems Speech synthesis Laughter Computer Sciences Cagampan, Bernadyn R. Ng, Henry O. Panuelos, Kevin Matthew C.H. Uy, Krystyn Kaizzle S. Synthesizing naturalistic laughter: An exploratory study on modeling voiced laughter with speech synthesis techniques
description	This study focuses on the synthesis of naturalistic voiced laughter, and attempts to address the wide gap present in applications that involve synthetic agents. This gap lies in the interactions between human and these agents, which can in part be filled through the emulation and expression of paralinguistic sounds such as laughter. Most agents speak through a synthesized voice, but inserting a prerecord laughter sound in between sentences proved to score low in participation tests (Trouvain & Schroder, 2004), thus making for a compelling reason to pursue computer-generated laughter. This involves the analysis of a set of acoustic features including, but not limited to, pitch and MFCCs present in voiced laughter, and consequently the synthesis of laughter using concatenative diphone synthesis, articulatory synthesis and hidden Markov model-based statistical parametric synthesis techniques. With this in mind, the goal is to generate laughter that is perceived as acceptable and natural by evaluators. This can be validated through subjective evaluation tests where an evaluator determines the synthesized laughter from a set of clips. The results of this work show that while evaluators are primarily able to identify natural laughter from synthesized laughter, there is much doubt and little agreement on whether or not these clips were even truly natural or not. Aside from articulatory synthesis-which was consistently rated lowly- the concatenative diphone synthesis and statistical parametric synthesis techniques proved quite effective in synthesizing laughter that was rated to be even more naturalistic than samples from a spontaneous laughter database. Differences between male and female evaluator groups were found and identified through the use of decision tree models and are used to identify how certain features may have influenced the evaluation score.
format	text
author	Cagampan, Bernadyn R. Ng, Henry O. Panuelos, Kevin Matthew C.H. Uy, Krystyn Kaizzle S.
author_facet	Cagampan, Bernadyn R. Ng, Henry O. Panuelos, Kevin Matthew C.H. Uy, Krystyn Kaizzle S.
author_sort	Cagampan, Bernadyn R.
title	Synthesizing naturalistic laughter: An exploratory study on modeling voiced laughter with speech synthesis techniques
title_short	Synthesizing naturalistic laughter: An exploratory study on modeling voiced laughter with speech synthesis techniques
title_full	Synthesizing naturalistic laughter: An exploratory study on modeling voiced laughter with speech synthesis techniques
title_fullStr	Synthesizing naturalistic laughter: An exploratory study on modeling voiced laughter with speech synthesis techniques
title_full_unstemmed	Synthesizing naturalistic laughter: An exploratory study on modeling voiced laughter with speech synthesis techniques
title_sort	synthesizing naturalistic laughter: an exploratory study on modeling voiced laughter with speech synthesis techniques
publisher	Animo Repository
publishDate	2013
url	https://animorepository.dlsu.edu.ph/etd_bachelors/10829
_version_	1724079086255472640

Synthesizing naturalistic laughter: An exploratory study on modeling voiced laughter with speech synthesis techniques

相似書籍