Sarcasm recognition in speech using a real-time approach

This study focuses on the recognition of sarcasm in speech, and attempts to address the problem of inaccuracy with regard to identifying this particular audio signal. This problem widens the gap between computers and humans since interactions are not completely understandable using sarcastic comment...

Full description

Saved in:

Bibliographic Details
Main Author:	Pascual, Ramon Gabriel
Format:	text
Language:	English
Published:	Animo Repository 2015
Subjects:	Speech perception Machine learning Signal processing—Digital techniques Computer Sciences
Online Access:	https://animorepository.dlsu.edu.ph/etd_masteral/5073
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	De La Salle University
Language:	English

id	oai:animorepository.dlsu.edu.ph:etd_masteral-11911
record_format	eprints
spelling	oai:animorepository.dlsu.edu.ph:etd_masteral-119112024-06-07T01:35:48Z Sarcasm recognition in speech using a real-time approach Pascual, Ramon Gabriel This study focuses on the recognition of sarcasm in speech, and attempts to address the problem of inaccuracy with regard to identifying this particular audio signal. This problem widens the gap between computers and humans since interactions are not completely understandable using sarcastic comments. With this in mind, the goal is to create a model capable of identifying sarcasm which is generic enough to work not only on a specific set but also on any kind of sarcastic statements using audio signals. This was accomplished using machine learning and digital signal processing techniques appropriate for real time processing. Audio features like pitch, intensity, Mel Frequency Cepstral Coefficients (MFCC), and formants were experimented on using a new acted speech corpus that was annotated as sarcastic and non sarcastic by six participants which include the researcher. By using Support Vector Machine with polynomial kernel on a data set containing 0.4 second segments with 30% overlap, an accuracy and kappa of 69% and 0.39, respectively. The results suggest that pitch, intensity and certain MFCC and formant features are predictive of sarcasm. With only 10 features, SVM with polynomial kernel processes a single 0.4 second clip in 1.2 seconds making it suitable for real time processing. 2015-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_masteral/5073 Master's Theses English Animo Repository Speech perception Machine learning Signal processing—Digital techniques Computer Sciences
institution	De La Salle University
building	De La Salle University Library
continent	Asia
country	Philippines Philippines
content_provider	De La Salle University Library
collection	DLSU Institutional Repository
language	English
topic	Speech perception Machine learning Signal processing—Digital techniques Computer Sciences
spellingShingle	Speech perception Machine learning Signal processing—Digital techniques Computer Sciences Pascual, Ramon Gabriel Sarcasm recognition in speech using a real-time approach
description	This study focuses on the recognition of sarcasm in speech, and attempts to address the problem of inaccuracy with regard to identifying this particular audio signal. This problem widens the gap between computers and humans since interactions are not completely understandable using sarcastic comments. With this in mind, the goal is to create a model capable of identifying sarcasm which is generic enough to work not only on a specific set but also on any kind of sarcastic statements using audio signals. This was accomplished using machine learning and digital signal processing techniques appropriate for real time processing. Audio features like pitch, intensity, Mel Frequency Cepstral Coefficients (MFCC), and formants were experimented on using a new acted speech corpus that was annotated as sarcastic and non sarcastic by six participants which include the researcher. By using Support Vector Machine with polynomial kernel on a data set containing 0.4 second segments with 30% overlap, an accuracy and kappa of 69% and 0.39, respectively. The results suggest that pitch, intensity and certain MFCC and formant features are predictive of sarcasm. With only 10 features, SVM with polynomial kernel processes a single 0.4 second clip in 1.2 seconds making it suitable for real time processing.
format	text
author	Pascual, Ramon Gabriel
author_facet	Pascual, Ramon Gabriel
author_sort	Pascual, Ramon Gabriel
title	Sarcasm recognition in speech using a real-time approach
title_short	Sarcasm recognition in speech using a real-time approach
title_full	Sarcasm recognition in speech using a real-time approach
title_fullStr	Sarcasm recognition in speech using a real-time approach
title_full_unstemmed	Sarcasm recognition in speech using a real-time approach
title_sort	sarcasm recognition in speech using a real-time approach
publisher	Animo Repository
publishDate	2015
url	https://animorepository.dlsu.edu.ph/etd_masteral/5073
_version_	1802997418321510400

Sarcasm recognition in speech using a real-time approach

Similar Items