WordFlag: Flagging inappropriate words in recorded speech through word spotting

Speech analytics is one of the most important methods used by the call center and telecommunications field in analyzing call content to improve customer satisfaction and overall business performance. One of the technologies used in speech analytics is word spotting, which is the identification of sp...

Full description

Saved in:
Bibliographic Details
Main Authors: Otic, Antonio Ray N., Ramos, Francesca Nerisse S., Torres, Ricardo Louis O., Yson, Kevin Joseph R.
Format: text
Language:English
Published: Animo Repository 2009
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etd_bachelors/11379
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etd_bachelors-12024
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etd_bachelors-120242022-03-09T03:12:22Z WordFlag: Flagging inappropriate words in recorded speech through word spotting Otic, Antonio Ray N. Ramos, Francesca Nerisse S. Torres, Ricardo Louis O. Yson, Kevin Joseph R. Speech analytics is one of the most important methods used by the call center and telecommunications field in analyzing call content to improve customer satisfaction and overall business performance. One of the technologies used in speech analytics is word spotting, which is the identification of specific words in speech. Current speech analytics systems are usually focused on providing solutions for business analysis and product-related issues based on customer speech or feedback. Most Automatic Speech Recognition (ASR) systems that make use of the word spotting technology have specific methods to disregard the speaker's other words, which are considered insignificant in analyzing calls. However, with the large number of agents needed to be hired by companies and the tight competition of the Philippines with other countries in the worldwide contact or call center industry, there is a need to aid agent training processes and issues, including how other unnecessary and inappropriate words affect the customer-agent interaction. This research focuses on the design and development of an isolated word spotting system that automatically flags inappropriate words in a speech recording to aid the call center agent training process. WordFlag is trained to flag 65 words from a predefined list and makes use of recordings from different speakers as it is also designed to be speaker-independent. The system incorporates preprocessing through noise reduction, modified isolated word endpoint detection and segmentation, MFCC feature extraction, modified Hidden Markov Models, and word-based recognition. The WordFlag's system test results show an overall recognition average of 41.25%. It was also observed that words which were trained with additional semantic variations show a higher recognition rate at 48.3% than those without variations, which had a lower rate at 31.2%. Improvements on the corpus data and application of phoneme-based recognition may be done for future projects to compare performance of similar systems. 2009-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/11379 Bachelor's Theses English Animo Repository Speech processing systems Computer Sciences
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Speech processing systems
Computer Sciences
spellingShingle Speech processing systems
Computer Sciences
Otic, Antonio Ray N.
Ramos, Francesca Nerisse S.
Torres, Ricardo Louis O.
Yson, Kevin Joseph R.
WordFlag: Flagging inappropriate words in recorded speech through word spotting
description Speech analytics is one of the most important methods used by the call center and telecommunications field in analyzing call content to improve customer satisfaction and overall business performance. One of the technologies used in speech analytics is word spotting, which is the identification of specific words in speech. Current speech analytics systems are usually focused on providing solutions for business analysis and product-related issues based on customer speech or feedback. Most Automatic Speech Recognition (ASR) systems that make use of the word spotting technology have specific methods to disregard the speaker's other words, which are considered insignificant in analyzing calls. However, with the large number of agents needed to be hired by companies and the tight competition of the Philippines with other countries in the worldwide contact or call center industry, there is a need to aid agent training processes and issues, including how other unnecessary and inappropriate words affect the customer-agent interaction. This research focuses on the design and development of an isolated word spotting system that automatically flags inappropriate words in a speech recording to aid the call center agent training process. WordFlag is trained to flag 65 words from a predefined list and makes use of recordings from different speakers as it is also designed to be speaker-independent. The system incorporates preprocessing through noise reduction, modified isolated word endpoint detection and segmentation, MFCC feature extraction, modified Hidden Markov Models, and word-based recognition. The WordFlag's system test results show an overall recognition average of 41.25%. It was also observed that words which were trained with additional semantic variations show a higher recognition rate at 48.3% than those without variations, which had a lower rate at 31.2%. Improvements on the corpus data and application of phoneme-based recognition may be done for future projects to compare performance of similar systems.
format text
author Otic, Antonio Ray N.
Ramos, Francesca Nerisse S.
Torres, Ricardo Louis O.
Yson, Kevin Joseph R.
author_facet Otic, Antonio Ray N.
Ramos, Francesca Nerisse S.
Torres, Ricardo Louis O.
Yson, Kevin Joseph R.
author_sort Otic, Antonio Ray N.
title WordFlag: Flagging inappropriate words in recorded speech through word spotting
title_short WordFlag: Flagging inappropriate words in recorded speech through word spotting
title_full WordFlag: Flagging inappropriate words in recorded speech through word spotting
title_fullStr WordFlag: Flagging inappropriate words in recorded speech through word spotting
title_full_unstemmed WordFlag: Flagging inappropriate words in recorded speech through word spotting
title_sort wordflag: flagging inappropriate words in recorded speech through word spotting
publisher Animo Repository
publishDate 2009
url https://animorepository.dlsu.edu.ph/etd_bachelors/11379
_version_ 1728621100130107392