Discriminative classification model of filled pause and elongation for Malay language spontaneous speech / Raseeda Hamzah

Automated speech recognition (ASR) for spontaneous speech poses extra challenge compared to read speech as it contains varied speaking rates, poor phonation and disfluencies. Studies have shown that filled pause is one of the most common disfluencies of spontaneous speech characteristic where it pre...

Full description

Saved in:
Bibliographic Details
Main Author: Hamzah, Raseeda
Format: Book Section
Language:English
Published: Institute of Graduate Studies, UiTM 2016
Subjects:
Online Access:http://ir.uitm.edu.my/id/eprint/20064/1/ABS_RASEEDA%20HAMZAH%20TDRA%20VOL%2010%20IGS%2016.pdf
http://ir.uitm.edu.my/id/eprint/20064/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Mara
Language: English
id my.uitm.ir.20064
record_format eprints
spelling my.uitm.ir.200642018-06-11T03:55:39Z http://ir.uitm.edu.my/id/eprint/20064/ Discriminative classification model of filled pause and elongation for Malay language spontaneous speech / Raseeda Hamzah Hamzah, Raseeda Malaysia Malay language. General works. History Automated speech recognition (ASR) for spontaneous speech poses extra challenge compared to read speech as it contains varied speaking rates, poor phonation and disfluencies. Studies have shown that filled pause is one of the most common disfluencies of spontaneous speech characteristic where it presents considerable problems for ASR performance. In many filled pause studies, the hindering factor is that filled pause being often recognized as short words which particularly has semantic meaning, such as „um” can be recognized as „thumb” or „arm”. This problem becomes especially pertinent where a vowel sound of normal word being relatively long at any position in an utterance, both within a word as well as between words which formerly known as elongation. The existence of elongation causes normal word falsely detected as filled pause due to their similar acoustical feature patterns. Classifying elongation as filled pause affects ASR”s performance as eliminating normal words from recognition may modify the intended context of a speech. Therefore, the main aim of this research is to classify filled pause and elongation into its own classes by constructing a discriminative classification model from the extracted acoustical features. A large number of signal features have been employed for the problem of discriminating filled pause and elongation. Several wellestablished features such as Formant Frequency (FF), Fundamental Frequency (F0), Mel Frequency Cepstral Coefficients (MFCC), Zero Crossing Rates (ZCR) and Short Time Energy (STE) were used in this research. These features are carefully chosen to emphasize signal characteristics that differ between filled pause and elongation… Institute of Graduate Studies, UiTM 2016 Book Section PeerReviewed text en http://ir.uitm.edu.my/id/eprint/20064/1/ABS_RASEEDA%20HAMZAH%20TDRA%20VOL%2010%20IGS%2016.pdf Hamzah, Raseeda (2016) Discriminative classification model of filled pause and elongation for Malay language spontaneous speech / Raseeda Hamzah. In: The Doctoral Research Abstracts. IGS Biannual Publication, 10 (10). Institute of Graduate Studies, UiTM, Shah Alam.
institution Universiti Teknologi Mara
building Tun Abdul Razak Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Mara
content_source UiTM Institutional Repository
url_provider http://ir.uitm.edu.my/
language English
topic Malaysia
Malay language. General works. History
spellingShingle Malaysia
Malay language. General works. History
Hamzah, Raseeda
Discriminative classification model of filled pause and elongation for Malay language spontaneous speech / Raseeda Hamzah
description Automated speech recognition (ASR) for spontaneous speech poses extra challenge compared to read speech as it contains varied speaking rates, poor phonation and disfluencies. Studies have shown that filled pause is one of the most common disfluencies of spontaneous speech characteristic where it presents considerable problems for ASR performance. In many filled pause studies, the hindering factor is that filled pause being often recognized as short words which particularly has semantic meaning, such as „um” can be recognized as „thumb” or „arm”. This problem becomes especially pertinent where a vowel sound of normal word being relatively long at any position in an utterance, both within a word as well as between words which formerly known as elongation. The existence of elongation causes normal word falsely detected as filled pause due to their similar acoustical feature patterns. Classifying elongation as filled pause affects ASR”s performance as eliminating normal words from recognition may modify the intended context of a speech. Therefore, the main aim of this research is to classify filled pause and elongation into its own classes by constructing a discriminative classification model from the extracted acoustical features. A large number of signal features have been employed for the problem of discriminating filled pause and elongation. Several wellestablished features such as Formant Frequency (FF), Fundamental Frequency (F0), Mel Frequency Cepstral Coefficients (MFCC), Zero Crossing Rates (ZCR) and Short Time Energy (STE) were used in this research. These features are carefully chosen to emphasize signal characteristics that differ between filled pause and elongation…
format Book Section
author Hamzah, Raseeda
author_facet Hamzah, Raseeda
author_sort Hamzah, Raseeda
title Discriminative classification model of filled pause and elongation for Malay language spontaneous speech / Raseeda Hamzah
title_short Discriminative classification model of filled pause and elongation for Malay language spontaneous speech / Raseeda Hamzah
title_full Discriminative classification model of filled pause and elongation for Malay language spontaneous speech / Raseeda Hamzah
title_fullStr Discriminative classification model of filled pause and elongation for Malay language spontaneous speech / Raseeda Hamzah
title_full_unstemmed Discriminative classification model of filled pause and elongation for Malay language spontaneous speech / Raseeda Hamzah
title_sort discriminative classification model of filled pause and elongation for malay language spontaneous speech / raseeda hamzah
publisher Institute of Graduate Studies, UiTM
publishDate 2016
url http://ir.uitm.edu.my/id/eprint/20064/1/ABS_RASEEDA%20HAMZAH%20TDRA%20VOL%2010%20IGS%2016.pdf
http://ir.uitm.edu.my/id/eprint/20064/
_version_ 1685649296697851904