Insertion reduction in speech segmentation using neural network
Statistical approach with non-fixed overlapping window size is able to make good identification of discontinuity in speech signal without further knowledge upon the phonetic sequence. This however, leads to increase number of insertion and thus increase confusion in recognition. This paper present a...
Saved in:
Main Authors: | , , |
---|---|
Format: | Book Section |
Published: |
Institute of Electrical and Electronics Engineers
2008
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/12589/ http://dx.doi.org/10.1109/ITSIM.2008.4632062 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Malaysia |
id |
my.utm.12589 |
---|---|
record_format |
eprints |
spelling |
my.utm.125892011-06-14T08:18:41Z http://eprints.utm.my/id/eprint/12589/ Insertion reduction in speech segmentation using neural network Salam, M-S Mohamad, Dzulkifli Salleh, S-H QA75 Electronic computers. Computer science Statistical approach with non-fixed overlapping window size is able to make good identification of discontinuity in speech signal without further knowledge upon the phonetic sequence. This however, leads to increase number of insertion and thus increase confusion in recognition. This paper present a fusion between statistical and connectionist approach namely divergence algorithm and MLP neural network to improved segmentation by reducing insertions. The experiment conducted on Malay semi-spontaneous connected digit in classroom environment. The digit strings were manually segmented and trained using neural network with three set of data. The first training set trained without silence pattern, the second include silence while the last set introduced both silence and false pattern in the training. The experimental result on digit string segmentation shows number of insertion reduction of more than 5 times in comparison using divergence alone with increment of accuracy up to 40%.. The drawback however, the number of omission also increases to more than 10 times. Nevertheless, match segmentation rate still above 85%. Institute of Electrical and Electronics Engineers 2008 Book Section PeerReviewed Salam, M-S and Mohamad, Dzulkifli and Salleh, S-H (2008) Insertion reduction in speech segmentation using neural network. In: Proceedings - International Symposium on Information Technology 2008, ITSim. Institute of Electrical and Electronics Engineers, New York, pp. 2057-2063. ISBN 978-142442328-6 http://dx.doi.org/10.1109/ITSIM.2008.4632062 doi:10.1109/ITSIM.2008.4632062 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Salam, M-S Mohamad, Dzulkifli Salleh, S-H Insertion reduction in speech segmentation using neural network |
description |
Statistical approach with non-fixed overlapping window size is able to make good identification of discontinuity in speech signal without further knowledge upon the phonetic sequence. This however, leads to increase number of insertion and thus increase confusion in recognition. This paper present a fusion between statistical and connectionist approach namely divergence algorithm and MLP neural network to improved segmentation by reducing insertions. The experiment conducted on Malay semi-spontaneous connected digit in classroom environment. The digit strings were manually segmented and trained using neural network with three set of data. The first training set trained without silence pattern, the second include silence while the last set introduced both silence and false pattern in the training. The experimental result on digit string segmentation shows number of insertion reduction of more than 5 times in comparison using divergence alone with increment of accuracy up to 40%.. The drawback however, the number of omission also increases to more than 10 times. Nevertheless, match segmentation rate still above 85%. |
format |
Book Section |
author |
Salam, M-S Mohamad, Dzulkifli Salleh, S-H |
author_facet |
Salam, M-S Mohamad, Dzulkifli Salleh, S-H |
author_sort |
Salam, M-S |
title |
Insertion reduction in speech segmentation using neural network |
title_short |
Insertion reduction in speech segmentation using neural network |
title_full |
Insertion reduction in speech segmentation using neural network |
title_fullStr |
Insertion reduction in speech segmentation using neural network |
title_full_unstemmed |
Insertion reduction in speech segmentation using neural network |
title_sort |
insertion reduction in speech segmentation using neural network |
publisher |
Institute of Electrical and Electronics Engineers |
publishDate |
2008 |
url |
http://eprints.utm.my/id/eprint/12589/ http://dx.doi.org/10.1109/ITSIM.2008.4632062 |
_version_ |
1643645991753089024 |