Insertion reduction in speech segmentation using neural network

Statistical approach with non-fixed overlapping window size is able to make good identification of discontinuity in speech signal without further knowledge upon the phonetic sequence. This however, leads to increase number of insertion and thus increase confusion in recognition. This paper present a...

Full description

Saved in:
Bibliographic Details
Main Authors: Salam, M-S, Mohamad, Dzulkifli, Salleh, S-H
Format: Book Section
Published: Institute of Electrical and Electronics Engineers 2008
Subjects:
Online Access:http://eprints.utm.my/id/eprint/12589/
http://dx.doi.org/10.1109/ITSIM.2008.4632062
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.12589
record_format eprints
spelling my.utm.125892011-06-14T08:18:41Z http://eprints.utm.my/id/eprint/12589/ Insertion reduction in speech segmentation using neural network Salam, M-S Mohamad, Dzulkifli Salleh, S-H QA75 Electronic computers. Computer science Statistical approach with non-fixed overlapping window size is able to make good identification of discontinuity in speech signal without further knowledge upon the phonetic sequence. This however, leads to increase number of insertion and thus increase confusion in recognition. This paper present a fusion between statistical and connectionist approach namely divergence algorithm and MLP neural network to improved segmentation by reducing insertions. The experiment conducted on Malay semi-spontaneous connected digit in classroom environment. The digit strings were manually segmented and trained using neural network with three set of data. The first training set trained without silence pattern, the second include silence while the last set introduced both silence and false pattern in the training. The experimental result on digit string segmentation shows number of insertion reduction of more than 5 times in comparison using divergence alone with increment of accuracy up to 40%.. The drawback however, the number of omission also increases to more than 10 times. Nevertheless, match segmentation rate still above 85%. Institute of Electrical and Electronics Engineers 2008 Book Section PeerReviewed Salam, M-S and Mohamad, Dzulkifli and Salleh, S-H (2008) Insertion reduction in speech segmentation using neural network. In: Proceedings - International Symposium on Information Technology 2008, ITSim. Institute of Electrical and Electronics Engineers, New York, pp. 2057-2063. ISBN 978-142442328-6 http://dx.doi.org/10.1109/ITSIM.2008.4632062 doi:10.1109/ITSIM.2008.4632062
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Salam, M-S
Mohamad, Dzulkifli
Salleh, S-H
Insertion reduction in speech segmentation using neural network
description Statistical approach with non-fixed overlapping window size is able to make good identification of discontinuity in speech signal without further knowledge upon the phonetic sequence. This however, leads to increase number of insertion and thus increase confusion in recognition. This paper present a fusion between statistical and connectionist approach namely divergence algorithm and MLP neural network to improved segmentation by reducing insertions. The experiment conducted on Malay semi-spontaneous connected digit in classroom environment. The digit strings were manually segmented and trained using neural network with three set of data. The first training set trained without silence pattern, the second include silence while the last set introduced both silence and false pattern in the training. The experimental result on digit string segmentation shows number of insertion reduction of more than 5 times in comparison using divergence alone with increment of accuracy up to 40%.. The drawback however, the number of omission also increases to more than 10 times. Nevertheless, match segmentation rate still above 85%.
format Book Section
author Salam, M-S
Mohamad, Dzulkifli
Salleh, S-H
author_facet Salam, M-S
Mohamad, Dzulkifli
Salleh, S-H
author_sort Salam, M-S
title Insertion reduction in speech segmentation using neural network
title_short Insertion reduction in speech segmentation using neural network
title_full Insertion reduction in speech segmentation using neural network
title_fullStr Insertion reduction in speech segmentation using neural network
title_full_unstemmed Insertion reduction in speech segmentation using neural network
title_sort insertion reduction in speech segmentation using neural network
publisher Institute of Electrical and Electronics Engineers
publishDate 2008
url http://eprints.utm.my/id/eprint/12589/
http://dx.doi.org/10.1109/ITSIM.2008.4632062
_version_ 1643645991753089024