Improved statistical speech segmentation using connectionist approach

Problem statement: Speech segmentation is an important part for speech recognition, synthesizing and coding. Statistical based approach detects segmentation points via computing spectral distortion of the signal without prior knowledge of the acoustic information proved to be able to give good match...

Full description

Saved in:
Bibliographic Details
Main Authors: Salam, M. S., Mohamad, Dzulkifli, Salleh, S. H.
Format: Article
Published: Science Publications 2009
Subjects:
Online Access:http://eprints.utm.my/id/eprint/15107/
http://dx.doi.org/10.3844/jcs.2009.275.282
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.15107
record_format eprints
spelling my.utm.151072011-09-30T15:08:41Z http://eprints.utm.my/id/eprint/15107/ Improved statistical speech segmentation using connectionist approach Salam, M. S. Mohamad, Dzulkifli Salleh, S. H. QA75 Electronic computers. Computer science Problem statement: Speech segmentation is an important part for speech recognition, synthesizing and coding. Statistical based approach detects segmentation points via computing spectral distortion of the signal without prior knowledge of the acoustic information proved to be able to give good match, less omission but lot of insertion. These insertion points dropped segmentation accuracy. Approach: This study proposed a fusion method between statistical and connectionist approaches namely the divergence algorithm and Multi Layer Perceptron (MLP) with adaptive learning for segmentation of Malay connected digit with the aim to improve statistical approach via detection of insertion points. The neural network was optimized via trial and error in finding suitable parameters and speech time normalization methods. The best neural network classifier was then fusion with divergence algorithm to make segmentation. Results: The results of the experiments showed that the best neural network classifier used learning rate of value 1.0 and momentum rate of value 0.9 with data normalization based on zero-padded. The segmentation using fusion of statistical and connectionist was able to reduce insertion points up to 10.4% while maintaining match points above 99% and omission point below 0.7% within time tolerance of 0.09 second. Conclusion: The result of segmentation using the proposed fusion method indicated potential use of connectionist approach in improving continuous segmentation by statistical approach. Science Publications 2009 Article PeerReviewed Salam, M. S. and Mohamad, Dzulkifli and Salleh, S. H. (2009) Improved statistical speech segmentation using connectionist approach. Journal of Computer Science, 5 (4). pp. 275-282. ISSN 15493636 http://dx.doi.org/10.3844/jcs.2009.275.282 doi:10.3844/jcs.2009.275.282
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Salam, M. S.
Mohamad, Dzulkifli
Salleh, S. H.
Improved statistical speech segmentation using connectionist approach
description Problem statement: Speech segmentation is an important part for speech recognition, synthesizing and coding. Statistical based approach detects segmentation points via computing spectral distortion of the signal without prior knowledge of the acoustic information proved to be able to give good match, less omission but lot of insertion. These insertion points dropped segmentation accuracy. Approach: This study proposed a fusion method between statistical and connectionist approaches namely the divergence algorithm and Multi Layer Perceptron (MLP) with adaptive learning for segmentation of Malay connected digit with the aim to improve statistical approach via detection of insertion points. The neural network was optimized via trial and error in finding suitable parameters and speech time normalization methods. The best neural network classifier was then fusion with divergence algorithm to make segmentation. Results: The results of the experiments showed that the best neural network classifier used learning rate of value 1.0 and momentum rate of value 0.9 with data normalization based on zero-padded. The segmentation using fusion of statistical and connectionist was able to reduce insertion points up to 10.4% while maintaining match points above 99% and omission point below 0.7% within time tolerance of 0.09 second. Conclusion: The result of segmentation using the proposed fusion method indicated potential use of connectionist approach in improving continuous segmentation by statistical approach.
format Article
author Salam, M. S.
Mohamad, Dzulkifli
Salleh, S. H.
author_facet Salam, M. S.
Mohamad, Dzulkifli
Salleh, S. H.
author_sort Salam, M. S.
title Improved statistical speech segmentation using connectionist approach
title_short Improved statistical speech segmentation using connectionist approach
title_full Improved statistical speech segmentation using connectionist approach
title_fullStr Improved statistical speech segmentation using connectionist approach
title_full_unstemmed Improved statistical speech segmentation using connectionist approach
title_sort improved statistical speech segmentation using connectionist approach
publisher Science Publications
publishDate 2009
url http://eprints.utm.my/id/eprint/15107/
http://dx.doi.org/10.3844/jcs.2009.275.282
_version_ 1643646481363632128