Voice replacement for the severely speech impaired through sub-ultrasonic excitation of the vocal tract

Post-laryngectomized and speech impaired patients lose the ability to create natural pitch, and therefore seek methods to provide replacement voicing for their speech. Non-invasive pitch generation can be accomplished by coupling an artificial pitch into the vocal tract (as in the electrolarynx). Ho...

Full description

Saved in:

Bibliographic Details
Main Author:	Ahmadi, Farzaneh
Other Authors:	Ian Vince McLoughlin
Format:	Theses and Dissertations
Language:	English
Published:	2013
Subjects:	DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis DRNTU::Engineering::Bioengineering
Online Access:	http://hdl.handle.net/10356/52661
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-52661
record_format	dspace
spelling	sg-ntu-dr.10356-526612023-03-04T00:33:07Z Voice replacement for the severely speech impaired through sub-ultrasonic excitation of the vocal tract Ahmadi, Farzaneh Ian Vince McLoughlin School of Computer Engineering Centre for Computational Intelligence Goh, Wooi Boon DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis DRNTU::Engineering::Bioengineering Post-laryngectomized and speech impaired patients lose the ability to create natural pitch, and therefore seek methods to provide replacement voicing for their speech. Non-invasive pitch generation can be accomplished by coupling an artificial pitch into the vocal tract (as in the electrolarynx). However this audible pitch leaks out, causing a monotonous and continuous background noise which significantly degrades the speech quality. This research aims to investigate the idea of substituting the missing pitch of such patients with a low frequency ultrasonic signal, propagating inside the tract to map the vocal tract (VT) shape. The emitted “ultrasonic speech” signal can then be down-converted to generate elements of audible speech. This technology can use the VT physiology in a similar way to the well-established electrolarynx, but the ultrasonic excitation conveys advantages of being non-audible (i.e. no pitch leakage), analysable by computer, and adjustable without directly affecting the speech quality. Other applications of the technology include: speech communications in high noise environments such as factories and battle fields, as well as secure/quiet mobile telephony modes to prevent others from overhearing sensitive communications. Another application is supplementary data provision in the reconstruction of voiced speech from whispers. Ultrasonic speech has received little research attention before the course of this research and the current state of the art of the technology is still far from achieving a complete system. Accordingly, this research aims to take a fundamental step towards ultrasonic speech deployment, using physical analysis and modelling of ultrasonic propagation inside the human vocal tract. Major questions including the differences between attributes of audible and ultrasonic output of the human vocal tract and the possibility of extending the source-filter theory of audible speech to the ultrasonic domain are answered in this thesis. This work proves that the linear source-filter theory can still be valid for low frequency and sub-ultrasonic (14 to 100 kHz) propagation inside the VT. In addition, linear predictive analysis is extended from the one-dimensional sound propagation inside the VT in audible speech to the three-dimensional propagation of ultrasonic waves. The next important aim of this thesis is to gain a model that describes propagation of ultrasonic waves in the vocal tract. This has been accomplished through finite element analysis of ultrasonic propagation in the geometry of human vocal tract. The models are later constructed using three-dimensional printing and the results of the simulations are verified using acoustic measurements. In addition, since implementation of ultrasonic speech involves timely exposure of human body to low frequency ultrasonic waves, this research reviews the safety requirements of implementing ultrasonic speech which is a necessary requirement in testing and implementing the technology. Doctor of Philosophy (SCE) 2013-05-22T01:53:13Z 2013-05-22T01:53:13Z 2012 2012 Thesis http://hdl.handle.net/10356/52661 en 174 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis DRNTU::Engineering::Bioengineering
spellingShingle	DRNTU::Engineering::Computer science and engineering::Mathematics of computing::Numerical analysis DRNTU::Engineering::Bioengineering Ahmadi, Farzaneh Voice replacement for the severely speech impaired through sub-ultrasonic excitation of the vocal tract
description	Post-laryngectomized and speech impaired patients lose the ability to create natural pitch, and therefore seek methods to provide replacement voicing for their speech. Non-invasive pitch generation can be accomplished by coupling an artificial pitch into the vocal tract (as in the electrolarynx). However this audible pitch leaks out, causing a monotonous and continuous background noise which significantly degrades the speech quality. This research aims to investigate the idea of substituting the missing pitch of such patients with a low frequency ultrasonic signal, propagating inside the tract to map the vocal tract (VT) shape. The emitted “ultrasonic speech” signal can then be down-converted to generate elements of audible speech. This technology can use the VT physiology in a similar way to the well-established electrolarynx, but the ultrasonic excitation conveys advantages of being non-audible (i.e. no pitch leakage), analysable by computer, and adjustable without directly affecting the speech quality. Other applications of the technology include: speech communications in high noise environments such as factories and battle fields, as well as secure/quiet mobile telephony modes to prevent others from overhearing sensitive communications. Another application is supplementary data provision in the reconstruction of voiced speech from whispers. Ultrasonic speech has received little research attention before the course of this research and the current state of the art of the technology is still far from achieving a complete system. Accordingly, this research aims to take a fundamental step towards ultrasonic speech deployment, using physical analysis and modelling of ultrasonic propagation inside the human vocal tract. Major questions including the differences between attributes of audible and ultrasonic output of the human vocal tract and the possibility of extending the source-filter theory of audible speech to the ultrasonic domain are answered in this thesis. This work proves that the linear source-filter theory can still be valid for low frequency and sub-ultrasonic (14 to 100 kHz) propagation inside the VT. In addition, linear predictive analysis is extended from the one-dimensional sound propagation inside the VT in audible speech to the three-dimensional propagation of ultrasonic waves. The next important aim of this thesis is to gain a model that describes propagation of ultrasonic waves in the vocal tract. This has been accomplished through finite element analysis of ultrasonic propagation in the geometry of human vocal tract. The models are later constructed using three-dimensional printing and the results of the simulations are verified using acoustic measurements. In addition, since implementation of ultrasonic speech involves timely exposure of human body to low frequency ultrasonic waves, this research reviews the safety requirements of implementing ultrasonic speech which is a necessary requirement in testing and implementing the technology.
author2	Ian Vince McLoughlin
author_facet	Ian Vince McLoughlin Ahmadi, Farzaneh
format	Theses and Dissertations
author	Ahmadi, Farzaneh
author_sort	Ahmadi, Farzaneh
title	Voice replacement for the severely speech impaired through sub-ultrasonic excitation of the vocal tract
title_short	Voice replacement for the severely speech impaired through sub-ultrasonic excitation of the vocal tract
title_full	Voice replacement for the severely speech impaired through sub-ultrasonic excitation of the vocal tract
title_fullStr	Voice replacement for the severely speech impaired through sub-ultrasonic excitation of the vocal tract
title_full_unstemmed	Voice replacement for the severely speech impaired through sub-ultrasonic excitation of the vocal tract
title_sort	voice replacement for the severely speech impaired through sub-ultrasonic excitation of the vocal tract
publishDate	2013
url	http://hdl.handle.net/10356/52661
_version_	1759855862049931264

Voice replacement for the severely speech impaired through sub-ultrasonic excitation of the vocal tract

Similar Items