Detecting voice disguise from speech variability : analysis of three glottal and vocal tract measures
The deliberate attempt by speakers to conceal their identity (voice disguise) presents a challenge for forensics and for automated speaker identification systems. Using a database of natural and disguised voices of three professional voice artists, we build on earlier findings in [Amin et al., 2014]...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2014
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/103227 http://hdl.handle.net/10220/20003 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-103227 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1032272020-03-07T12:10:37Z Detecting voice disguise from speech variability : analysis of three glottal and vocal tract measures Amin, Talal B. German, James S. Marziliano, Pina School of Electrical and Electronic Engineering School of Humanities and Social Sciences 166th Meeting of the Acoustical Society of America DRNTU::Engineering::Electrical and electronic engineering The deliberate attempt by speakers to conceal their identity (voice disguise) presents a challenge for forensics and for automated speaker identification systems. Using a database of natural and disguised voices of three professional voice artists, we build on earlier findings in [Amin et al., 2014] by exploring how certain glottal and vocal tract measures, such as glottal timing (Open Quotient) and vowel formants are manipulated by the artists to create novel voice identities. We also investigate whether there are any features from these measures that can be useful for discriminating natural and disguised voices. As expected, variation in Open Quotient was speaker-dependent, and corresponded closely to social attributes (i.e., age) of the voice identities involved. By modelling the overall variability of speakers in the vowel space, we propose a new method for automatically classifying natural and disguised voices. The proposed method is found to outperform several state-of-the-art methods. Published version 2014-07-02T02:34:52Z 2019-12-06T21:07:54Z 2014-07-02T02:34:52Z 2019-12-06T21:07:54Z 2014 2014 Conference Paper Amin, T. B., German, J. S., & Marziliano, P. (2014). Detecting voice disguise from speech variability: Analysis of three glottal and vocal tract measures. Proceedings of Meetings on Acoustics, 20, 060005-. https://hdl.handle.net/10356/103227 http://hdl.handle.net/10220/20003 10.1121/1.4879257 177795 en © 2014 Acoustical Society of America. This paper was published in Proceedings of Meetings on Acoustics and is made available as an electronic reprint (preprint) with permission of Acoustical Society of America. The paper can be found at the following official DOI: http://dx.doi.org/10.1121/1.4879257 . One print or electronic copy may be made for personal use only. Systematic or multiple reproduction, distribution to multiple locations via electronic or other means, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper is prohibited and is subject to penalties under law. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering Amin, Talal B. German, James S. Marziliano, Pina Detecting voice disguise from speech variability : analysis of three glottal and vocal tract measures |
description |
The deliberate attempt by speakers to conceal their identity (voice disguise) presents a challenge for forensics and for automated speaker identification systems. Using a database of natural and disguised voices of three professional voice artists, we build on earlier findings in [Amin et al., 2014] by exploring how certain glottal and vocal tract measures, such as glottal timing (Open Quotient) and vowel formants are manipulated by the artists to create novel voice identities. We also investigate whether there are any features from these measures that can be useful for discriminating natural and disguised voices. As expected, variation in Open Quotient was speaker-dependent, and corresponded closely to social attributes (i.e., age) of the voice identities involved. By modelling the overall variability of speakers in the vowel space, we propose a new method for automatically classifying natural and disguised voices. The proposed method is found to outperform several state-of-the-art methods. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Amin, Talal B. German, James S. Marziliano, Pina |
format |
Conference or Workshop Item |
author |
Amin, Talal B. German, James S. Marziliano, Pina |
author_sort |
Amin, Talal B. |
title |
Detecting voice disguise from speech variability : analysis of three glottal and vocal tract measures |
title_short |
Detecting voice disguise from speech variability : analysis of three glottal and vocal tract measures |
title_full |
Detecting voice disguise from speech variability : analysis of three glottal and vocal tract measures |
title_fullStr |
Detecting voice disguise from speech variability : analysis of three glottal and vocal tract measures |
title_full_unstemmed |
Detecting voice disguise from speech variability : analysis of three glottal and vocal tract measures |
title_sort |
detecting voice disguise from speech variability : analysis of three glottal and vocal tract measures |
publishDate |
2014 |
url |
https://hdl.handle.net/10356/103227 http://hdl.handle.net/10220/20003 |
_version_ |
1681049906424315904 |