Spoofing detection from a feature representation perspective

Spoofing detection, which discriminates the spoofed speech from the natural speech, has gained much attention recently. Low-dimensional features that are used in speaker recognition/verification are also used in spoofing detection. Unfortunately, they don't capture sufficient information requir...

Full description

Saved in:
Bibliographic Details
Main Authors: Tian, Xiaohai, Wu, Zhizheg, Xiao, Xiong, Chng, Eng Siong, Li, Haizhou
Other Authors: School of Computer Science and Engineering
Format: Conference or Workshop Item
Language:English
Published: 2018
Subjects:
Online Access:https://hdl.handle.net/10356/89643
http://hdl.handle.net/10220/47063
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-89643
record_format dspace
spelling sg-ntu-dr.10356-896432020-03-07T11:48:46Z Spoofing detection from a feature representation perspective Tian, Xiaohai Wu, Zhizheg Xiao, Xiong Chng, Eng Siong Li, Haizhou School of Computer Science and Engineering 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) NTU-UBC Research Centre of Excellence in Active Living for the Elderly Temasek Laboratories Spoofing Detection DRNTU::Engineering::Computer science and engineering Spoofing Attack Spoofing detection, which discriminates the spoofed speech from the natural speech, has gained much attention recently. Low-dimensional features that are used in speaker recognition/verification are also used in spoofing detection. Unfortunately, they don't capture sufficient information required for spoofing detection. In this work, we investigate the use of high-dimensional features for spoofing detection, that maybe more sensitive to the artifacts in the spoofed speech. Six types of high-dimensional feature are employed. For each kind of feature, four different representations are extracted, i.e. the original high-dimensional feature, corresponding low-dimensional feature, the low- and the high-frequency regions of the original high-dimensional feature. Dynamic features are also calculated to assess the effectiveness of the temporal information to detect the artifacts across frames. A neural network-based classifier is adopted to handle the high-dimensional features. Experimental results on the standard ASVspoof 2015 corpus suggest that high-dimensional features and dynamic features are useful for spoofing attack detection. A fusion of them has been shown to achieve 0.0% the equal error rates for nine of ten attack types. NRF (Natl Research Foundation, S’pore) Accepted version 2018-12-18T07:34:06Z 2019-12-06T17:30:08Z 2018-12-18T07:34:06Z 2019-12-06T17:30:08Z 2016-03-01 2016 Conference Paper Tian, X., Wu, Z., Xiao, X., Chng, E. S., & Li, H. (2016). Spoofing detection from a feature representation perspective. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2119-2123. doi:10.1109/ICASSP.2016.7472051 https://hdl.handle.net/10356/89643 http://hdl.handle.net/10220/47063 10.1109/ICASSP.2016.7472051 200443 en © 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/ICASSP.2016.7472051]. 5 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Spoofing Detection
DRNTU::Engineering::Computer science and engineering
Spoofing Attack
spellingShingle Spoofing Detection
DRNTU::Engineering::Computer science and engineering
Spoofing Attack
Tian, Xiaohai
Wu, Zhizheg
Xiao, Xiong
Chng, Eng Siong
Li, Haizhou
Spoofing detection from a feature representation perspective
description Spoofing detection, which discriminates the spoofed speech from the natural speech, has gained much attention recently. Low-dimensional features that are used in speaker recognition/verification are also used in spoofing detection. Unfortunately, they don't capture sufficient information required for spoofing detection. In this work, we investigate the use of high-dimensional features for spoofing detection, that maybe more sensitive to the artifacts in the spoofed speech. Six types of high-dimensional feature are employed. For each kind of feature, four different representations are extracted, i.e. the original high-dimensional feature, corresponding low-dimensional feature, the low- and the high-frequency regions of the original high-dimensional feature. Dynamic features are also calculated to assess the effectiveness of the temporal information to detect the artifacts across frames. A neural network-based classifier is adopted to handle the high-dimensional features. Experimental results on the standard ASVspoof 2015 corpus suggest that high-dimensional features and dynamic features are useful for spoofing attack detection. A fusion of them has been shown to achieve 0.0% the equal error rates for nine of ten attack types.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Tian, Xiaohai
Wu, Zhizheg
Xiao, Xiong
Chng, Eng Siong
Li, Haizhou
format Conference or Workshop Item
author Tian, Xiaohai
Wu, Zhizheg
Xiao, Xiong
Chng, Eng Siong
Li, Haizhou
author_sort Tian, Xiaohai
title Spoofing detection from a feature representation perspective
title_short Spoofing detection from a feature representation perspective
title_full Spoofing detection from a feature representation perspective
title_fullStr Spoofing detection from a feature representation perspective
title_full_unstemmed Spoofing detection from a feature representation perspective
title_sort spoofing detection from a feature representation perspective
publishDate 2018
url https://hdl.handle.net/10356/89643
http://hdl.handle.net/10220/47063
_version_ 1681049262146715648