INDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS
Biometric systems are prone to spoofing attacks. While research in speech anti-spoofing has been progressing, there is a limited availability of diverse language datasets. This study aims to bridge this gap by developing an Indonesian spoofed speech dataset, which includes replay attacks, text-to...
Main Author: | Azka Arief, Sarah |
---|---|
Format: | Final Project |
Language: | Indonesian |
Online Access: | https://digilib.itb.ac.id/gdl/view/85050 |
Institution: | Institut Teknologi Bandung |
id |
id-itb.:85050 |
---|---|
spelling |
id-itb.:85050; 2024-08-19T13:57:28Z; INDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS; Azka Arief, Sarah; Indonesia; Final Project; Spoof speech detection, Indonesian, ResNet, LCNN, LFCC; INSTITUT TEKNOLOGI BANDUNG; https://digilib.itb.ac.id/gdl/view/85050; text |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
description |
Biometric systems are prone to spoofing attacks. Although research on speech
anti-spoofing continues to progress, datasets are available for only a limited
range of languages. This study aims to bridge that gap by developing an
Indonesian spoofed speech dataset covering replay attacks, text-to-speech, and
voice conversion. The dataset forms the foundation for an Indonesian speech
anti-spoofing system. Two convolutional neural network (CNN) models, a light
convolutional neural network (LCNN) and a residual network (ResNet), were then
developed to evaluate the dataset, using linear frequency cepstral coefficients
(LFCC) as input features. Both models achieve minimum detection cost function
(minDCF) and equal error rate (EER) scores approaching zero. Results under
4-fold cross-validation are similarly strong, indicating good initial
performance with no signs of overfitting. However, models trained solely on the
Common Voice or Prosa.ai data performed poorly in cross-source tests, suggesting
generalization issues caused by a lack of diversity in the dataset. This
highlights the need for further improvement and continued research in
Indonesian speech spoof detection. |
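The abstract reports results as EER. For readers unfamiliar with the metric, the following is a minimal sketch of how an equal error rate can be computed from detector scores. It is not taken from the thesis: the function name `compute_eer`, the convention that a higher score means "more likely bona fide", and the sample scores are all assumptions made for illustration.

```python
def compute_eer(bonafide_scores, spoof_scores):
    """Return (eer, threshold), assuming higher scores mean 'more bona fide'.

    Hypothetical helper for illustration; not the thesis's evaluation code.
    """
    if not bonafide_scores or not spoof_scores:
        raise ValueError("both score lists must be non-empty")

    # Candidate thresholds: every observed score, plus one above the maximum
    # so that the 'reject everything' operating point is also considered.
    thresholds = sorted(set(bonafide_scores) | set(spoof_scores))
    thresholds.append(thresholds[-1] + 1.0)

    best = None  # (gap between the two error rates, eer estimate, threshold)
    for t in thresholds:
        # False acceptance rate: spoofed utterances accepted as bona fide.
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        # False rejection rate: bona fide utterances rejected as spoofed.
        frr = sum(s < t for s in bonafide_scores) / len(bonafide_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2.0, t)
    return best[1], best[2]


# Made-up scores: fully separated classes give an EER of zero.
eer, threshold = compute_eer([0.9, 0.8, 0.95], [0.1, 0.2, 0.3])
print(eer, threshold)
```

An EER near zero, as the abstract describes, means some threshold separates bona fide and spoofed scores almost perfectly on the evaluated data. It says nothing about data from another source, which is consistent with the weak cross-source results the abstract also reports.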
author |
Azka Arief, Sarah |
title |
INDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS |