INDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS

Biometric systems are prone to spoofing attacks. While research in speech anti-spoofing has been progressing, there is a limited availability of diverse language datasets. This study aims to bridge this gap by developing an Indonesian spoofed speech dataset, which includes replay attacks, text-to...

Full description

Saved in:
Bibliographic Details
Main Author: Azka Arief, Sarah
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/85050
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:85050
spelling id-itb.:850502024-08-19T13:57:28ZINDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS Azka Arief, Sarah Indonesia Final Project Spoof speech detection, Indonesian, ResNet, LCNN, LFCC INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/85050 Biometric systems are prone to spoofing attacks. While research in speech anti-spoofing has been progressing, there is a limited availability of diverse language datasets. This study aims to bridge this gap by developing an Indonesian spoofed speech dataset, which includes replay attacks, text-to-speech, and voice conversion. This dataset forms the foundation for creating an Indonesian speech anti-spoofing system. Subsequently, light convolutional neural network (LCNN) and residual network (ResNet) models, based on convolutional neural networks (CNN), were developed to evaluate the dataset. The input features used are linear frequency cepstral coefficients (LFCC). Both models demonstrate remarkably low minDCF and EER scores approaching zero. The results also exhibit exceptional scores with 4-fold cross validation, showing strong initial performance with no signs of overfitting. However, models trained solely on Common Voice or Prosa.ai datasets performed poorly in cross-source tests, suggesting generalization issues due to a lack of diversity in the dataset. This highlights the need for further improvement and continued research in Indonesian speech spoof detection. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Biometric systems are prone to spoofing attacks. While research in speech anti-spoofing has been progressing, there is a limited availability of diverse language datasets. This study aims to bridge this gap by developing an Indonesian spoofed speech dataset, which includes replay attacks, text-to-speech, and voice conversion. This dataset forms the foundation for creating an Indonesian speech anti-spoofing system. Subsequently, light convolutional neural network (LCNN) and residual network (ResNet) models, based on convolutional neural networks (CNN), were developed to evaluate the dataset. The input features used are linear frequency cepstral coefficients (LFCC). Both models demonstrate remarkably low minDCF and EER scores approaching zero. The results also exhibit exceptional scores with 4-fold cross validation, showing strong initial performance with no signs of overfitting. However, models trained solely on Common Voice or Prosa.ai datasets performed poorly in cross-source tests, suggesting generalization issues due to a lack of diversity in the dataset. This highlights the need for further improvement and continued research in Indonesian speech spoof detection.
format Final Project
author Azka Arief, Sarah
spellingShingle Azka Arief, Sarah
INDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS
author_facet Azka Arief, Sarah
author_sort Azka Arief, Sarah
title INDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS
title_short INDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS
title_full INDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS
title_fullStr INDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS
title_full_unstemmed INDONESIAN SPEECH ANTI-SPOOFING SYSTEM: DATA CREATION AND CONVOLUTIONAL NEURAL NETWORK MODELS
title_sort indonesian speech anti-spoofing system: data creation and convolutional neural network models
url https://digilib.itb.ac.id/gdl/view/85050
_version_ 1822283013552078848