SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION

Automatic Speaker Recognition system is a system that determines speaker identity through sound waves. This system can facilitate various daily services such as bank transaction via telephone. Nowadays, IVector based Automatic Speaker Recognition system for Bahasa has not been able to handle the...

Full description

Saved in:
Bibliographic Details
Main Author: Kusuma, Andika
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/39464
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:39464
spelling id-itb.:394642019-06-26T13:57:57ZSPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION Kusuma, Andika Indonesia Final Project Atom Aligned Sparse Representation, Automatic Speaker Recognition, emotional difference, IVector INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/39464 Automatic Speaker Recognition system is a system that determines speaker identity through sound waves. This system can facilitate various daily services such as bank transaction via telephone. Nowadays, IVector based Automatic Speaker Recognition system for Bahasa has not been able to handle the problem of emotional difference. However, in reality, speaker enrollment and recognition is often done in different emotional condition. This emotional difference frequently degrades the performance of existing systems. Therefore, this research focuses on constructing Automatic Speaker Recognition system for Bahasa that could handle different emotion problem by applying IVector modelling technique and Atom Aligned Sparse Representation (AASR) transformation technique. This research begins with collecting data in the form of sound recordings of several speakers at neutral and emotional condition. The emotion classes used in this study are angry, happiness, sadness, and contentment. Compared to the baseline system that was built using IVector method only, the AASR system shows an increase in performance, namely a decrease in Equal Error Rate (EER) of 3.79% in non-neutral emotion test data. In neutral emotion test data, the AASR system also experience a decrease in EER of 2.24%. Overall, the AASR system improves speaker recognition performance by reducing the EER by 3.46%. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Automatic Speaker Recognition system is a system that determines speaker identity through sound waves. This system can facilitate various daily services such as bank transaction via telephone. Nowadays, IVector based Automatic Speaker Recognition system for Bahasa has not been able to handle the problem of emotional difference. However, in reality, speaker enrollment and recognition is often done in different emotional condition. This emotional difference frequently degrades the performance of existing systems. Therefore, this research focuses on constructing Automatic Speaker Recognition system for Bahasa that could handle different emotion problem by applying IVector modelling technique and Atom Aligned Sparse Representation (AASR) transformation technique. This research begins with collecting data in the form of sound recordings of several speakers at neutral and emotional condition. The emotion classes used in this study are angry, happiness, sadness, and contentment. Compared to the baseline system that was built using IVector method only, the AASR system shows an increase in performance, namely a decrease in Equal Error Rate (EER) of 3.79% in non-neutral emotion test data. In neutral emotion test data, the AASR system also experience a decrease in EER of 2.24%. Overall, the AASR system improves speaker recognition performance by reducing the EER by 3.46%.
format Final Project
author Kusuma, Andika
spellingShingle Kusuma, Andika
SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
author_facet Kusuma, Andika
author_sort Kusuma, Andika
title SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
title_short SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
title_full SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
title_fullStr SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
title_full_unstemmed SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
title_sort speaker verification system in various emotions using atom aligned sparse representation
url https://digilib.itb.ac.id/gdl/view/39464
_version_ 1822925298992152576