SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION

Automatic Speaker Recognition system is a system that determines speaker identity through sound waves. This system can facilitate various daily services such as bank transaction via telephone. Nowadays, IVector based Automatic Speaker Recognition system for Bahasa has not been able to handle the...

Full description

Saved in:

Bibliographic Details
Main Author:	Kusuma, Andika
Format:	Final Project
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/39464
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Institut Teknologi Bandung
Language:	Indonesia

id	id-itb.:39464
spelling	id-itb.:394642019-06-26T13:57:57ZSPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION Kusuma, Andika Indonesia Final Project Atom Aligned Sparse Representation, Automatic Speaker Recognition, emotional difference, IVector INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/39464 Automatic Speaker Recognition system is a system that determines speaker identity through sound waves. This system can facilitate various daily services such as bank transaction via telephone. Nowadays, IVector based Automatic Speaker Recognition system for Bahasa has not been able to handle the problem of emotional difference. However, in reality, speaker enrollment and recognition is often done in different emotional condition. This emotional difference frequently degrades the performance of existing systems. Therefore, this research focuses on constructing Automatic Speaker Recognition system for Bahasa that could handle different emotion problem by applying IVector modelling technique and Atom Aligned Sparse Representation (AASR) transformation technique. This research begins with collecting data in the form of sound recordings of several speakers at neutral and emotional condition. The emotion classes used in this study are angry, happiness, sadness, and contentment. Compared to the baseline system that was built using IVector method only, the AASR system shows an increase in performance, namely a decrease in Equal Error Rate (EER) of 3.79% in non-neutral emotion test data. In neutral emotion test data, the AASR system also experience a decrease in EER of 2.24%. Overall, the AASR system improves speaker recognition performance by reducing the EER by 3.46%. text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	Automatic Speaker Recognition system is a system that determines speaker identity through sound waves. This system can facilitate various daily services such as bank transaction via telephone. Nowadays, IVector based Automatic Speaker Recognition system for Bahasa has not been able to handle the problem of emotional difference. However, in reality, speaker enrollment and recognition is often done in different emotional condition. This emotional difference frequently degrades the performance of existing systems. Therefore, this research focuses on constructing Automatic Speaker Recognition system for Bahasa that could handle different emotion problem by applying IVector modelling technique and Atom Aligned Sparse Representation (AASR) transformation technique. This research begins with collecting data in the form of sound recordings of several speakers at neutral and emotional condition. The emotion classes used in this study are angry, happiness, sadness, and contentment. Compared to the baseline system that was built using IVector method only, the AASR system shows an increase in performance, namely a decrease in Equal Error Rate (EER) of 3.79% in non-neutral emotion test data. In neutral emotion test data, the AASR system also experience a decrease in EER of 2.24%. Overall, the AASR system improves speaker recognition performance by reducing the EER by 3.46%.
format	Final Project
author	Kusuma, Andika
spellingShingle	Kusuma, Andika SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
author_facet	Kusuma, Andika
author_sort	Kusuma, Andika
title	SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
title_short	SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
title_full	SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
title_fullStr	SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
title_full_unstemmed	SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION
title_sort	speaker verification system in various emotions using atom aligned sparse representation
url	https://digilib.itb.ac.id/gdl/view/39464
_version_	1822925298992152576

SPEAKER VERIFICATION SYSTEM IN VARIOUS EMOTIONS USING ATOM ALIGNED SPARSE REPRESENTATION

Similar Items