JAVA AND SUNDA DIALECT RECOGNITION FROM INDONESIAN SPEECH USING GMM AND I-VECTOR

Dialect is a variance of language that can affect the way a person pronounces. In a speech recognition system that translates voice into text form, the speaker dialect may affect the results of the recognition. Research on dialect identification has been done first in Indian (Hindi), Arabic and Bang...

Full description

Saved in:
Bibliographic Details
Main Author: RAHMAWATI (NIM: 23514023), RITA
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/24044
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
Description
Summary:Dialect is a variance of language that can affect the way a person pronounces. In a speech recognition system that translates voice into text form, the speaker dialect may affect the results of the recognition. Research on dialect identification has been done first in Indian (Hindi), Arabic and Bangladeshi dialects. Although in Indonesia there are quite a lot of dialects, but research for the recognition of dialect in Indonesian is still limited, therefore this research focus on recognition of Java and Sunda dialect that have the most speakers in Indonesia. This research begins with data collection used for machine learning experiments based on supervised learning. The sound corpus used to construct the model is recorded voice corpus of 8 men and 2 women in each dialect who read the story in Indonesian with a total duration of training data for 1.5 hours. The recognition of Java and Sunda dialects from Indonesian Speech was built through a combination of MFCC and pitch features and using GMM and I-vector modeling techniques. The process of building the dialect model is done with the ratio of 80:20 for the training and testing data. In addition, the constructed model has been tested using a 5-Fold scheme on 4 tesing data on closed test and 12 tesing data on open test. Classification Error value obtained by using I-vector modeling technique and MFCC + pitch feature combination is 35% for closed test and 13,34% for open test.