Extreme learning machine based speaker recognition

Speaker Recognition is the process of automatically recognizing who is speaking based on the individual information included in speech waves. Speaker Recognition is basically classified into speaker identification and speaker verification. It can be used in many fields such as banking by telephone,...

全面介紹

Saved in:
書目詳細資料
主要作者: Hu, Zongjiang.
其他作者: Huang Guangbin
格式: Final Year Project
語言:English
出版: 2011
主題:
在線閱讀:http://hdl.handle.net/10356/45362
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:Speaker Recognition is the process of automatically recognizing who is speaking based on the individual information included in speech waves. Speaker Recognition is basically classified into speaker identification and speaker verification. It can be used in many fields such as banking by telephone, voice dialling, voice mail, and security control for secret information areas. Extreme Learning Machine (ELM), originally proposed by Prof. Huang, is an emerging technique which produces very good generalization performance in regression applications as well as in large dataset classification applications. In this report, a text-independent close-set speaker verification system based on ELM and SVM will be constructed and each part of the system like speaker database, feature extraction, speaker modelling and decision making will be elaborated in detail. Finally and most importantly, the performance of ELM and SVM will be compared. With the results shown in Chapter 6, we can draw the conclusion that ELM is superior to SVM in terms of tuning simplicity, time efficiency and testing accuracy.