Real-Time End-to-End Speech Emotion Recognition with Cross-Domain Adaptation
Language resources are the main factor in speech-emotion-recognition (SER)-based deep learning models. Thai is a low-resource language that has a smaller data size than high-resource languages such as German. This paper describes the framework of using a pretrained-model-based front-end and back-end...
Saved in:
Main Author: | Wongpatikaseree K. |
---|---|
Other Authors: | Mahidol University |
Format: | Article |
Published: |
2023
|
Subjects: | |
Online Access: | https://repository.li.mahidol.ac.th/handle/123456789/84256 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Mahidol University |
Similar Items
-
Real-Time Thai Speech Emotion Recognition With Speech Enhancement Using Time-Domain Contrastive Predictive Coding and Conv-Tasnet
by: Sumeth Yuenyong, et al.
Published: (2022) -
End-to-End Speech Emotion Recognition Using Multi-Scale Convolution Networks
by: Sivanagaraja, Tatinati, et al.
Published: (2018) -
A cross-domain framework for coordinated end-to-end QoS adaptation
by: Zhou, L., et al.
Published: (2013) -
Speaker-invariant speech emotion recognition with domain adversarial training
by: Leow, Bryan Xuan Zhen
Published: (2021) -
Real-Time Thai Speech Emotion Recognition With Speech Enhancement Using Time-Domain Contrastive Predictive Coding and Conv-Tasnet
by: Yuenyong S.
Published: (2023)