Effective Music Tagging through Advanced Statistical Modeling

Music information retrieval (MIR) holds great promise as a technology for managing large music archives. One of the key components of MIR that has been actively researched into is music tagging. While significant progress has been achieved, most of the existing systems still adopt a simple classific...

全面介紹

Saved in:
書目詳細資料
Main Authors: SHEN, Jialie, WANG, Meng, YAN, Shuicheng, PANG, Hwee Hwa, HUA, Xian-Sheng
格式: text
語言:English
出版: Institutional Knowledge at Singapore Management University 2010
主題:
在線閱讀:https://ink.library.smu.edu.sg/sis_research/515
https://ink.library.smu.edu.sg/context/sis_research/article/1514/viewcontent/Effective_Music_Tagging_through_Advanced_Statistical_Modeling__edited_.pdf
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:Music information retrieval (MIR) holds great promise as a technology for managing large music archives. One of the key components of MIR that has been actively researched into is music tagging. While significant progress has been achieved, most of the existing systems still adopt a simple classification approach, and apply machine learning classifiers directly on low level acoustic features. Consequently, they suffer the shortcomings of (1) poor accuracy, (2) lack of comprehensive evaluation results and the associated analysis based on large scale datasets, and (3) incomplete content representation, arising from the lack of multimodal and temporal information integration. In this paper, we introduce a novel system called MMTagger that effectively integrates both multimodal and temporal information in the representation of music signal. The carefully designed multilayer architecture of the proposed classification framework seamlessly combines Multiple Gaussian Mixture Models (GMMs) and Support Vector Machine (SVM) into a single framework. The structure preserves more discriminative information, leading to more accurate and robust tagging. Experiment results obtained with two large music collections highlight the various advantages of our multilayer framework over state of the art techniques.