Impact of feature selection and kernel functions in classification for MP3 bit rate detection

With the increased availability of audio files from the Internet, the quality of these audio files is of great concern for the audiophiles who download them. The bit rate of MP3 files is used to determine the audio quality. However, if the audio has been transcoded from lower bit rate to higher bit...

Full description

Saved in:
Bibliographic Details
Main Author: Sen, Diana Qiong Ju.
Other Authors: Sabu Emmanuel
Format: Final Year Project
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/42398
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:With the increased availability of audio files from the Internet, the quality of these audio files is of great concern for the audiophiles who download them. The bit rate of MP3 files is used to determine the audio quality. However, if the audio has been transcoded from lower bit rate to higher bit rate, it is unlikely to establish the true audio quality. Thus, investigation on how different kernel functions used in SVM to analyze different spectral density signal features will affect the classification results in determining the various bit rates with reference to an existing study deduction. This project will use three spectral density signal estimation methods namely, Pburg, Spectrogram and Periodogram to obtain feature data from three different audio datasets. Polynomial, radial basis function and sigmoid kernel functions were used in Support Vector Machine (SVM) learning based classifier to analyze the extracted features and recognize patterns with C-Support Vector Classifiers (C-SVC) for training and testing of data sets. In the absence of any coding format knowledge other than the audio frequency signal itself, the analysis returned an average success rate of 98.5% in correctly detecting the original compressed bit rate of an audio file. These success rates are very high with 99.49% also detected from the transcoding of lower bit rate, 128 kbps and 192 kbps to higher bit rate, 320 kbps.