Music analysis and similarities measure

As data becomes vastly available in the digital form, the integrity, organizing and searching for these data degrades. This is especially true for digital media data which users can generate and reproduce easily. This project aims to research on current audio analysis techniques and machine learning...

Full description

Saved in:
Bibliographic Details
Main Author: Cheong, Yong Hon.
Other Authors: Chan Syin
Format: Final Year Project
Language:English
Published: 2011
Subjects:
Online Access:http://hdl.handle.net/10356/46455
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-46455
record_format dspace
spelling sg-ntu-dr.10356-464552023-03-03T20:35:44Z Music analysis and similarities measure Cheong, Yong Hon. Chan Syin School of Computer Engineering Centre for Multimedia and Network Technology DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition As data becomes vastly available in the digital form, the integrity, organizing and searching for these data degrades. This is especially true for digital media data which users can generate and reproduce easily. This project aims to research on current audio analysis techniques and machine learning algorithms to develop a Java application to help users better manage their audio media files. This is done by allowing automated classification of presented .mp3 audio files into different genres of music, and other audio similarity related functions. Experiments are also conducted to find out how these tasks can be improved to yield better accuracy. This report documents the application designed, its functionalities in addition to classification, and experiments carried out on it. Each audio file is made up of a set of audio samples at a predefined rate in hertz. The developed application “JClassifier” first computes a set of meaningful features from a set of audio samples segmented into different analysis frames, features extracted include Mel Frequency Cepstral Coefficients, Spectral dimensional features, Linear Predictive Coding and Methods of Moments, which will form a ‘signature’ for each audio. Each feature represents the audio in specific areas such as pitch, melody, beats and timbre of the sound. Classification of the audio is then carried out using these signatures using an ensemble of commonly used classifiers which are Support Vector Machines, K-Nearest Neighbour, and Artificial Neural Network. The system has been trained using a well labelled GTZAN dataset consisting of 1000 music pieces divided into 10 genres. Parameters like the window size, overlap ratio, and processing segment length for the audio stream and combinations of feature extracted is experimented to find out if these factors coupled with the use of ensemble classifier can improve the classification results. Experiment results show improved performance in different parameters setup and better classification results using an ensemble of classifier for the classification task. Bachelor of Engineering (Computer Science) 2011-12-06T03:50:53Z 2011-12-06T03:50:53Z 2011 2011 Final Year Project (FYP) http://hdl.handle.net/10356/46455 en Nanyang Technological University 69 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Cheong, Yong Hon.
Music analysis and similarities measure
description As data becomes vastly available in the digital form, the integrity, organizing and searching for these data degrades. This is especially true for digital media data which users can generate and reproduce easily. This project aims to research on current audio analysis techniques and machine learning algorithms to develop a Java application to help users better manage their audio media files. This is done by allowing automated classification of presented .mp3 audio files into different genres of music, and other audio similarity related functions. Experiments are also conducted to find out how these tasks can be improved to yield better accuracy. This report documents the application designed, its functionalities in addition to classification, and experiments carried out on it. Each audio file is made up of a set of audio samples at a predefined rate in hertz. The developed application “JClassifier” first computes a set of meaningful features from a set of audio samples segmented into different analysis frames, features extracted include Mel Frequency Cepstral Coefficients, Spectral dimensional features, Linear Predictive Coding and Methods of Moments, which will form a ‘signature’ for each audio. Each feature represents the audio in specific areas such as pitch, melody, beats and timbre of the sound. Classification of the audio is then carried out using these signatures using an ensemble of commonly used classifiers which are Support Vector Machines, K-Nearest Neighbour, and Artificial Neural Network. The system has been trained using a well labelled GTZAN dataset consisting of 1000 music pieces divided into 10 genres. Parameters like the window size, overlap ratio, and processing segment length for the audio stream and combinations of feature extracted is experimented to find out if these factors coupled with the use of ensemble classifier can improve the classification results. Experiment results show improved performance in different parameters setup and better classification results using an ensemble of classifier for the classification task.
author2 Chan Syin
author_facet Chan Syin
Cheong, Yong Hon.
format Final Year Project
author Cheong, Yong Hon.
author_sort Cheong, Yong Hon.
title Music analysis and similarities measure
title_short Music analysis and similarities measure
title_full Music analysis and similarities measure
title_fullStr Music analysis and similarities measure
title_full_unstemmed Music analysis and similarities measure
title_sort music analysis and similarities measure
publishDate 2011
url http://hdl.handle.net/10356/46455
_version_ 1759856601595904000