Query-by-singing based music retrieval

Imagine a situation where you have heard a new song over the radio or in a music shop which you like it a lot and recorded it down with your mobile phone. And another situation where you remember a part of a song but forgotten the title. In both situations, neither the song title nor artist...

Full description

Saved in:
Bibliographic Details
Main Author: Goh, Li-Xian.
Other Authors: School of Computer Engineering
Format: Final Year Project
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/38814
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-38814
record_format dspace
spelling sg-ntu-dr.10356-388142023-03-03T20:34:18Z Query-by-singing based music retrieval Goh, Li-Xian. School of Computer Engineering Tao Dacheng DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval DRNTU::Engineering::Computer science and engineering::Computer applications::Arts and humanities Imagine a situation where you have heard a new song over the radio or in a music shop which you like it a lot and recorded it down with your mobile phone. And another situation where you remember a part of a song but forgotten the title. In both situations, neither the song title nor artiste name is known, but you would like to retrieve the song. However, the most common method for retrieving a song currently requires the song title or artiste name as the input information. Hence, a query-by-singing based music retrieval system is proposed as a solution for the scenarios above. The system is implemented using MATLAB and is built on a music database of 500 songs. Experiments are conducted to test the accuracy and efficiency of the completed system. The process for implementing the music retrieval system involves segmenting the songs into sentences based on time-coded lyrics. Each sentence is further divided into fixed overlapping frames to perform feature extraction. The Mel-Frequency Cepstral Coefficients (MFCCs) are extracted as the feature from the music database and clustered using K-means algorithm. The resulting data from the clustering will then be processed with Bag-of-Words model to form a histogram for each sentence of a song. The Random Projection Tree (RP-Tree) is used for indexing the histograms for efficient retrieval. Two types of queries are collected for experiments and comprise of recorded playback and a user singing a sentence of a song. The process of audio segmentation, feature extraction and data training is also applied to each query. The system compares a query with the music database to retrieve results that are closest to the query. The results are returned based on the comparison of Chisquare distance between the query input and database. The analysis of the results show that the system level of accuracy for the recorded type of query is reasonably high but it did not perform as well for the singing type of query. The efficiency of the system was greatly improved with the implementation of RP-Tree. Bachelor of Engineering (Computer Science) 2010-05-19T03:25:43Z 2010-05-19T03:25:43Z 2010 2010 Final Year Project (FYP) http://hdl.handle.net/10356/38814 en Nanyang Technological University 61 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
DRNTU::Engineering::Computer science and engineering::Computer applications::Arts and humanities
spellingShingle DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
DRNTU::Engineering::Computer science and engineering::Computer applications::Arts and humanities
Goh, Li-Xian.
Query-by-singing based music retrieval
description Imagine a situation where you have heard a new song over the radio or in a music shop which you like it a lot and recorded it down with your mobile phone. And another situation where you remember a part of a song but forgotten the title. In both situations, neither the song title nor artiste name is known, but you would like to retrieve the song. However, the most common method for retrieving a song currently requires the song title or artiste name as the input information. Hence, a query-by-singing based music retrieval system is proposed as a solution for the scenarios above. The system is implemented using MATLAB and is built on a music database of 500 songs. Experiments are conducted to test the accuracy and efficiency of the completed system. The process for implementing the music retrieval system involves segmenting the songs into sentences based on time-coded lyrics. Each sentence is further divided into fixed overlapping frames to perform feature extraction. The Mel-Frequency Cepstral Coefficients (MFCCs) are extracted as the feature from the music database and clustered using K-means algorithm. The resulting data from the clustering will then be processed with Bag-of-Words model to form a histogram for each sentence of a song. The Random Projection Tree (RP-Tree) is used for indexing the histograms for efficient retrieval. Two types of queries are collected for experiments and comprise of recorded playback and a user singing a sentence of a song. The process of audio segmentation, feature extraction and data training is also applied to each query. The system compares a query with the music database to retrieve results that are closest to the query. The results are returned based on the comparison of Chisquare distance between the query input and database. The analysis of the results show that the system level of accuracy for the recorded type of query is reasonably high but it did not perform as well for the singing type of query. The efficiency of the system was greatly improved with the implementation of RP-Tree.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Goh, Li-Xian.
format Final Year Project
author Goh, Li-Xian.
author_sort Goh, Li-Xian.
title Query-by-singing based music retrieval
title_short Query-by-singing based music retrieval
title_full Query-by-singing based music retrieval
title_fullStr Query-by-singing based music retrieval
title_full_unstemmed Query-by-singing based music retrieval
title_sort query-by-singing based music retrieval
publishDate 2010
url http://hdl.handle.net/10356/38814
_version_ 1759855945146433536