Next Bar Predictor: An Architecture in Automated Music Generation
Music generation has been an active field of research in computer science and is considered as a creative task attempting to imitate human creativity. With the different approaches to generate musical content, recent works have focused on general adversarial networks. One of these is the Midinet, wh...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Published: |
Archīum Ateneo
2020
|
Subjects: | |
Online Access: | https://archium.ateneo.edu/discs-faculty-pubs/229 https://ieeexplore.ieee.org/document/9182341 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Ateneo De Manila University |
id |
ph-ateneo-arc.discs-faculty-pubs-1235 |
---|---|
record_format |
eprints |
spelling |
ph-ateneo-arc.discs-faculty-pubs-12352022-01-10T09:23:01Z Next Bar Predictor: An Architecture in Automated Music Generation Dungan, Belinda M. Fernandez, Proceso L, Jr Music generation has been an active field of research in computer science and is considered as a creative task attempting to imitate human creativity. With the different approaches to generate musical content, recent works have focused on general adversarial networks. One of these is the Midinet, which is considered the baseline model in this study. In this paper, we propose our Next Bar Predictor, a generative model that creates melody one bar at a time using the previous bar as basis to generate aesthetically pleasing melodies. We explore several variants of this by experimenting on different regression and classification models such as Decision Trees (DT), K-Nearest Neighbors (KNN), and Multilayer Perceptron (MLP). The models were trained using the dataset from Theorytab which consists of 460 songs. The outputs of these different variant models were then compared against those from the Midinet, using both machine-based objective scoring mechanism as well as human-based subjective evaluations. The dissimilarity scores obtained by our KNN (0.65) and DT (0.74) models, scored against the melodies in the dataset, are sufficiently high and indicates that both models are generally creative. Furthermore, based on the evaluation by human listeners, the melodies generated by our DT models are more realistic and pleasing than those of the Midinet. Casual listeners also prefer the DT model to be more interesting, although professional listeners think otherwise. Finally, all the variant models, when compared with Midinet, require much less training time and computational power. The proposed Next Bar Predictor is therefore a viable alternative for automated music generation. 2020-01-01T08:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/229 https://ieeexplore.ieee.org/document/9182341 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo bars computational modeling predictive models decision trees multilayer perceptrons training machine learning computer generated music artificial intelligence Artificial Intelligence and Robotics Computer Sciences Music |
institution |
Ateneo De Manila University |
building |
Ateneo De Manila University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
Ateneo De Manila University Library |
collection |
archium.Ateneo Institutional Repository |
topic |
bars computational modeling predictive models decision trees multilayer perceptrons training machine learning computer generated music artificial intelligence Artificial Intelligence and Robotics Computer Sciences Music |
spellingShingle |
bars computational modeling predictive models decision trees multilayer perceptrons training machine learning computer generated music artificial intelligence Artificial Intelligence and Robotics Computer Sciences Music Dungan, Belinda M. Fernandez, Proceso L, Jr Next Bar Predictor: An Architecture in Automated Music Generation |
description |
Music generation has been an active field of research in computer science and is considered as a creative task attempting to imitate human creativity. With the different approaches to generate musical content, recent works have focused on general adversarial networks. One of these is the Midinet, which is considered the baseline model in this study. In this paper, we propose our Next Bar Predictor, a generative model that creates melody one bar at a time using the previous bar as basis to generate aesthetically pleasing melodies. We explore several variants of this by experimenting on different regression and classification models such as Decision Trees (DT), K-Nearest Neighbors (KNN), and Multilayer Perceptron (MLP). The models were trained using the dataset from Theorytab which consists of 460 songs. The outputs of these different variant models were then compared against those from the Midinet, using both machine-based objective scoring mechanism as well as human-based subjective evaluations. The dissimilarity scores obtained by our KNN (0.65) and DT (0.74) models, scored against the melodies in the dataset, are sufficiently high and indicates that both models are generally creative. Furthermore, based on the evaluation by human listeners, the melodies generated by our DT models are more realistic and pleasing than those of the Midinet. Casual listeners also prefer the DT model to be more interesting, although professional listeners think otherwise. Finally, all the variant models, when compared with Midinet, require much less training time and computational power. The proposed Next Bar Predictor is therefore a viable alternative for automated music generation. |
format |
text |
author |
Dungan, Belinda M. Fernandez, Proceso L, Jr |
author_facet |
Dungan, Belinda M. Fernandez, Proceso L, Jr |
author_sort |
Dungan, Belinda M. |
title |
Next Bar Predictor: An Architecture in Automated Music Generation |
title_short |
Next Bar Predictor: An Architecture in Automated Music Generation |
title_full |
Next Bar Predictor: An Architecture in Automated Music Generation |
title_fullStr |
Next Bar Predictor: An Architecture in Automated Music Generation |
title_full_unstemmed |
Next Bar Predictor: An Architecture in Automated Music Generation |
title_sort |
next bar predictor: an architecture in automated music generation |
publisher |
Archīum Ateneo |
publishDate |
2020 |
url |
https://archium.ateneo.edu/discs-faculty-pubs/229 https://ieeexplore.ieee.org/document/9182341 |
_version_ |
1722366422974201856 |