Next Bar Predictor: An Architecture in Automated Music Generation

Music generation has been an active field of research in computer science and is considered as a creative task attempting to imitate human creativity. With the different approaches to generate musical content, recent works have focused on general adversarial networks. One of these is the Midinet, wh...

Full description

Saved in:
Bibliographic Details
Main Authors: Dungan, Belinda M., Fernandez, Proceso L, Jr
Format: text
Published: Archīum Ateneo 2020
Subjects:
Online Access:https://archium.ateneo.edu/discs-faculty-pubs/229
https://ieeexplore.ieee.org/document/9182341
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Ateneo De Manila University
id ph-ateneo-arc.discs-faculty-pubs-1235
record_format eprints
spelling ph-ateneo-arc.discs-faculty-pubs-12352022-01-10T09:23:01Z Next Bar Predictor: An Architecture in Automated Music Generation Dungan, Belinda M. Fernandez, Proceso L, Jr Music generation has been an active field of research in computer science and is considered as a creative task attempting to imitate human creativity. With the different approaches to generate musical content, recent works have focused on general adversarial networks. One of these is the Midinet, which is considered the baseline model in this study. In this paper, we propose our Next Bar Predictor, a generative model that creates melody one bar at a time using the previous bar as basis to generate aesthetically pleasing melodies. We explore several variants of this by experimenting on different regression and classification models such as Decision Trees (DT), K-Nearest Neighbors (KNN), and Multilayer Perceptron (MLP). The models were trained using the dataset from Theorytab which consists of 460 songs. The outputs of these different variant models were then compared against those from the Midinet, using both machine-based objective scoring mechanism as well as human-based subjective evaluations. The dissimilarity scores obtained by our KNN (0.65) and DT (0.74) models, scored against the melodies in the dataset, are sufficiently high and indicates that both models are generally creative. Furthermore, based on the evaluation by human listeners, the melodies generated by our DT models are more realistic and pleasing than those of the Midinet. Casual listeners also prefer the DT model to be more interesting, although professional listeners think otherwise. Finally, all the variant models, when compared with Midinet, require much less training time and computational power. The proposed Next Bar Predictor is therefore a viable alternative for automated music generation. 2020-01-01T08:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/229 https://ieeexplore.ieee.org/document/9182341 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo bars computational modeling predictive models decision trees multilayer perceptrons training machine learning computer generated music artificial intelligence Artificial Intelligence and Robotics Computer Sciences Music
institution Ateneo De Manila University
building Ateneo De Manila University Library
continent Asia
country Philippines
Philippines
content_provider Ateneo De Manila University Library
collection archium.Ateneo Institutional Repository
topic bars
computational modeling
predictive models
decision trees
multilayer perceptrons
training
machine learning
computer generated music
artificial intelligence
Artificial Intelligence and Robotics
Computer Sciences
Music
spellingShingle bars
computational modeling
predictive models
decision trees
multilayer perceptrons
training
machine learning
computer generated music
artificial intelligence
Artificial Intelligence and Robotics
Computer Sciences
Music
Dungan, Belinda M.
Fernandez, Proceso L, Jr
Next Bar Predictor: An Architecture in Automated Music Generation
description Music generation has been an active field of research in computer science and is considered as a creative task attempting to imitate human creativity. With the different approaches to generate musical content, recent works have focused on general adversarial networks. One of these is the Midinet, which is considered the baseline model in this study. In this paper, we propose our Next Bar Predictor, a generative model that creates melody one bar at a time using the previous bar as basis to generate aesthetically pleasing melodies. We explore several variants of this by experimenting on different regression and classification models such as Decision Trees (DT), K-Nearest Neighbors (KNN), and Multilayer Perceptron (MLP). The models were trained using the dataset from Theorytab which consists of 460 songs. The outputs of these different variant models were then compared against those from the Midinet, using both machine-based objective scoring mechanism as well as human-based subjective evaluations. The dissimilarity scores obtained by our KNN (0.65) and DT (0.74) models, scored against the melodies in the dataset, are sufficiently high and indicates that both models are generally creative. Furthermore, based on the evaluation by human listeners, the melodies generated by our DT models are more realistic and pleasing than those of the Midinet. Casual listeners also prefer the DT model to be more interesting, although professional listeners think otherwise. Finally, all the variant models, when compared with Midinet, require much less training time and computational power. The proposed Next Bar Predictor is therefore a viable alternative for automated music generation.
format text
author Dungan, Belinda M.
Fernandez, Proceso L, Jr
author_facet Dungan, Belinda M.
Fernandez, Proceso L, Jr
author_sort Dungan, Belinda M.
title Next Bar Predictor: An Architecture in Automated Music Generation
title_short Next Bar Predictor: An Architecture in Automated Music Generation
title_full Next Bar Predictor: An Architecture in Automated Music Generation
title_fullStr Next Bar Predictor: An Architecture in Automated Music Generation
title_full_unstemmed Next Bar Predictor: An Architecture in Automated Music Generation
title_sort next bar predictor: an architecture in automated music generation
publisher Archīum Ateneo
publishDate 2020
url https://archium.ateneo.edu/discs-faculty-pubs/229
https://ieeexplore.ieee.org/document/9182341
_version_ 1722366422974201856