Data Augmentation Strategies for Music Composition using Generative Adversarial Networks

Matheus Bitarães; Frederico Guimarães; Frederico Coelho

doi:10.20906/CBA2022/3270

Matheus Bitarães Machine Intelligence and Data Science Laboratory (MINDS), Graduate Program in Electrical Engineering - Universidade Federal de Minas Gerais - Av. Antônio Carlos 6627, 31270-901, Belo Horizonte, MG
Frederico Guimarães Machine Intelligence and Data Science Laboratory (MINDS), UFMG
Frederico Coelho Computational Intelligence Laboratory (LITC), UFMG

DOI: https://doi.org/10.20906/CBA2022/3270

Keywords: Generative Adversarial Networks, Neural Network, Generative Music, Data Augmentation, Algorithmic Music Composition, Agorithmic Art, Machine Learning

Abstract

The field of Algorithmic Art has been following technological advances in Artificial Intelligence and, as Generative Adversarial Networks (GANs) have become popular, applications on art generation began to emerge. For most deep neural networks, large amounts of training data are essential to achieve satisfactory model quality. But there are cases, such as in MIDI musical melodies, where it is not trivial to acquire data in such a high volume. Data augmentation strategies play an important role on these cases. This paper presents a data augmentation pipeline, composed of three strategies, with the objective of improving the quality of a GAN- based musical melody generator. The proposed data augmentation pipeline was compared with a non-augmented dataset and a replicated dataset, which had the same size of the augmented dataset, but composed only of replicas. From the statistical tests performed it can be stated that the augmented dataset outperformed the non-augmented dataset and the replicated dataset, when evaluating the Fréchet Inception distance (FID) score.