Convolutional generative adversarial network, via transfer learning, for traditional Scottish music generation

Marchetti, Francesco and Wilson, Callum and Powell, Cheyenne and Minisci, Edmondo and Riccardi, Annalisa; Romero, Juan and Martins, Tiago and Rodríguez-Fernández, Nereida, eds. (2021) Convolutional generative adversarial network, via transfer learning, for traditional Scottish music generation. In: Artificial Intelligence in Music, Sound, Art and Design. Lecture Notes in Computer Science . Springer, ESP, pp. 187-202. ISBN 9783030729141 (https://doi.org/10.1007/978-3-030-72914-1_13)

[thumbnail of Marchetti-etal-EvoMUSART-2021-Convolutional-generative-adversarial-network-via-transfer-learning]

Preview

Text. Filename: Marchetti_etal_EvoMUSART_2021_Convolutional_generative_adversarial_network_via_transfer_learning.pdf
Accepted Author Manuscript
Download (958kB)| Preview

Abstract

The concept of a Binary Multi-track Sequential Generative Adversarial Network (BinaryMuseGAN) used for the generation of music has been applied and tested for various types of music. However, the concept is yet to be tested on more specific genres of music such as traditional Scottish music, for which extensive collections are not readily available. Hence exploring the capabilities of a Transfer Learning (TL) approach on these types of music is an interesting challenge for the methodology. The curated set of MIDI Scottish melodies was preprocessed in order to obtain the same number of tracks used in the BinaryMuseGAN model; converted into pianoroll format and then used as training set to fine tune a pretrained model, generated from the Lakh MIDI dataset. The results obtained have been compared with the results obtained by training the same GAN model from scratch on the sole Scottish music dataset. Results are presented in terms of variation and average performances achieved at different epochs for five performance metrics, three adopted from the Lakh dataset (qualified note rate, polyphonicity, tonal distance) and two custom defined to highlight Scottish music characteristics (dotted rhythm and pentatonic note). From these results, the TL method shows to be more effective, with lower number of epochs, to converge stably and closely to the original dataset reference metrics values.

ORCID iDs

Marchetti, Francesco

, Wilson, Callum

, Powell, Cheyenne, Minisci, Edmondo

and Riccardi, Annalisa

; Romero, Juan, Martins, Tiago and Rodríguez-Fernández, Nereida

Share and Export

Item metadata

Item type:	Book Section
ID code:	75944
Dates:	Date Event 17 May 2021 Published 9 April 2021 Published Online 20 March 2021 Accepted
Subjects:	Science > Mathematics > Electronic computers. Computer science Music and Books on Music > Music
Department:	Faculty of Engineering > Mechanical and Aerospace Engineering Strategic Research Themes > Ocean, Air and Space Strategic Research Themes > Measurement Science and Enabling Technologies
Depositing user:	Pure Administrator
Date deposited:	25 Mar 2021 13:19
Last modified:	21 Oct 2024 00:12
Related URLs:	Event
URI:	https://strathprints.strath.ac.uk/id/eprint/75944

CORE (COnnecting REpositories)