Latent Walking Techniques for Conditioning GAN-Generated Music

Eisenbeiser, Logan Ryan

Latent Walking Techniques for Conditioning GAN-Generated Music

Files

Eisenbeiser_LR_T_2020.pdf (10.08 MB)

Downloads: 1450

Date

2020-09-21

Authors

Eisenbeiser, Logan Ryan

Publisher

Virginia Tech

Abstract

Artificial music generation is a rapidly developing field focused on the complex task of creating neural networks that can produce realistic-sounding music. Generating music is very difficult; components like long and short term structure present time complexity, which can be difficult for neural networks to capture. Additionally, the acoustics of musical features like harmonies and chords, as well as timbre and instrumentation require complex representations for a network to accurately generate them. Various techniques for both music representation and network architecture have been used in the past decade to address these challenges in music generation.

The focus of this thesis extends beyond generating music to the challenge of controlling and/or conditioning that generation. Conditional generation involves an additional piece or pieces of information which are input to the generator and constrain aspects of the results. Conditioning can be used to specify a tempo for the generated song, increase the density of notes, or even change the genre. Latent walking is one of the most popular techniques in conditional image generation, but its effectiveness on music-domain generation is largely unexplored. This paper focuses on latent walking techniques for conditioning the music generation network MuseGAN and examines the impact of this conditioning on the generated music.

Keywords

Music Generation, Latent Walking, Conditional Generation, Generative Adversarial Network

Persistent link

http://hdl.handle.net/10919/100052

Collections

Masters Theses

Full item page