Abstract

This thesis presents efforts to lay the foundations for an artificial-intelligence musical composition system conceived on similar principles to DeepDream, a revolutionary computer vision process. The theoretical system would be designed to perform stylistic feature transfer between existing musical pieces and, eventually, to compose original music either autonomously or in collaboration with human musicians and composers. In this thesis, the analysis and feature-recognition systems necessary for this long-term goal are constructed using neural networks. DeepDream originated as a way of visualising the weights inside neural network layers – matrices of variables containing the data that determines what information the network has learned – to better understand the training, and to aid the troubleshooting, of networks trained to classify images. This approach spawned an unexpectedly artistic process in which feature recognition could be used to alter images in a dreamlike fashion, akin to seeing shapes in clouds. The proposed musical version of this process involves analysing sound files and generating spectrograms – pictures of the sound that can be manipulated in much the same way as regular images. As described in this thesis, a sizeable bank of sound samples – individual musical notes from a selection of instruments – has been gathered in pursuit of this application of the DeepDream architecture. These samples are curated, edited and analysed to produce the spectrograms that make up a dataset for neural network training. Using the Python programming language and its machine learning library 'scikit-learn', a rudimentary deep learning system is constructed and trained on the sample spectrograms to learn to classify them. Once this is complete, additional tests are performed to determine the validity and effectiveness of the approach.
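The pipeline the abstract describes (note samples, spectrograms, a scikit-learn classifier) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the thesis implementation: the use of librosa for spectrogram extraction, the mel-spectrogram parameters, the file paths and labels, and the MLPClassifier settings are all assumptions introduced for the example.

```python
# Illustrative sketch only: spectrograms of single-note samples flattened into
# feature vectors and classified by instrument with scikit-learn's MLPClassifier.
import numpy as np
import librosa
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

def spectrogram_features(wav_path, n_mels=64, frames=128):
    """Load one note sample and return a fixed-size log-mel spectrogram, flattened."""
    y, sr = librosa.load(wav_path, sr=22050)
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    log_mel = librosa.power_to_db(mel, ref=np.max)
    # Pad or truncate along time so every sample yields the same feature length.
    if log_mel.shape[1] < frames:
        log_mel = np.pad(log_mel, ((0, 0), (0, frames - log_mel.shape[1])))
    return log_mel[:, :frames].flatten()

# Hypothetical (path, instrument) pairs standing in for the curated sample bank.
samples = [
    ("samples/violin_A4.wav", "violin"),
    ("samples/violin_C5.wav", "violin"),
    ("samples/piano_A4.wav", "piano"),
    ("samples/piano_C5.wav", "piano"),
]
X = np.array([spectrogram_features(path) for path, _ in samples])
y = np.array([label for _, label in samples])

# Hold out part of the data to check that the classifier generalises.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)
clf = MLPClassifier(hidden_layer_sizes=(256, 64), max_iter=500, random_state=0)
clf.fit(X_train, y_train)
print("held-out accuracy:", clf.score(X_test, y_test))
```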

Document Type

Thesis

Publication Date

2019-01-01

DOI

10.24382/628
