Sonic Analysis for Machine Learning: Multi-Layer Perceptron Training using Spectrograms

Pearce-Davies, Samuel Louis

dc.contributor.supervisor	Kirke, Alexis
dc.contributor.author	Pearce-Davies, Samuel Louis
dc.contributor.other	Faculty of Arts, Humanities and Business	en_US
dc.date.accessioned	2019-12-10T15:03:41Z
dc.date.available	2019-12-10T15:03:41Z
dc.date.issued	2019
dc.identifier	10599194	en_US
dc.identifier.uri	http://hdl.handle.net/10026.1/15240
dc.description.abstract	This thesis presents efforts to lay the foundations for an Artificial-Intelligence musical compositional system conceived on similar principles to DeepDream, a revolutionary computer vision process. This theoretical system would be designed to engage in stylistic feature transfer between existing musical pieces, and eventually to compose original music either autonomously or in collaboration with human musicians and composers. In this thesis, construction of the analysis and feature recognition systems necessary for this long-term goal is achieved through the use of neural networks. Originally, DeepDream came about as a way of visualising the weights inside neural network layers – matrices of variables containing the data that determines what information the network has learned – for better understanding of training and trouble-shooting of such networks that have been trained to classify images. This approach spawned an unexpectedly artistic process whereby feature recognition could be used to alter images in a dreamlike fashion, akin to seeing shapes in clouds. The proposed musical version of this process involves analysing sound files and generating spectrograms – pictures of the sound that could be manipulated in much the same ways as regular images. As described in this thesis, a sizeable bank of sound samples has been gathered – of individual musical notes from a selection of instruments – in pursuit of this application of the DeepDream architecture. These samples are curated, edited and analysed to produce spectrograms that make up a dataset for neural network training. Using the Python programming language and its machine learning library ‘Scikit Learn’, a rudimentary deep learning system is constructed to be trained on the sample spectrograms and learn to classify them. Once this is complete, additional tests are performed to determine the validity and effectiveness of the approach.	en_US
dc.language.iso	en
dc.publisher	University of Plymouth
dc.subject.classification	ResM	en_US
dc.title	Sonic Analysis for Machine Learning: Multi-Layer Perceptron Training using Spectrograms	en_US
dc.type	Thesis
plymouth.version	publishable	en_US
dc.identifier.doi	http://dx.doi.org/10.24382/628
dc.rights.embargoperiod	No embargo	en_US
dc.type.qualification	Masters	en_US
rioxxterms.version	NA

Files in this item

Name:: 2019PEARCE-DAVIES10599194ResM.pdf
Size:: 7.784Mb
Format:: PDF
Description:: Thesis

View/Open

Name:: license.txt
Size:: 3.016Kb
Format:: Text file

View/Open

This item appears in the following Collection(s)

01 Research Theses Main Collection
Research Theses Main

Show simple item record