ORCID
- Howard, Ian: 0000-0002-6041-9669
Abstract
This work describes a speech fundamental period estimation algorithm that estimates the time of excitation of the vocal tract using a pattern classifier, the multi-layer perceptron (MLP). The pattern classifier was trained using speech semi-automatically labelled by means of an algorithm that makes use of the output from a Laryngograph. Various issues arising in the training of the system were explored. Three basic configurations of the system were compared using different pre-processing strategies. It was found that processing the sampled speech time - waveform directly with the pattern classifier gave better results than using one of two filterbanks. The performance of the algorithm was evaluated against that of a simple peak-picking algorithm and the well known cepstrum algorithm using quantitative frequency contour comparisons. The performance of the new algorithm on a difficult set of test data was shown to be better than the peak-picker and comparable to the cepstrum algorithm. The advantage of the scheme is that fundamental period estimates are made on a period-by-period basis, thus preserving the irregularity in the speech excitation that is lost by techniques that produce as average period estimate. In addition, its simple structure lends itself to real-time implementation (Howard & Walliker, 9; Walliker & Howard, 14).
Publication Date
1991-12-01
Publication Title
IEE Conference Publication
Issue
349
ISSN
0537-9989
Organisational Unit
School of Engineering, Computing and Mathematics
First Page
340
Last Page
344
Recommended Citation
Howard, I. (1991) 'Further developments of a neural network speech fundamental period estimation algorithm', IEE Conference Publication, (349), pp. 340-344. Retrieved from https://pearl.plymouth.ac.uk/secam-research/714