ORCID

Abstract

This work describes a speech fundamental period estimation algorithm that estimates the time of excitation of the vocal tract using a pattern classifier, the multi-layer perceptron (MLP). The pattern classifier was trained using speech semi-automatically labelled by means of an algorithm that makes use of the output from a Laryngograph. Various issues arising in the training of the system were explored. Three basic configurations of the system were compared using different pre-processing strategies. It was found that processing the sampled speech time - waveform directly with the pattern classifier gave better results than using one of two filterbanks. The performance of the algorithm was evaluated against that of a simple peak-picking algorithm and the well known cepstrum algorithm using quantitative frequency contour comparisons. The performance of the new algorithm on a difficult set of test data was shown to be better than the peak-picker and comparable to the cepstrum algorithm. The advantage of the scheme is that fundamental period estimates are made on a period-by-period basis, thus preserving the irregularity in the speech excitation that is lost by techniques that produce as average period estimate. In addition, its simple structure lends itself to real-time implementation (Howard & Walliker, 9; Walliker & Howard, 14).

Publication Date

1991-12-01

Publication Title

IEE Conference Publication

Issue

349

First Page

340

Last Page

344

ISSN

0537-9989

Organisational Unit

School of Engineering, Computing and Mathematics

Share

COinS