PITCH ESTIMATION FOR NOISY SPEECH

KHURSHID, AZAR

dc.contributor.author	KHURSHID, AZAR
dc.contributor.other	Faculty of Science and Engineering	en_US
dc.date.accessioned	2013-09-13T10:42:02Z
dc.date.available	2013-09-13T10:42:02Z
dc.date.issued	2002
dc.identifier	NOT AVAILABLE	en_US
dc.identifier.uri	http://hdl.handle.net/10026.1/1692
dc.description.abstract	In this dissertation a biologically plausible system of pitch estimation is proposed. The system is designed from the bottom up to be robust to challenging noise conditions. This robustness to the presence of noise in the signal is achieved by developing a new representation of the speech signal, based on the operation of damped harmonic oscillators, and temporal mode analysis of their output. This resulting representation is shown to possess qualities which are not degraded in presence of noise. A harmonic grouping based system is used to estimate the pitch frequency. A detailed statistical analysis is performed on the system, and performance compared with some of the most established and recent pitch estimation and tracking systems. The detailed analysis includes results of experiments with a variety of noises with a large range of signal to noise ratios, under different signal conditions. Situations where the interfering "noise" is speech from another speaker are also considered. The proposed system is able to estimate the pitch of both the main speaker, and the interfering speaker, thus emulating the phenomena of auditory streaming and "cocktail party effect" in terms of pitch perception. The results of the extensive statistical analysis show that the proposed system exhibits some very interesting properties in its ability of handling noise. The results also show that the proposed system’s overall performance is much better than any of the other systems tested, especially in presence of very large amounts of noise. The system is also shown to successfully simulate some very interesting psychoacoustical pitch perception phenomena. Through a detailed and comparative computational requirements analysis, it is also demonstrated that the proposed system is comparatively inexpensive in terms of processing and memory requirements.	en_US
dc.language.iso	en	en_US
dc.publisher	University of Plymouth	en_US
dc.title	PITCH ESTIMATION FOR NOISY SPEECH	en_US
dc.type	Thesis
plymouth.version	Full version	en_US
dc.identifier.doi	http://dx.doi.org/10.24382/3425

Files in this item

Name:: AZAR KHURSHID.PDF
Size:: 7.974Mb
Format:: PDF
Description:: THESIS

View/Open

Name:: license.txt
Size:: 3.285Kb
Format:: Text file

View/Open

This item appears in the following Collection(s)

01 Research Theses Main Collection
Research Theses Main

Show simple item record