Single-Channel Speech Enhancement Using Statistical Modelling

Chehrehsa, Sarang

Single-Channel Speech Enhancement Using Statistical Modelling

aut.embargo	No	en_NZ
aut.thirdpc.contains	No	en_NZ
aut.thirdpc.permission	No	en_NZ
aut.thirdpc.removed	No	en_NZ
dc.contributor.advisor	Moir, Tom
dc.contributor.advisor	Collins, John
dc.contributor.author	Chehrehsa, Sarang
dc.date.accessioned	2017-02-23T22:17:59Z
dc.date.available	2017-02-23T22:17:59Z
dc.date.copyright	2016
dc.date.created	2017
dc.date.issued	2016
dc.date.updated	2017-02-23T21:10:36Z
dc.description.abstract	A new speech enhancement method based on Maximum A-Posteriori (MAP) estimation on Gaussian Mixture Models (GMMs) of speech and different noise types is introduced. The GMMs model the distribution of speech and noise periodograms in a high dimensional space and hence decrease the complexity of estimation procedure. Using the GMMs the Probability Density Functions (PDFs) of clean speech and noise can be calculated and by applying MAP on these PDFs, the estimates of speech and noise periodograms that form the noisy speech periodogram of the observed noisy speech frame can be estimated. These estimates are then used in a Wiener filter to enhance the noisy speech and recover the speech signal as close as possible to the original one. Since the PDFs are complicated and hence the realization of a MAP criterion can become even more complicated, some approximations are used to find the MAP criterion. Some improvements on this MAP estimation based on the characteristics of periodograms are also introduced in which the approximations are improved in a way which leads to more accurate estimates of speech and noise periodograms. Since the accuracy of the introduced MAP estimate is highly dependent on the accuracy of speech and noise power estimation in the noisy frame, a new power estimation method using Gamma modelling is introduced to replace the older methods like Minimum Statistics. The results of all the estimation methods are used in a classic Wiener filter to be applied on the noisy frame to enhance it. Since all the estimation algorithms can have some errors, we introduce an improvement of Wiener filter in which we can attenuate the effect of these errors on the enhanced speech signal. The performance of all the introduced methods are analyzed in terms of quality and intelligibility and reported thus.	en_NZ
dc.identifier.uri	https://hdl.handle.net/10292/10339
dc.language.iso	en	en_NZ
dc.publisher	Auckland University of Technology
dc.rights.accessrights	OpenAccess
dc.subject	Speech enhancement	en_NZ
dc.subject	Gaussian Mixture Modelling	en_NZ
dc.subject	Wiener filter	en_NZ
dc.subject	Maximum A-Posteriori estimation	en_NZ
dc.title	Single-Channel Speech Enhancement Using Statistical Modelling	en_NZ
dc.type	Thesis
thesis.degree.grantor	Auckland University of Technology
thesis.degree.level	Doctoral Theses
thesis.degree.name	Doctor of Philosophy	en_NZ

Files

Original bundle

Now showing 1 - 1 of 1

Name:: ChehrehsaS.pdf
Size:: 6.37 MB
Format:: Adobe Portable Document Format
Description:: Whole thesis

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 889 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Doctoral Theses