Knowing the wheat from the weeds in noisy speech

Date
1997
Authors
Moir, T
Agaiby, H
Item type
Conference Contribution
Publisher
International Speech Communication Association (ISCA)
Abstract

This paper introduces a word boundary detection algorithm that works in a variety of noise conditions, including what is commonly called the 'cocktail party' situation. The algorithm uses the direction of the signal as the main criterion for differentiating between desired speech and background noise. To determine the signal direction, the algorithm estimates the time delay between the signals received at two microphones. These time delay estimates, together with estimates of the coherence function and signal energy, are used to locate word boundaries. The algorithm was tested using speech embedded in different types and levels of noise, including car noise, factory noise, babble noise, and competing talkers. The test results showed that the algorithm performs very well under adverse conditions and at SNRs down to -14.5 dB.
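
The abstract does not give the estimator, the thresholds, or the way the three cues are combined, so the following is only a minimal sketch of the general idea, not the authors' method. It estimates the inter-microphone delay of one frame by brute-force cross-correlation and accepts the frame as desired speech when that delay is close to an expected value and the frame energy exceeds a floor. The function names, the parameters `expected_lag`, `max_lag`, `lag_tol`, and `energy_floor`, and their default values are all hypothetical; the coherence estimate mentioned in the abstract is left out for brevity.

```python
import math
import random


def estimate_delay(x, y, max_lag):
    """Return the lag in samples (positive: y lags x) that maximises
    the cross-correlation between the two microphone frames."""
    n = len(x)
    best_lag, best_score = 0, -math.inf
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += x[i] * y[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def frame_energy(x):
    """Mean squared amplitude of a frame."""
    return sum(v * v for v in x) / len(x)


def is_desired_speech(x, y, expected_lag, max_lag=8, lag_tol=1,
                      energy_floor=1e-3):
    """Hypothetical decision rule (an assumption, not taken from the paper):
    the frame counts as desired speech if its estimated delay is within
    lag_tol samples of expected_lag and its energy is above energy_floor."""
    lag = estimate_delay(x, y, max_lag)
    return abs(lag - expected_lag) <= lag_tol and frame_energy(x) >= energy_floor


# Synthetic example: the second microphone receives the same
# white-noise source three samples later than the first.
rng = random.Random(0)
mic_a = [rng.gauss(0.0, 1.0) for _ in range(256)]
mic_b = [0.0] * 3 + mic_a[:-3]

print(estimate_delay(mic_a, mic_b, max_lag=8))
print(is_desired_speech(mic_a, mic_b, expected_lag=3))
```

In a sketch like this, a word boundary would be declared where consecutive frames switch between accepted and rejected. A practical system would more likely use an FFT-based correlation and a coherence estimate than this quadratic-time loop.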

Source
Eurospeech 97, 5th European Conference on Speech Communication and Technology, Rhodes, Greece, 1997-09-22 to 1997-09-25, pages 1119-1122
Rights statement
Use, reproduction, modification, and other intellectual property rights to content posted on isca-speech.org shall be subject to the terms of any applicable approved license. With respect to content posted on isca-speech.org the submitting user / member retains ownership of such content.