Knowing the wheat from the weeds in noisy speech

Moir, T
Agaiby, H
Item type
Conference Contribution
Degree name
Journal Title
Journal ISSN
Volume Title
International Speech Communication Association (ISCA)

This paper introduces a word boundary detection algorithm that works in a variety of noise conditions including what is commonly called the 'cocktail party' situation. The algorithm uses the direction of the signal as the main criterion for differentiating between desired-speech and background noise. To determine the signal direction the algorithm calculates estimates of the time delay between signals received at two microphones. These time delay estimates together with estimates of the coherence function and signal energy are used to locate word boundaries. The algorithm was tested using speech embedded in different types and levels of noise including car noise, factory noise, babble noise, and competing talkers. The test results showed that the algorithm performs very well under adverse conditions and with SNR down to -14.5dB.

Eurospeech 97. 5th European Conference on Speech Communication and Technology, Rhodes Greece, 1997-09-22 - 1997-09-25, pages 1119 - 1122
Rights statement
Use, reproduction, modification, and other intellectual property rights to content posted on shall be subject to the terms of any applicable approved license. With respect to content posted on the submitting user / member retains ownership of such content.