Knowing the wheat from the weeds in noisy speech

Item type

Conference Contribution

Publisher

International Speech Communication Association (ISCA)

Abstract

This paper introduces a word boundary detection algorithm that works in a variety of noise conditions, including what is commonly called the 'cocktail party' situation. The algorithm uses the direction of the signal as the main criterion for differentiating between desired speech and background noise. To determine the signal direction, the algorithm estimates the time delay between the signals received at two microphones. These time delay estimates, together with estimates of the coherence function and the signal energy, are used to locate word boundaries. The algorithm was tested using speech embedded in different types and levels of noise, including car noise, factory noise, babble noise, and competing talkers. The test results showed that the algorithm performs very well under adverse conditions, at SNRs as low as -14.5 dB.
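
The delay-based idea in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not specify the estimator, so a plain time-domain cross-correlation peak is assumed here, the coherence term is left out, and the names `estimate_delay`, `frame_is_target`, `target_lag`, `tolerance`, and `energy_threshold` are hypothetical.

```python
import numpy as np


def estimate_delay(x, y, max_lag):
    """Estimate how many samples y trails x, searching lags in [-max_lag, max_lag].

    Assumption: a simple cross-correlation peak stands in for whatever
    time delay estimator the paper actually uses.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(x)
    lags = np.arange(-max_lag, max_lag + 1)
    scores = []
    for lag in lags:
        if lag >= 0:
            # Compare x[k] with y[k + lag]
            score = np.dot(x[: n - lag], y[lag:])
        else:
            # Compare x[k - lag] with y[k] (lag is negative here)
            score = np.dot(x[-lag:], y[: n + lag])
        scores.append(score)
    return int(lags[int(np.argmax(scores))])


def frame_is_target(x, y, target_lag, tolerance, energy_threshold, max_lag=16):
    """Flag a frame as desired speech when its inter-microphone delay matches
    the expected direction and its energy is high enough.

    All thresholds are illustrative assumptions, not values from the paper.
    """
    delay = estimate_delay(x, y, max_lag)
    energy = float(np.mean(np.asarray(x, dtype=float) ** 2))
    return bool(abs(delay - target_lag) <= tolerance and energy >= energy_threshold)
```

Under these assumptions, a frame whose estimated delay lies within `tolerance` samples of the delay expected for the desired talker's direction, and whose energy exceeds `energy_threshold`, would count as desired speech; runs of such frames would then mark candidate word boundaries.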

Source

Eurospeech 97, 5th European Conference on Speech Communication and Technology, Rhodes, Greece, 1997-09-22 to 1997-09-25, pages 1119-1122

Rights statement

Use, reproduction, modification, and other intellectual property rights to content posted on isca-speech.org shall be subject to the terms of any applicable approved license. With respect to content posted on isca-speech.org, the submitting user or member retains ownership of such content.