Global ETD Search

Return to search

Sound source segregation of multiple concurrent talkers via Short-Time Target Cancellation

The Short-Time Target Cancellation (STTC) algorithm, developed as part of this dissertation research, is a “Cocktail Party Problem” processor that can boost speech intelligibility for a target talker from a specified “look” direction, while suppressing the intelligibility of competing talkers. The algorithm holds promise for both automatic speech recognition and assistive listening device applications. The STTC algorithm operates on a frame-by-frame basis, leverages the computational efficiency of the Fast Fourier Transform (FFT), and is designed to run in real time. Notably, performance in objective measures of speech intelligibility and sound source segregation is comparable to that of the Ideal Binary Mask (IBM) and Ideal Ratio Mask (IRM). Because the STTC algorithm computes a time-frequency mask that can be applied independently to both the left and right signals, binaural cues for spatial hearing, including Interaural Time Differences (ITDs), Interaural Level Differences (ILDs) and spectral cues, can be preserved in potential hearing aid applications. A minimalist design for a proposed STTC Assistive Listening Device (ALD), consisting of six microphones embedded in the frame of a pair of eyeglasses, is presented and evaluated using virtual room acoustics and both objective and behavioral measures. The results suggest that the proposed STTC ALD can provide a significant speech intelligibility benefit in complex auditory scenes comprised of multiple spatially separated talkers. / 2020-10-22T00:00:00Z

https://hdl.handle.net/2144/32082

Electrical engineering

Binaural hearing

Cocktail party problem

Computational auditory scene analysis

Hearing aid design

Signal processing

Spatial hearing

Identifer	oai:union.ndltd.org:bu.edu/oai:open.bu.edu:2144/32082
Date	22 October 2018
Creators	Cantu, Marcos Antonio
Contributors	Colburn, H. Steven
Source Sets	Boston University
Language	en_US
Detected Language	English
Type	Thesis/Dissertation

Page generated in 0.0024 seconds

Sound source segregation of multiple concurrent talkers via Short-Time Target Cancellation

Description

Links & Downloads

Tags

Additional Fields