Return to search

Robust Single-Channel Speech Enhancement and Speaker Localization in Adverse Environments

In speech communication systems such as voice-controlled systems, hands-free mobile telephones and hearing aids, the received signals are degraded by room reverberation and background noise. This degradation can reduce the perceived quality and intelligibility of the speech, and decrease the performance of speech enhancement and source localization. These problems are difficult to solve due to the colored and nonstationary nature of the speech signals, and features of the Room Impulse Response (RIR) such as its long duration and non-minimum phase. In this dissertation, we focus on two topics of speech enhancement and speaker localization in noisy reverberant environments.

A two-stage speech enhancement method is presented
to suppress both early and late reverberation in noisy speech using only one microphone. It is shown that this method works well even in highly reverberant rooms.
Experiments under different acoustic conditions confirm that the proposed blind method is superior in terms of reducing early and late reverberation effects and noise compared to other well known single-microphone techniques in the literature.

Time Difference Of Arrival (TDOA)-based methods usually provide the most accurate source localization in adverse conditions. The key issue for these methods is to accurately estimate the TDOA using the smallest number of microphones.
Two robust Time Delay Estimation (TDE) methods are proposed which use the information from only two microphones. One method is based on adaptive inverse filtering which provides superior performance even in highly reverberant and moderately noisy conditions. It also has negligible failure estimation which makes it a reliable method in realistic environments. This method has high computational complexity due to the estimation in the first stage for the first microphone. As a result, it can not be applied in time-varying environments and real-time applications. Our second method improves this problem by introducing two effective preprocessing stages for the conventional Cross Correlation (CC)-based methods. The results obtained in different noisy reverberant conditions including a real and time-varying environment demonstrate that the proposed methods are superior compared to the conventional TDE methods. / Graduate / 2015-04-23 / 0544 / 0984 / saeed.mosayyebpour@gmail.com

  1. http://hdl.handle.net/1828/5342
  2. S. Mosayyebpour, M. Esmaeili, and T. A. Gulliver, "Single-Microphone Early and Late Reverberation Suppression in Noisy Speech," IEEE Trans. Audio, Speech, Lang. Process.,vol. 21, no. 2, pp. 322-335, Feb. 2013.
  3. S. Mosayyebpour, H. Sheikhzadeh, T. A. Gulliver, and M. Esmaeili, "Single- Microphone LP Residual Skewness-based Approach for Inverse Filtering of Room Impulse Response," IEEE Trans. Audio, Speech and Lang. Process., vol. 20, pp. 1617-1632, July 2012.
  4. S. Mosayyebpour, A. Keshavarz, M. Biguesh, T. A. Gulliver, and M. Esmaeili "Speech-Model based Accurate Blind Reverberation Time Estimation Using an LPC Filter," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 6, pp. 1884-1893, Aug. 2012.
  5. S. Mosayyebpour, H. Lohrasbipeydeh, M. Esmaeili, and T. A. Gulliver, "Time Delay Estimation via Minimum-Phase and All-Pass Component Processing," in Proc. IEEE Int. Conf. Acoustics, Speech and Signal Process. (ICASSP), Vancouver, BC, pp. 4285-4289, May 2013.
  6. S. Mosayyebpour, T. A. Gulliver, and M. Esmaeili, "Single-Microphone Speech Enhancement by Skewness Maximization and Spectral Subtraction," International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1-4, Sep. 2012.
  7. S. Mosayyebpour, A. Sayyadiyan, M. Zareian, and A. Shahbazi, "Single Channel Inverse Filtering of Room Impulse Response by Maximizing Skewness of LP Residual," IEEE Int. Conf. on Signal Acquisition and Process. (ICSAP), pp. 130-134, Feb. 2010.
  8. S. Mosayyebpour, A. Sayyadiyan, E. Soltan Mohammadi, A. Shahbazi, and A. Keshavarz, "Time Delay Estimation using One Microphone Inverse Filtering in a Highly Reverberant Room," Proc. IEEE Int. Conf. on Signal Acquisition and Process. (ICSAP), pp. 140-144, Feb. 2010.
Identiferoai:union.ndltd.org:LACETR/oai:collectionscanada.gc.ca:BVIV.1828/5342
Date30 April 2014
CreatorsMosayyebpour, Saeed
ContributorsGulliver, T. Aaron, Esmaeili, Morteza
Source SetsLibrary and Archives Canada ETDs Repository / Centre d'archives des thèses électroniques de Bibliothèque et Archives Canada
LanguageEnglish, English
Detected LanguageEnglish
TypeThesis
RightsAvailable to the World Wide Web, http://creativecommons.org/publicdomain/zero/1.0/

Page generated in 0.003 seconds