Global ETD Search

Return to search

Processing and analysis of sounds signals by Huang transform (Empirical Mode Decomposition: EMD)

This dissertation explores the potential of EMD as analyzing tool for audio and speech processing. This signal expansion into IMFs is adaptive and without any prior assumptions (stationarity and linearity) on the signal to be analyzed. Salient properties of EMD such as dyadic filter bank structure, quasi-symmetry of IMF and fully description of IMF by its extrema, are exploited for denoising, coding and watermarking purposes. In speech signals denoising, we initially proposed a technique based on IMFs thresholding. A comparative analysis of performance of this technique compared to the denoising technique based on the wavelet. Then, to remedy the problem of the MMSE filters which requires an estimation of the spectral properties of noise, we introduced the ACWA filter in the denoising procedure. The proposed approach is consisted to filter all IMFs of the noisy signal by ACWA filter. This filtering approach is implemented in the time domain, and also applicable in the context of colored noise. Finally, to handle the case of hybrid speech frames, that is composed of voiced and unvoiced speech, we introduced a stationarity index in the denoising approach to detect the transition between the mixture of voiced and unvoiced sounds. In audio signals coding, we proposed four compression approaches. The first two approaches are based on the EMD, and the other two approaches exploit the EMD in association with Hilbert transform. In particular, we proposed to use a predictive coding of the instantaneous amplitude and frequency of the IMFs Finally, we studied the problem of audio signals watermarking in context of copyright protection. The number of IMFs can be variable depending on the attack type. The proposed approach involves inserting the mark in the extrema of last IMFs. In addition, we introduced a synchronization code in the procedure in order to facility the extraction of the mark. These contributions are illustrated on synthetic and real data and results compared to well established methods such as MMSE filter, wavelets approach, MP3 and AAC coders showing the good performances of EMD based signal processes. These findings demonstrate the real potential of EMD as analyzing tool (in adaptive way) in speech and audio processing.

Instantaneous Amplitude

Instantaneous Frequency

Identifer	oai:union.ndltd.org:CCSD/oai:tel.archives-ouvertes.fr:tel-00719637
Date	20 January 2012
Creators	Khaldi, Kais
Source Sets	CCSD theses-EN-ligne, France
Language	English
Detected Language	English
Type	PhD thesis

Page generated in 0.0019 seconds

Processing and analysis of sounds signals by Huang transform (Empirical Mode Decomposition: EMD)

Description

Links & Downloads

Tags

Additional Fields