Return to search

Robust speaker recognition using both vocal source and vocal tract features estimated from noisy input utterances.

Wang, Ning. / Thesis (M.Phil.)--Chinese University of Hong Kong, 2007. / Includes bibliographical references (leaves 106-115). / Abstracts in English and Chinese. / Chapter 1 --- Introduction --- p.1 / Chapter 1.1 --- Introduction to Speech and Speaker Recognition --- p.1 / Chapter 1.2 --- Difficulties and Challenges of Speaker Authentication --- p.6 / Chapter 1.3 --- Objectives and Thesis Outline --- p.7 / Chapter 2 --- Speaker Recognition System --- p.10 / Chapter 2.1 --- Baseline Speaker Recognition System Overview --- p.10 / Chapter 2.1.1 --- Feature Extraction --- p.12 / Chapter 2.1.2 --- Pattern Generation and Classification --- p.24 / Chapter 2.2 --- Performance Evaluation Metric for Different Speaker Recognition Tasks --- p.30 / Chapter 2.3 --- Robustness of Speaker Recognition System --- p.30 / Chapter 2.3.1 --- Speech Corpus: CU2C --- p.30 / Chapter 2.3.2 --- Noise Database: NOISEX-92 --- p.34 / Chapter 2.3.3 --- Mismatched Training and Testing Conditions --- p.35 / Chapter 2.4 --- Summary --- p.37 / Chapter 3 --- Speaker Recognition System using both Vocal Tract and Vocal Source Features --- p.38 / Chapter 3.1 --- Speech Production Mechanism --- p.39 / Chapter 3.1.1 --- Speech Production: An Overview --- p.39 / Chapter 3.1.2 --- Acoustic Properties of Human Speech --- p.40 / Chapter 3.2 --- Source-filter Model and Linear Predictive Analysis --- p.44 / Chapter 3.2.1 --- Source-filter Speech Model --- p.44 / Chapter 3.2.2 --- Linear Predictive Analysis for Speech Signal --- p.46 / Chapter 3.3 --- Vocal Tract Features --- p.51 / Chapter 3.4 --- Vocal Source Features --- p.52 / Chapter 3.4.1 --- Source Related Features: An Overview --- p.52 / Chapter 3.4.2 --- Source Related Features: Technical Viewpoints --- p.54 / Chapter 3.5 --- Effects of Noises on Speech Properties --- p.55 / Chapter 3.6 --- Summary --- p.61 / Chapter 4 --- Estimation of Robust Acoustic Features for Speaker Discrimination --- p.62 / Chapter 4.1 --- Robust Speech Techniques --- p.63 / Chapter 4.1.1 --- Noise Resilience --- p.64 / Chapter 4.1.2 --- Speech Enhancement --- p.64 / Chapter 4.2 --- Spectral Subtractive-Type Preprocessing --- p.65 / Chapter 4.2.1 --- Noise Estimation --- p.66 / Chapter 4.2.2 --- Spectral Subtraction Algorithm --- p.66 / Chapter 4.3 --- LP Analysis of Noisy Speech --- p.67 / Chapter 4.3.1 --- LP Inverse Filtering: Whitening Process --- p.68 / Chapter 4.3.2 --- Magnitude Response of All-pole Filter in Noisy Condition --- p.70 / Chapter 4.3.3 --- Noise Spectral Reshaping --- p.72 / Chapter 4.4 --- Distinctive Vocal Tract and Vocal Source Feature Extraction . . --- p.73 / Chapter 4.4.1 --- Vocal Tract Feature Extraction --- p.73 / Chapter 4.4.2 --- Source Feature Generation Procedure --- p.75 / Chapter 4.4.3 --- Subband-specific Parameterization Method --- p.79 / Chapter 4.5 --- Summary --- p.87 / Chapter 5 --- Speaker Recognition Tasks & Performance Evaluation --- p.88 / Chapter 5.1 --- Speaker Recognition Experimental Setup --- p.89 / Chapter 5.1.1 --- Task Description --- p.89 / Chapter 5.1.2 --- Baseline Experiments --- p.90 / Chapter 5.1.3 --- Identification and Verification Results --- p.91 / Chapter 5.2 --- Speaker Recognition using Source-tract Features --- p.92 / Chapter 5.2.1 --- Source Feature Selection --- p.92 / Chapter 5.2.2 --- Source-tract Feature Fusion --- p.94 / Chapter 5.2.3 --- Identification and Verification Results --- p.95 / Chapter 5.3 --- Performance Analysis --- p.98 / Chapter 6 --- Conclusion --- p.102 / Chapter 6.1 --- Discussion and Conclusion --- p.102 / Chapter 6.2 --- Suggestion of Future Work --- p.104

Identiferoai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_325986
Date January 2007
ContributorsWang, Ning, Chinese University of Hong Kong Graduate School. Division of Electronic Engineering.
Source SetsThe Chinese University of Hong Kong
LanguageEnglish, Chinese
Detected LanguageEnglish
TypeText, bibliography
Formatprint, xiii, 115 leaves : ill. ; 30 cm.
RightsUse of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Page generated in 0.0021 seconds