Global ETD Search

Return to search

Speech Coder using Line Spectral Frequencies of Cascaded Second Order Predictors

A major objective in speech coding is to represent speech with as few bits as possible. Usual transmission parameters include auto regressive parameters, pitch parameters, excitation signals and excitation gains. The pitch predictor makes these coders sensitive to channel errors. Aiming for robustness to channel errors, we do not use pitch prediction and compensate for its lack with a better representation of the excitation signal. We propose a new speech coding approach, Vector Sum Excited Cascaded Linear Prediction (VSECLP), based on code excited linear prediction.

We implement forward linear prediction using five cascaded second order sections - parameterized in terms of line spectral frequency - in place of the conventional tenth order filter. The line spectral frequency parameters estimated by the Direct Line Spectral Frequency (DLSF) adaptation algorithm are closer to the true values than those estimated by the Cascaded Recursive Least Squares - Subsection algorithm. A simplified version of DLSF is proposed to further reduce computational complexity.

Split vector quantization is used to quantize the line spectral frequency parameters and vector sum codebooks to quantize the excitation signals. The effect on reconstructed speech quality and transmission rate, of an increased number of bits and differently split combinations, is analyzed by testing VSECLP on the TIMIT database. The quantization of the excitation vectors using the discrete cosine transform resulted in segmental signal to noise ratio of 4 dB at 20.95 kbps, whereas the same quality was obtained at 9.6 kbps using vector sum codebooks. / Master of Science

Vector Quantization

Speech Coding

Cascaded Second Order Predictors

Linear Prediction

Line Spectral Frequencies

Identifer	oai:union.ndltd.org:VTETD/oai:vtechworks.lib.vt.edu:10919/35670
Date	14 November 2001
Creators	Namburu, Visala
Contributors	Electrical and Computer Engineering, Beex, A. A. Louis, Baumann, William T., Woerner, Brian D.
Publisher	Virginia Tech
Source Sets	Virginia Tech Theses and Dissertation
Detected Language	English
Type	Thesis
Format	application/pdf
Rights	In Copyright, http://rightsstatements.org/vocab/InC/1.0/
Relation	VN_etd.pdf

Page generated in 0.0027 seconds

Speech Coder using Line Spectral Frequencies of Cascaded Second Order Predictors

Description

Links & Downloads

Tags

Additional Fields