Global ETD Search

1	[en] EFFICIENT FEATURES AND INTERPOLATION DOMAINS IN DISTRIBUTED SPEECH RECOGNITION / [pt] ATRIBUTOS E DOMÍNIOS DE INTERPOLAÇÃO EFICIENTES EM RECONHECIMENTO DE VOZ DISTRIBUÍDO
VLADIMIR FABREGAS SURIGUE DE ALENCAR, 01 April 2005
The rapid growth of the Internet and of cellular mobile communication systems has stimulated great interest in speech processing applications for these networks. A particularly important problem in this field is speech recognition at a server, based on acoustic parameters computed and quantised at the user terminal (Distributed Speech Recognition). Since these parameters are generally not the most suitable speech features for the remote recognition system, it is important to examine different transformations of them that allow better recogniser performance. This dissertation addresses the extraction of efficient recognition features from the parameters of the coders used in cellular mobile networks and IP networks. In addition, because the rate at which parameters are supplied to the speech recogniser is normally higher than the rate at which the coders generate them, it is important to analyse the effect of parameter interpolation on the performance of the recognition system, as well as the best domain in which to carry out this interpolation. These topics are also covered in this dissertation.
Keywords: CELLULAR MOBILE NETWORKS; IP NETWORK; DISTRIBUTED SPEECH RECOGNITION; HMM; LSF
2	[en] LOW RATE CODECS OPERATING IN NOISY ENVIRONMENT AND IP NETWORKS / [pt] CODIFICADORES DE VOZ A BAIXAS TAXAS OPERANDO EM AMBIENTES RUIDOSOS E REDES IP
FRED BERKOWICZ BORGES, 19 April 2005
This work investigates the impact of LSF vector quantisation on speech quality in low-rate codecs operating over IP networks and in various noisy environments. Different multistage vector quantisation (VQ) schemes with tree search are considered, involving memoryless VQ and switched-predictive VQ with 2 and 4 classes. The frame loss distribution in IP networks was modelled according to the Gilbert Model, and performance was evaluated in terms of both spectral distortion and the resulting speech quality of the low-rate codecs. The quality of the coded speech was also evaluated after applying a noise suppression technique based on wavelet transforms (Wavelet Denoising).
Keywords: VECTOR QUANTISATION; IP NETWORK; LSF; FRAME LOSS; WAVELET DENOISING
