Global ETD Search

Return to search

[en] CONTINUOUS SPEECH RECOGNITION WITH MFCC, SSCH AND PNCC FEATURES, WAVELET DENOISING AND NEURAL NETWORKS / [pt] RECONHECIMENTO DE VOZ CONTÍNUA COM ATRIBUTOS MFCC, SSCH E PNCC, WAVELET DENOISING E REDES NEURAIS

[pt] Um dos maiores desafios na área de reconhecimento de voz contínua é
desenvolver sistemas robustos ao ruído aditivo. Para isso, este trabalho analisa e
testa três técnicas. A primeira delas é a extração de atributos do sinal de voz
usando os métodos MFCC, SSCH e PNCC. A segunda é a remoção de ruído do
sinal de voz via wavelet denoising. A terceira e última é uma proposta original
batizada de feature denoising, que busca melhorar os atributos extraídos usando
um conjunto de redes neurais. Embora algumas dessas técnicas já sejam
conhecidas na literatura, a combinação entre elas trouxe vários resultados
interessantes e inéditos. Inclusive, nota-se que o melhor desempenho vem da
união de PNCC com feature denoising. / [en] One of the biggest challenges on the continuous speech recognition field is
to develop systems that are robust to additive noise. To do so, this work analyses
and tests three techniques. The first one extracts features from the voice signal
using the MFCC, SSCH and PNCC methods. The second one removes noise from
the voice signal through wavelet denoising. The third one is an original one,
called feature denoising, that seeks to improve the extracted features using a set of
neural networks. Although some of these techniques are already known in the
literature, the combination of them brings many interesting and new results. In
fact, it is noticed that the best performance comes from the union of PNCC and
feature denoising.

[pt] RUIDO

[en] NOISE

[pt] RECONHECIMENTO DE VOZ

[en] SPEECH RECOGNITION

[pt] REDE NEURAL

[en] NEURAL NETWORK

Identifer	oai:union.ndltd.org:puc-rio.br/oai:MAXWELL.puc-rio.br:19143
Date	09 February 2012
Creators	JAN KRUEGER SIQUEIRA
Contributors	ABRAHAM ALCAIM
Publisher	MAXWELL
Source Sets	PUC Rio
Language	Portuguese
Detected Language	Portuguese
Type	TEXTO

Page generated in 0.002 seconds

[en] CONTINUOUS SPEECH RECOGNITION WITH MFCC, SSCH AND PNCC FEATURES, WAVELET DENOISING AND NEURAL NETWORKS / [pt] RECONHECIMENTO DE VOZ CONTÍNUA COM ATRIBUTOS MFCC, SSCH E PNCC, WAVELET DENOISING E REDES NEURAIS

Description

Links & Downloads

Tags

Additional Fields