Global ETD Search

Return to search

Vliv akustiky prostředí na úspěšnost rozpoznávače řeči / Impact of Environment Acoustics on Speech Recognition Accuracy

This diploma thesis deals with impact of room acoustics on automatic speech recognition (ASR) accuracy. Experiments were evaluated on speech corpus LibriSpeech and database of impulse responses and noise called ReverbDB. Used ASRs were based on Mini LibriSpeech recipe for Kaldi. First it was examined how well can ASR learn to transcribe in selected environments by using the same acoustic conditions during training and testing. Next, experiments were carried out with modifications of ASR architecture in order to achieve better robustness against new conditions by using methods for adapation to room acoustics - r-vectors and i-vectors. It was shown that recently proposed method of r-vectors is beneficial even when using real impulse responses for data augmentation.

http://www.nusl.cz/ntk/nusl-445549

Identifer	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:445549
Date	January 2021
Creators	Paliesek, Jakub
Contributors	Karafiát, Martin, Szőke, Igor
Publisher	Vysoké učení technické v Brně. Fakulta informačních technologií
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess

Page generated in 0.0019 seconds

Vliv akustiky prostředí na úspěšnost rozpoznávače řeči / Impact of Environment Acoustics on Speech Recognition Accuracy

Description

Links & Downloads

Tags

Additional Fields