Global ETD Search

Return to search

Modelování jazyka v rozpoznávání češtiny / Language Modeling for Spech Recognition in Czech

This work concerns the problematic of language modeling in automatic speech recognition. Currently widely used techniques for advanced language modeling based on statistical approach are described in the first part of work - class based language models, factored language models and neural network based language models. In the next section, implementation of neural network based language model is described. Results obtained on "Pražský mluvený korpus" and "Brněnský mluvený korpus" corpora (1 170 000 words) are reported, with perplexity reduction around 20%. Also, results obtained after rescoring N-best lists with spontaneous speech are reported, with absolute improvement in accuracy by more than 1%. In the conclusion, possible uses of the work are mentioned, along with possible extensions in the future. Finally, main weaknesses of current statistical language modeling techniques are described.

http://www.nusl.cz/ntk/nusl-236911

Identifer	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:236911
Creators	Mikolov, Tomáš
Contributors	Černocký, Jan, Smrž, Pavel
Publisher	Vysoké učení technické v Brně. Fakulta informačních technologií
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess

Page generated in 0.0014 seconds

Modelování jazyka v rozpoznávání češtiny / Language Modeling for Spech Recognition in Czech

Description

Links & Downloads

Tags

Additional Fields