Return to search

Odhad přesnosti řečových technologií na základě měření signálové kvality a obsahové bohatosti audia / Estimation of accuracy of speech technologies based on signal quality and audio content richness

This thesis discusses theoretical analysis of the origin of speech, introduces applications of speech technologies and explains the contemporary approach to phonetical transcription of speech recordings. Furthermore, it describes the metrics of audio recordings quality assessment, which is split into two discrete classes. The first one groups signal quality metrics, while the other one groups content richness metrics. The first goal of the practical section is to create a statistical model for accuracy prediction of machine transcription of speech recordings based on a measurement of their quality. The second goal is to evaluate which partial metrics are the most essential for accuracy prediction of machine transcription.

Identiferoai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:413168
Date January 2020
CreatorsNezval, Jiří
ContributorsSmital, Lukáš, Schwarz, Petr
PublisherVysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií
Source SetsCzech ETDs
LanguageCzech
Detected LanguageEnglish
Typeinfo:eu-repo/semantics/masterThesis
Rightsinfo:eu-repo/semantics/restrictedAccess

Page generated in 0.0017 seconds