Global ETD Search

Return to search

Odhad přesnosti řečových technologií na základě měření signálové kvality a obsahové bohatosti audia / Estimation of accuracy of speech technologies based on signal quality and audio content richness

This thesis discusses theoretical analysis of the origin of speech, introduces applications of speech technologies and explains the contemporary approach to phonetical transcription of speech recordings. Furthermore, it describes the metrics of audio recordings quality assessment, which is split into two discrete classes. The first one groups signal quality metrics, while the other one groups content richness metrics. The first goal of the practical section is to create a statistical model for accuracy prediction of machine transcription of speech recordings based on a measurement of their quality. The second goal is to evaluate which partial metrics are the most essential for accuracy prediction of machine transcription.

http://www.nusl.cz/ntk/nusl-413168

Identifer	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:413168
Date	January 2020
Creators	Nezval, Jiří
Contributors	Smital, Lukáš, Schwarz, Petr
Publisher	Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess

Page generated in 0.0058 seconds

Odhad přesnosti řečových technologií na základě měření signálové kvality a obsahové bohatosti audia / Estimation of accuracy of speech technologies based on signal quality and audio content richness

Description

Links & Downloads

Tags

Additional Fields