Global ETD Search

Return to search

Zvyšování účinnosti strojového rozpoznávání řeči / Enhancing the effectiveness of automatic speech recognition

This work identifies the causes for unsatisfactory reliability of contemporary systems for automatic speech recognition when deployed in demanding conditions. The impact of the individual sources of performance degradation is documented and a list of known methods for their identification from the recognized signal is given. An overview of the usual methods to suppress the impact of the disruptive influences on the performance of speech recognition is provided. The essential contribution of the work is the formulation of new approaches to constructing acoustical models of noisy speech and nonstationary noise allowing high recognition performance in challenging conditions. The viability of the proposed methods is verified on an isolated-word speech recognizer utilizing several-hour-long recording of the real operating room background acoustical noise recorded at the Uniklinikum Marburg in Germany. This work is the first to identify the impact of changes in speaker’s vocal effort on the reliability of automatic speech recognition in the full vocal effort range (i.e. whispering through shouting). A new concept of a speech recognizer immune to the changes in vocal effort is proposed. For the purposes of research on changes in vocal effort, a new speech database, BUT-VE1, was created.

http://www.nusl.cz/ntk/nusl-233578

Identifer	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:233578
Date	January 2012
Creators	Zelinka, Petr
Contributors	Tučková,, Jana, Nouza,, Jan, Sigmund, Milan
Publisher	Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/doctoralThesis
Rights	info:eu-repo/semantics/restrictedAccess

Page generated in 0.0016 seconds

Zvyšování účinnosti strojového rozpoznávání řeči / Enhancing the effectiveness of automatic speech recognition

Description

Links & Downloads

Tags

Additional Fields