This thesis describes a designed and implemented system for efficient storage, indexing and search in collections of spoken documents that takes advantage of automatic speech recognition. As the quality of current speech recognizers is not sufficient for a great deal of applications, it is necessary to index the ambiguous output of the recognition, i.\,e. the acyclic graphs of word hypotheses -- recognition lattices. Then, it is not possible to directly apply the standard methods known from text--based systems. This paper discusses an optimized indexing system for efficient search in the complex and large data structures which are the output of the recognizer.
Identifer | oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:236845 |
Creators | Fapšo, Michal |
Contributors | Černocký, Jan, Szőke, Igor |
Publisher | Vysoké učení technické v Brně. Fakulta informačních technologií |
Source Sets | Czech ETDs |
Language | Czech |
Detected Language | English |
Type | info:eu-repo/semantics/masterThesis |
Rights | info:eu-repo/semantics/restrictedAccess |
Page generated in 0.0018 seconds