Global ETD Search

Return to search

Turkish Large Vocabulary Continuous Speech Recognition By Using Limited Audio Corpus

Speech recognition in Turkish Language is a challenging problem in several perspectives. Most of the challenges are related to the morphological structure of the language. Since Turkish is an agglutinative language, it is possible to generate many words from a single stem by using suffixes. This characteristic of the language increases the out-of-vocabulary (OOV) words, which degrade the performance of a speech recognizer dramatically. Also, Turkish language allows words to be ordered in a free manner, which makes it difficult to generate robust language models.

In this thesis, the existing models and approaches which address the problem of Turkish LVCSR (Large Vocabulary Continuous Speech Recognition) are explored. Different recognition units (words, morphs, stem and endings) are used in
generating the n-gram language models. 3-gram and 4-gram language models are generated with respect to the recognition unit.

Since the solution domain of speech recognition is involved with machine learning, the performance of the recognizer depends on the sufficiency of the audio data used in acoustic model training. However, it is difficult to obtain rich audio corpora for
the Turkish language. In this thesis, existing approaches are used to solve the problem of Turkish LVCSR by using a limited audio corpus. We also proposed several data selection approaches in order to improve the robustness of the acoustic
model.

http://etd.lib.metu.edu.tr/upload/12614207/index.pdf

QA Computer Software 76.75-76.765

Identifer	oai:union.ndltd.org:METU/oai:etd.lib.metu.edu.tr:http://etd.lib.metu.edu.tr/upload/12614207/index.pdf
Date	01 March 2012
Creators	Susman, Derya
Contributors	Yazici, Adnan
Publisher	METU
Source Sets	Middle East Technical Univ.
Language	English
Detected Language	English
Type	M.S. Thesis
Format	text/pdf
Rights	To liberate the content for public access

Page generated in 0.0025 seconds

Turkish Large Vocabulary Continuous Speech Recognition By Using Limited Audio Corpus

Description

Links & Downloads

Tags

Additional Fields