Global ETD Search

Return to search

A Design of English Speech Recognition System

This thesis investigates the design and implementation strategies for a English speech recognition system. Two speech inputting methods, the spelling inputting and the reading inputting, are implemented for English word recognition and query. Mel-frequency cepstrum coefficients, linear predicted cepstrum coefficients, and hidden Markov model are used as the two feature models and the recognition model respectively. Under the Pentium 1.6 GHz personal computer and Ubuntu 8.04 operating system environment, a 95% correct recognition rate can be obtained for a 110 thousand English word database by the spelling inputting method; and a 93% correct recognition rate can be achieved for a 1,500 English word database by the reading inputting method. The average computation time for each word using either inputting method is about 1.5 seconds.

http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824109-172533

Linear predicted cepstrum coefficients

Hidden Markov model

Mel frequency cepstrum coefficients

Speech recognition

Identifer	oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0824109-172533
Date	24 August 2009
Creators	Chen, Yung-ming
Contributors	Tsung Lee, Chih-Chien Chen, Chii-Maw Uang, Sheau-Shong Bor, Sheau-Shong Bor
Publisher	NSYSU
Source Sets	NSYSU Electronic Thesis and Dissertation Archive
Language	Cholon
Detected Language	English
Type	text
Format	application/pdf
Source	http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824109-172533
Rights	not_available, Copyright information available at source archive

Page generated in 0.0025 seconds

A Design of English Speech Recognition System

Description

Links & Downloads

Tags

Additional Fields