Global ETD Search

Return to search

Chinese input method based on reduced phonetic transcription

In this paper, we investigate a highly efficient input method in Chinese. In the traditional
Mandarin phonetic input method, users have to input the complete Mandarin phonetic symbol.
The proposed new Chinese input method is which transforms the first Mandarin phonetic
symbol sequence to character sequence. Users only have to input the first Mandarin phonetic
symbol. Users input first Mandarin phonetic symbol and follow the input rule that spaces are
inserted between the words. The system outputs the candidate character sequence hypotheses.
Bigram model is used to describe the relation between words. We use the dynamic programing
for decoding. We estimate the feasibility for our new Chinese input method and estimate the
Stanford segmenter. In the experiment, we estimate the Standford Segmenter works on the
simplified Chinese and Traditional Chinese firstly. We observe that the precision and recall on
the simplified Chinese are 84.52% and 85.20% which is better than works on the Traditional
Chinese 68.43% and 63.43%. And we estimate system efficiency based on language model
that trained by WIKI corpus and ASBC corpus separately. The sentence and word accuracy
for the ASBC corpus are 39.8% and 70.3%. And the word and character accuracy for WIKI
corpus are 20.3% and 53.3%. Finally we estimate the number of candidate hypotheses. The
research shows the 10 hypotheses and 20 hypotheses the sentence accuracy are closed.

http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0522112-103522

Identifer	oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0522112-103522
Date	22 May 2012
Creators	Hsu, Feng-Ho
Contributors	Chung-Hsien Wu, Liang-Chih Yu, Chia Ping Chen, Hsin-Min Wang
Publisher	NSYSU
Source Sets	NSYSU Electronic Thesis and Dissertation Archive
Language	Cholon
Detected Language	English
Type	text
Format	application/pdf
Source	http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0522112-103522
Rights	user_define, Copyright information available at source archive

Page generated in 0.002 seconds

Chinese input method based on reduced phonetic transcription

Description

Links & Downloads

Tags

Additional Fields