Global ETD Search

Korektor anglické gramatiky: určité a neurčité členy / English grammar checker and corrector: the determiners

Correction of articles in English texts is approached as an article generation task, i.e. each noun phrase is assigned a class corresponding to the definite, indefinite or zero article. Supervised machine learning methods are used to first replicate and then improve upon the best result reported in the literature known to the author. Through feature engineering and a different choice of learning method, a drop in error of about 34% is achieved. The resulting model is further compared to the performance of expert annotators. Although the comparison is not straightforward due to differences in the data, the results indicate that the trained model performs comparably to humans when measured on in-domain data. On the other hand, the model does not generalize well to different types of data. Using a large-scale language model to predict an article (or no article) for each word of the text has not proved successful.
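
The framing in the abstract, where each noun phrase receives one of three article classes, can be illustrated with a minimal Python sketch. This is not the thesis's code. The function names, the class labels, and the reading of the "drop in error" as a relative reduction are all assumptions made for illustration only.

```python
# Hypothetical illustration of the article-generation framing; names and
# labels are assumptions, not taken from the thesis.

ARTICLE_CLASSES = {"the": "definite", "a": "indefinite", "an": "indefinite"}


def article_class(noun_phrase_tokens):
    """Return the gold class of a tokenized noun phrase from its first token."""
    if not noun_phrase_tokens:
        raise ValueError("empty noun phrase")
    first = noun_phrase_tokens[0].lower()
    # Any noun phrase that does not start with an article is the "zero" class.
    return ARTICLE_CLASSES.get(first, "zero")


def strip_article(noun_phrase_tokens):
    """Remove a leading article so a classifier sees only the remaining words."""
    if article_class(noun_phrase_tokens) == "zero":
        return list(noun_phrase_tokens)
    return list(noun_phrase_tokens[1:])


def relative_error_reduction(baseline_error, new_error):
    """Fraction of the baseline error removed by the new model (assumed metric)."""
    if baseline_error <= 0:
        raise ValueError("baseline_error must be positive")
    return (baseline_error - new_error) / baseline_error
```

In this sketch, a training example would pair the output of `strip_article` (plus whatever context features are engineered) with the label from `article_class`; a supervised learner is then trained to predict the label.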

http://www.nusl.cz/ntk/nusl-355992

Identifier	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:355992
Date	January 2017
Creators	Auersperger, Michal
Contributors	Pecina, Pavel; Straňák, Pavel
Source Sets	Czech ETDs
Language	English
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess
