Global ETD Search

Return to search

Corpus-Based Techniques for Word Sense Disambiguation

The need for robust and easily extensible systems for word sense disambiguation coupled with successes in training systems for a variety of tasks using large on-line corpora has led to extensive research into corpus-based statistical approaches to this problem. Promising results have been achieved by vector space representations of context, clustering combined with a semantic knowledge base, and decision lists based on collocational relations. We evaluate these techniques with respect to three important criteria: how their definition of context affects their ability to incorporate different types of disambiguating information, how they define similarity among senses, and how easily they can generalize to new senses. The strengths and weaknesses of these systems provide guidance for future systems which must capture and model a variety of disambiguating information, both syntactic and semantic.

MIT

Artificial Intelligence

NLP

Word Sense Disambiguation

Identifer	oai:union.ndltd.org:MIT/oai:dspace.mit.edu:1721.1/5934
Date	27 May 1998
Creators	Levow, Gina-Anne
Source Sets	M.I.T. Theses and Dissertation
Language	en_US
Detected Language	English
Format	20 p., 216242 bytes, 338116 bytes, application/postscript, application/pdf
Relation	AIM-1637

Page generated in 0.0019 seconds

Corpus-Based Techniques for Word Sense Disambiguation

Description

Links & Downloads

Tags

Additional Fields