Global ETD Search

Return to search

Implementation och utvärdering av termlänkare i Java

Aligning parallell terms in a parallell corpus can be done by aligning all words and phrases in the corpus and then performing term extraction on the aligned set of word pairs. Alternatively, term extraction in the source and target text can be made separately and then the resulting term candidates can be aligned, forming aligned parallell terms. This thesis describes an implementation of a word aligner that is applied on extracted term candidates in both the source and the target texts. The term aligner uses statistical measures, the tool Giza++ and heuristics in the search for alignments. The evaluation reveals that the best results are obtained when the term alignment relies heavily on the Giza++ tool and Levenshtein heuristic.

http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-92732

termlänkning

giza++

Identifer	oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:liu-92732
Date	January 2013
Creators	Axelsson, Robin
Publisher	Linköpings universitet, Interaktiva och kognitiva system, Linköpings universitet, Tekniska högskolan
Source Sets	DiVA Archive at Upsalla University
Language	Swedish
Detected Language	English
Type	Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format	application/pdf
Rights	info:eu-repo/semantics/openAccess

Page generated in 0.0018 seconds

Implementation och utvärdering av termlänkare i Java

Description

Links & Downloads

Tags

Additional Fields