Global ETD Search

Return to search

Flexible RDF data extraction from Wiktionary - Leveraging the power of community build linguistic wikis

We present a declarative approach implemented in a comprehensive opensource
framework (based on DBpedia) to extract lexical-semantic resources (an ontology about language use) from Wiktionary. The data currently includes language, part of speech, senses, definitions, synonyms, taxonomies (hyponyms, hyperonyms, synonyms, antonyms) and translations for each lexical word. Main focus is on flexibility to the loose schema and configurability towards differing language-editions ofWiktionary. This is achieved by a declarative mediator/wrapper approach. The goal is, to allow the addition of languages just by configuration without the need of programming, thus enabling the swift and resource-conserving adaptation of wrappers by domain experts. The extracted data is as fine granular as the source data in Wiktionary and additionally follows the lemon model. It enables use cases like disambiguation or machine translation. By offering a linked data service, we hope to extend DBpedia’s central role in the LOD infrastructure to the world of Open Linguistics.
I

info:eu-repo/classification/ddc/000

ddc:000

Identifer	oai:union.ndltd.org:DRESDEN/oai:qucosa:de:qucosa:17215
Date	26 February 2018
Creators	Brekle, Jonas
Contributors	Fähnrich, Klaus-Peter, Universität Leipzig
Source Sets	Hochschulschriftenserver (HSSS) der SLUB Dresden
Language	English
Detected Language	English
Type	info:eu-repo/semantics/publishedVersion, doc-type:masterThesis, info:eu-repo/semantics/masterThesis, doc-type:Text
Rights	info:eu-repo/semantics/openAccess
Relation	urn:nbn:de:bsz:15-qucosa2-163403, qucosa:16340

Page generated in 0.0012 seconds

Flexible RDF data extraction from Wiktionary - Leveraging the power of community build linguistic wikis

Description

Links & Downloads

Tags

Additional Fields