Automatic Extraction and Assessment of Entities from the Web

The search for information about entities, such as people or movies, plays an increasingly important role on the Web. This information is still scattered across many Web pages, which makes it time-consuming for a user to find all relevant information about an entity. This thesis describes techniques to extract entities and information about these entities from the Web, such as facts, opinions, questions and answers, interactive multimedia objects, and events. The thesis finds that it is possible to create a large knowledge base automatically using a manually crafted ontology. After applying assessment algorithms, the precision of the extracted information was found to be between 75 % and 90 % (facts and entities, respectively). The algorithms from this thesis can be used to create such a knowledge base, which can in turn be used in various research fields, such as question answering, named entity recognition, and information retrieval.

entity extraction

fact extraction

named entity recognition

Identifier	oai:union.ndltd.org:DRESDEN/oai:qucosa.de:bsz:14-qucosa-97469
Date	23 October 2012
Creators	Urbansky, David
Contributors	Technische Universität Dresden, Fakultät Informatik, Dr. rer. nat. habil. Dr. h. c. Alexander Schill, Associate Professor Dr. James Thom, Prof. Dr. Michael Schroeder
Publisher	Sächsische Landesbibliothek - Staats- und Universitätsbibliothek Dresden
Source Sets	Hochschulschriftenserver (HSSS) der SLUB Dresden
Language	English
Detected Language	English
Type	doc-type:doctoralThesis
Format	application/pdf
