Use of ontologies in information extraction

xiii, 149 p. : ill. (some col.)

Information extraction (IE) aims to recognize and retrieve certain types of information from natural language text. For instance, an information extraction system may extract key geopolitical indicators about countries from a set of web pages while ignoring other types of information. IE has existed as a research field for a few decades, and ontology-based information extraction (OBIE) has recently emerged as one of its subfields. Here, the general idea is to use ontologies, which provide formal and explicit specifications of shared conceptualizations, to guide the information extraction process. This dissertation presents two novel directions for ontology-based information extraction in which ontologies are used to improve the information extraction process.

First, I describe how ontologies enable a component-based approach to information extraction. A key idea in this approach is identifying components of information extraction systems that make extractions with respect to specific ontological concepts. These components are termed "information extractors". The component-based approach explores how information extractors, as well as other types of components, can be used in developing information extraction systems. This approach has the potential to contribute significantly to the widespread usage and commercialization of information extraction.

Second, I describe how an ontology-based information extraction system can make use of multiple ontologies. Almost all previous systems use a single ontology, although multiple ontologies are available for most domains. Using multiple ontologies in information extraction has the potential to extract more information from text and thus lead to an improvement in performance measures. The concept of an information extractor, conceived in the component-based approach, is used in designing the principles for accommodating multiple ontologies in an ontology-based information extraction system.

Committee in charge: Dr. Dejing Dou, Chair;
Dr. Arthur Farley, Member;
Dr. Michal Young, Member;
Dr. Monte Westerfield, Outside Member

http://hdl.handle.net/1794/11216

Information extraction

Ontologies (Information retrieval)

Software components

Computer science

Identifier	oai:union.ndltd.org:uoregon.edu/oai:scholarsbank.uoregon.edu:1794/11216
Date	03 1900
Creators	Wimalasuriya, Daya Chinthana
Publisher	University of Oregon
Source Sets	University of Oregon
Language	en_US
Detected Language	English
Type	Thesis
Relation	University of Oregon theses, Dept. of Computer and Information Science, Ph. D., 2011;
