Supervised approaches, especially those employing deep neural networks, have showcased impressive performance, relying on a significant volume of manual annotations. However, their effectiveness encounters challenges when attempting to generalize to new languages, domains, or types, particularly in the absence of sufficient annotations. Current methods fall short in effectively addressing information extraction (IE) under limited supervision. In this dissertation, we approach information extraction with limited supervision from three perspectives. Firstly, we refine the previous classification-based extraction paradigm by introducing a query-and-extract framework, which uses target information as natural language queries to extract candidate information from the input text. Additionally, we leverage the excellent generation capability of large language models (LLMs) to produce high-quality annotation data, enriching IE semantics within limited annotation data. We also utilize LLMs' instruction-following capability to iteratively refine and optimize solutions through a debating process. Beyond text-only IE, we define a new multimodal IE task that links an entity mention within heterogeneous information sources to a knowledge base with limited annotation data. We demonstrate that excellent multimodal IE performance can be achieved, even with limited annotation data, by leveraging monomodal external information. These combined efforts aim to make optimal use of limited knowledge, ensuring more robust and generalizable solutions. / Doctor of Philosophy / This dissertation explores the development of information extraction (IE) algorithms and systems that work effectively with limited supervision. Information extraction is a complex and challenging task that involves extracting structured data from plain text. Traditional IE systems are often tailored to specific tasks and domains where ample annotated data is available, limiting their ability to adapt to new domains. This research focuses on developing IE systems that can generalize to new domains with limited supervision, reducing the reliance on extensive annotations. The proposed solutions demonstrate the potential to transfer knowledge from existing annotations to new tasks and domains, emphasizing the importance of learning from limited data and improving knowledge transfer to previously unknown domains.
Identifer | oai:union.ndltd.org:VTETD/oai:vtechworks.lib.vt.edu:10919/121157 |
Date | 18 September 2024 |
Creators | Wang, Sijia |
Contributors | Computer Science and#38; Applications, Huang, Lifu, Zhou, Dawei, Reddy, Chandan K., Wang, Xuan, Yu, Mo, Lourentzou, Ismini |
Publisher | Virginia Tech |
Source Sets | Virginia Tech Theses and Dissertation |
Language | English |
Detected Language | English |
Type | Dissertation |
Format | ETD, application/pdf, application/vnd.openxmlformats-officedocument.wordprocessingml.document |
Rights | In Copyright, http://rightsstatements.org/vocab/InC/1.0/ |
Page generated in 0.0024 seconds