Towards Generalizable Information Extraction with Limited Supervision

dc.contributor.author: Wang, Sijia (en)
dc.contributor.committeechair: Huang, Lifu (en)
dc.contributor.committeemember: Zhou, Dawei (en)
dc.contributor.committeemember: Reddy, Chandan K. (en)
dc.contributor.committeemember: Wang, Xuan (en)
dc.contributor.committeemember: Yu, Mo (en)
dc.contributor.committeemember: Lourentzou, Ismini (en)
dc.contributor.department: Computer Science & Applications (en)
dc.date.accessioned: 2024-09-19T08:00:12Z (en)
dc.date.available: 2024-09-19T08:00:12Z (en)
dc.date.issued: 2024-09-18 (en)
dc.description.abstract: Supervised approaches, especially those employing deep neural networks, have showcased impressive performance, relying on a significant volume of manual annotations. However, their effectiveness encounters challenges when attempting to generalize to new languages, domains, or types, particularly in the absence of sufficient annotations. Current methods fall short in effectively addressing information extraction (IE) under limited supervision. In this dissertation, we approach information extraction with limited supervision from three perspectives. Firstly, we refine the previous classification-based extraction paradigm by introducing a query-and-extract framework, which uses target information as natural language queries to extract candidate information from the input text. Additionally, we leverage the excellent generation capability of large language models (LLMs) to produce high-quality annotation data, enriching IE semantics within limited annotation data. We also utilize LLMs' instruction-following capability to iteratively refine and optimize solutions through a debating process. Beyond text-only IE, we define a new multimodal IE task that links an entity mention within heterogeneous information sources to a knowledge base with limited annotation data. We demonstrate that excellent multimodal IE performance can be achieved, even with limited annotation data, by leveraging monomodal external information. These combined efforts aim to make optimal use of limited knowledge, ensuring more robust and generalizable solutions. (en)
dc.description.abstractgeneral: This dissertation explores the development of information extraction (IE) algorithms and systems that work effectively with limited supervision. Information extraction is a complex and challenging task that involves extracting structured data from plain text. Traditional IE systems are often tailored to specific tasks and domains where ample annotated data is available, limiting their ability to adapt to new domains. This research focuses on developing IE systems that can generalize to new domains with limited supervision, reducing the reliance on extensive annotations. The proposed solutions demonstrate the potential to transfer knowledge from existing annotations to new tasks and domains, emphasizing the importance of learning from limited data and improving knowledge transfer to previously unknown domains. (en)
dc.description.degree: Doctor of Philosophy (en)
dc.format.medium: ETD (en)
dc.identifier.other: vt_gsexam:41269 (en)
dc.identifier.uri: https://hdl.handle.net/10919/121157 (en)
dc.language.iso: en (en)
dc.publisher: Virginia Tech (en)
dc.rights: In Copyright (en)
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/ (en)
dc.subject: Information Extraction (en)
dc.subject: Limited Supervision (en)
dc.subject: Event Extraction (en)
dc.subject: Entity Linking (en)
dc.title: Towards Generalizable Information Extraction with Limited Supervision (en)
dc.type: Dissertation (en)
thesis.degree.discipline: Computer Science & Applications (en)
thesis.degree.grantor: Virginia Polytechnic Institute and State University (en)
thesis.degree.level: doctoral (en)
thesis.degree.name: Doctor of Philosophy (en)

Files

Original bundle
Name: Wang_S_D_2024.pdf
Size: 8.4 MB
Format: Adobe Portable Document Format

Name: Wang_S_D_2024_support_1.docx
Size: 13.96 KB
Format: Microsoft Word XML
Description: Supporting documents