Global ETD Search

Return to search

Be More with Less: Scaling Deep-learning with Minimal Supervision

<p> </p>
<p>Large-scale deep learning models have reached previously unattainable performance for various tasks. However, the ever-growing resource consumption of neural networks generates large carbon footprint, brings difficulty for academics to engage in research and stops emerging economies from enjoying growing Artificial Intelligence (AI) benefits. To further scale AI to bring more benefits, two major challenges need to be solved. Firstly, even though large-scale deep learning models achieved remarkable success, their performance is still not satisfactory when fine-tuning with only a handful of examples, thereby hindering widespread adoption in real-world applications where a large scale of labeled data is difficult to obtain. Secondly, current machine learning models are still mainly designed for tasks in closed environments where testing datasets are highly similar to training datasets. When the deployed datasets have distribution shift relative to collected training data, we generally observe degraded performance of developed models. How to build adaptable models becomes another critical challenge. To address those challenges, in this dissertation, we focus on two topics: few-shot learning and domain adaptation, where few-shot learning aims to learn tasks with limited labeled data and domain adaption address the discrepancy between training data and testing data. In Part 1, we show our few-shot learning studies. The proposed few-shot solutions are built upon large-scale language models with evolutionary explorations from improving supervision signals, incorporating unlabeled data and improving few-shot learning abilities with lightweight fine-tuning design to reduce deployment costs. In Part 2, domain adaptation studies are introduced. We develop a progressive series of domain adaption approaches to transfer knowledge across domains efficiently to handle distribution shifts, including capturing common patterns across domains, adaptation with weak supervision and adaption to thousands of domains with limited labeled data and unlabeled data. </p>

10.25394/pgs.19678611.v1

Computer Engineering

Applied Computer Science

Pattern Recognition and Data Mining

Minimally-supervised Learning

Semi-supervised Learning

Data Mining

Deep Learning

Fake News Detection

Natural Language Processing

Domain Adaptation

Identifer	oai:union.ndltd.org:purdue.edu/oai:figshare.com:article/19678611
Date	28 April 2022
Creators	Yaqing Wang (12470301)
Source Sets	Purdue University
Detected Language	English
Type	Text, Thesis
Rights	CC BY 4.0
Relation	https://figshare.com/articles/thesis/Be_More_with_Less_Scaling_Deep-learning_with_Minimal_Supervision/19678611

Page generated in 0.0019 seconds

Be More with Less: Scaling Deep-learning with Minimal Supervision

Description

Links & Downloads

Tags

Additional Fields