Global ETD Search

Return to search

The "What"-"Where" Network: A Tool for One-Shot Image Recognition and Localization

One common shortcoming of modern computer vision is the inability of most models to generalize to new classes—one/few shot image recognition. We propose a new problem formulation for this task and present a network architecture and training methodology to solve this task. Further, we provide insights into how careful focus on how not just the data, but the way data presented to the model can have significant impact on performance. Using these method, we achieve high accuracy in few-shot image recognition tasks.

computer vision

semantic segmentation

few-shot learning

one-shot learning

embedding

Physical Sciences and Mathematics

Identifer	oai:union.ndltd.org:BGMYU2/oai:scholarsarchive.byu.edu:etd-10375
Date	06 January 2021
Creators	Hurlburt, Daniel
Publisher	BYU ScholarsArchive
Source Sets	Brigham Young University
Detected Language	English
Type	text
Format	application/pdf
Source	Theses and Dissertations
Rights	https://lib.byu.edu/about/copyright/

Page generated in 0.0019 seconds

The "What"-"Where" Network: A Tool for One-Shot Image Recognition and Localization

Description

Links & Downloads

Tags

Additional Fields