Low-shot Visual Recognition

Many real-world datasets have a long-tailed distribution, with many samples for some classes and only a few for others. While many deep-learning-based solutions exist for object recognition when hundreds of samples are available, few address the case where only a few samples are available per class. Recognition in the regime where the number of training samples per class is low, ranging from one to a couple of tens of examples, is called low-shot recognition. In this work, we attempt to solve this problem. Our framework is similar to [1]. We use a related dataset with a sufficient number of samples per class (a couple of hundred) to learn representations using a Convolutional Neural Network (CNN). This CNN is then used to extract features from the low-shot samples, on which a classifier is learned. During representation learning, we enforce that the learned representations obey a certain property by using a custom loss function. We believe that when the low-shot samples obey this property, the classification step becomes easier. We show that the proposed solution performs better than the softmax classifier by a good margin. / Master of Science

Identifier	oai:union.ndltd.org:VTETD/oai:vtechworks.lib.vt.edu:10919/73321
Date	24 October 2016
Creators	Pemula, Latha
Contributors	Electrical and Computer Engineering, Batra, Dhruv, Parikh, Devi, Abbott, A. Lynn
Publisher	Virginia Tech
Source Sets	Virginia Tech Theses and Dissertation
Detected Language	English
Type	Thesis
Format	ETD, application/pdf
Rights	In Copyright, http://rightsstatements.org/vocab/InC/1.0/
