Global ETD Search

Return to search

Bounding Box Improvement with Reinforcement Learning

In this thesis, I explore a reinforcement learning technique for improving bounding box localizations of objects in images. The model takes as input a bounding box already known to overlap an object and aims to improve the fit of the box through a series of transformations that shift the location of the box by translation, or change its size or aspect ratio. Over the course of these actions, the model adapts to new information extracted from the image. This active localization approach contrasts with existing bounding-box regression methods, which extract information from the image only once. I implement, train, and test this reinforcement learning model using data taken from the Portland State Dog-Walking image set.
The model balances exploration with exploitation in training using an ε-greedy policy. I find that the performance of the model is sensitive to the ε-greedy configuration used during training, performing best when the epsilon parameter is set to very low values over the course of training. With = 0.01, I find the algorithm can improve bounding boxes in about 78% of test cases for the "dog" object category, and 76% for the "human" category.

Reinforcement learning

Machine learning

Computer vision

Computer Sciences

Identifer	oai:union.ndltd.org:pdx.edu/oai:pdxscholar.library.pdx.edu:open_access_etds-5509
Date	12 June 2018
Creators	Cleland, Andrew Lewis
Publisher	PDXScholar
Source Sets	Portland State University
Detected Language	English
Type	text
Format	application/pdf
Source	Dissertations and Theses

Page generated in 0.021 seconds

Bounding Box Improvement with Reinforcement Learning

Description

Links & Downloads

Tags

Additional Fields