
Designing 2D Interfaces For 3D Gesture Retrieval Utilizing Deep Learning

Gesture retrieval is the process of retrieving the correct meaning of a hand movement from a pre-assembled gesture dataset. The purpose of the research discussed here is to design and implement a gesture interface system that facilitates retrieval from an American Sign Language gesture set using a mobile device. The principal challenge is the normalization of 2D gestures generated from the mobile device interface and 3D gestures captured from video samples into a common data structure that can be consumed by deep learning networks. This thesis covers the convolutional neural networks and autoencoders used to transform 2D gestures into the correct form before classification by a convolutional neural network. The architecture and implementation of the front-end and back-end systems, and their respective responsibilities, are discussed. Lastly, this thesis covers the results of the experiment, breaks down the final classification accuracy of 83%, and discusses how this work could be further improved by using depth-based videos for the 3D data.
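The abstract names normalization into a common data structure as the principal challenge. As a minimal sketch of one standard approach to the 2D side of that problem (arc-length resampling of a touch trace to a fixed length, followed by translation and scale normalization), the following is illustrative only and is not the thesis's actual pipeline; the function name and parameters are hypothetical:

```python
import math

def normalize_gesture(points, n_samples=64):
    """Resample a variable-length 2D gesture trace into a fixed-length,
    translation- and scale-normalized point list.

    Illustrative sketch only: the thesis's actual normalization scheme
    (and its common 2D/3D data structure) may differ.
    """
    # Cumulative arc length at each input point.
    arc = [0.0]
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        arc.append(arc[-1] + math.hypot(x1 - x0, y1 - y0))
    total = arc[-1]

    if total == 0:
        # Degenerate stroke (a single tap): repeat the one position.
        resampled = [points[0]] * n_samples
    else:
        # Walk the stroke, placing n_samples points at equal arc-length
        # intervals via linear interpolation within each segment.
        resampled, j = [], 0
        for i in range(n_samples):
            t = total * i / (n_samples - 1)
            while j < len(arc) - 2 and arc[j + 1] < t:
                j += 1
            seg = arc[j + 1] - arc[j]
            a = 0.0 if seg == 0 else (t - arc[j]) / seg
            (x0, y0), (x1, y1) = points[j], points[j + 1]
            resampled.append((x0 + a * (x1 - x0), y0 + a * (y1 - y0)))

    # Translate the centroid to the origin, then scale to unit extent,
    # so position and size on the touchscreen no longer matter.
    cx = sum(x for x, _ in resampled) / n_samples
    cy = sum(y for _, y in resampled) / n_samples
    shifted = [(x - cx, y - cy) for x, y in resampled]
    scale = max(max(abs(x), abs(y)) for x, y in shifted) or 1.0
    return [(x / scale, y / scale) for x, y in shifted]
```

A fixed-length array like this is the kind of structure a convolutional network can consume directly; the 3D video-derived gestures would need an analogous resampling step before both modalities share one input format.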

Identifier: oai:union.ndltd.org:unf.edu/oai:digitalcommons.unf.edu:etd-1820
Date: 01 January 2017
Creators: Southard, Spencer
Publisher: UNF Digital Commons
Source Sets: University of North Florida
Detected Language: English
Type: text
Format: application/pdf
Source: UNF Graduate Theses and Dissertations