Global ETD Search

Return to search

Im2Vid: Future Video Prediction for Static Image Action Recognition

Static image action recognition aims at identifying the action performed in a given image. Most existing static image action recognition approaches use high-level cues present in the image such as objects, object human interaction, or human pose to better capture the action performed. Unlike images, videos have temporal information that greatly improves action recognition by resolving potential ambiguity. We propose to leverage a large amount of readily available unlabeled videos to transfer the temporal information from video domain to static image domain and hence improve static image action recognition. Specifically, We propose a video prediction model to predict the future video of a static image and use the future predicted video to improve static image action recognition. Our experimental results on four datasets validate that the idea of transferring the temporal information from videos to static images is promising, and can enhance static image action recognition performance. / Master of Science

Human Action Recognition

Static Image Action Recognition

Video Action Recognition

Future Video Prediction

Identifer	oai:union.ndltd.org:VTETD/oai:vtechworks.lib.vt.edu:10919/83602
Date	20 June 2018
Creators	AlBahar, Badour A Sh A.
Contributors	Electrical and Computer Engineering, Huang, Jia-Bin, Tokekar, Pratap, Abbott, A. Lynn
Publisher	Virginia Tech
Source Sets	Virginia Tech Theses and Dissertation
Detected Language	English
Type	Thesis
Format	ETD, application/pdf
Rights	In Copyright, http://rightsstatements.org/vocab/InC/1.0/

Page generated in 0.0029 seconds

Im2Vid: Future Video Prediction for Static Image Action Recognition

Description

Links & Downloads

Tags

Additional Fields