We develop a framework that uses visual attention analysis combined with temporal
coherence to detect the attended region from a H.264 video bitstream, and display it on
a small screen. A visual attention module based upon Walther and Koch's model gives us
the attended region in I-frames. We propose a temporal coherence matching framework that
uses the motion information in P-frames to extend the attended region over the H.264
video sequence. Evaluations show encouraging results with over 80% successful detection rate for objects of interest, and 85% respondents claiming satisfactory output.
Identifer | oai:union.ndltd.org:WATERLOO/oai:uwspace.uwaterloo.ca:10012/3929 |
Date | January 2008 |
Creators | Mukherjee, Abir |
Source Sets | University of Waterloo Electronic Theses Repository |
Language | English |
Detected Language | English |
Type | Thesis or Dissertation |
Page generated in 0.0347 seconds