We develop a framework that uses visual attention analysis combined with temporal
coherence to detect the attended region from a H.264 video bitstream, and display it on
a small screen. A visual attention module based upon Walther and Koch's model gives us
the attended region in I-frames. We propose a temporal coherence matching framework that
uses the motion information in P-frames to extend the attended region over the H.264
video sequence. Evaluations show encouraging results with over 80% successful detection rate for objects of interest, and 85% respondents claiming satisfactory output.
Identifer | oai:union.ndltd.org:LACETR/oai:collectionscanada.gc.ca:OWTU.10012/3929 |
Date | January 2008 |
Creators | Mukherjee, Abir |
Source Sets | Library and Archives Canada ETDs Repository / Centre d'archives des thèses électroniques de Bibliothèque et Archives Canada |
Language | English |
Detected Language | English |
Type | Thesis or Dissertation |
Page generated in 0.015 seconds