Global ETD Search

Return to search

Multi-modal Video Ummarization Using Hidden Markov Models For Content-based Multimedia Indexing

This thesis deals with scene level summarization of story-based videos. Two different approaches for story-based video summarization are investigated. The first approach probabilistically models the input video and identifies scene boundaries using the same model. The second approach models scenes and classifies scene types
by evaluating likelihood values of these models. In both approaches, hidden Markov models are used as the probabilistic modeling tools. The first approach also exploits the relationship between video summarization and video production, which is briefly explained, by means of content types. Two content types are defined, dialog driven and action driven content, and the need to define such content types is emonstrated
by simulations. Different content types use different hidden Markov models and
features. The selected model segments input video as a whole. The second approach models scene types. Two types, dialog scene and action scene, are defined with different features and models. The system classifies fixed sized partitions of the video as either of the two scene types, and segments partitions separately according to their scene types. Performance of these two systems are compared against a iv
deterministic video summarization method employing clustering based on visual properties and video structure related rules. Hidden Markov model based video summarization using content types enjoys the highest performance.

http://etd.lib.metu.edu.tr/upload/3/1124734/index.pdf

Identifer	oai:union.ndltd.org:METU/oai:etd.lib.metu.edu.tr:http://etd.lib.metu.edu.tr/upload/3/1124734/index.pdf
Date	01 January 2003
Creators	Yasaroglu, Yagiz
Contributors	Alatan, Aydin
Publisher	METU
Source Sets	Middle East Technical Univ.
Language	English
Detected Language	English
Type	M.S. Thesis
Format	text/pdf
Rights	To liberate the content for public access

Page generated in 0.0023 seconds

Multi-modal Video Ummarization Using Hidden Markov Models For Content-based Multimedia Indexing

Description

Links & Downloads

Tags

Additional Fields