Global ETD Search

Return to search

A Scalable Multimedia Content Processing Framework with Application to TV Shopping

The advent of smart TVs has reshaped the TV-consumer interaction by combining TVs with mobile-like applications and access to the Internet. However, consumers are still unable to seamlessly interact with the contents being streamed. An example of such limitation is TV shopping, in which a consumer makes a purchase of a product or item displayed in the current TV show. Currently, consumers can only stop the current show and attempt to find a similar item in the Web or an actual store. It would be more convenient if the consumer could interact with the TV to purchase interesting items.
Towards the realization of TV shopping, this dissertation proposes a scalable multimedia content processing framework. Two main challenges in TV shopping are addressed: the efficient detection of products in the content stream, and the retrieval of similar products given a consumer-selected product. The proposed framework consists of three components. The first component performs computational and temporal aware multimedia abstraction to select a reduced number of frames that summarize the important information in the video stream. By both reducing the number of frames and taking into account the computational cost of the subsequent detection phase, this component component allows the efficient detection of products in the stream. The second component realizes the detection phase. It executes scalable product detection using multi-cue optimization. Additional information cues are formulated into an optimization problem that allows the detection of complex products, i.e., those that do not have a rigid form and can appear in various poses. After the second component identifies products in the video stream, the consumer can select an interesting one for which similar ones must be located in a product database. To this end, the third component of the framework consists of an efficient, multi-dimensional, tree-based indexing method for multimedia databases. The proposed index mechanism serves as the backbone of the search. Moreover, it is able to efficiently bridge the semantic gap and perception subjectivity issues during the retrieval process to provide more relevant results.

TV Shopping

Multimedia Abstraction

Contextual Object Detection

Multi-dimensional Tree Indexing

Identifer	oai:union.ndltd.org:fiu.edu/oai:digitalcommons.fiu.edu:etd-2553
Date	12 May 2014
Creators	Fleites, Fausto C
Publisher	FIU Digital Commons
Source Sets	Florida International University
Detected Language	English
Type	text
Format	application/pdf
Source	FIU Electronic Theses and Dissertations

Page generated in 0.0023 seconds

A Scalable Multimedia Content Processing Framework with Application to TV Shopping

Description

Links & Downloads

Tags

Additional Fields