Global ETD Search

Return to search

Image Annotation With Semi-supervised Clustering

Image annotation is defined as generating a set of textual words for a given image, learning from the available training data consisting of visual image content and annotation words.

Methods developed for image annotation usually make use of region clustering algorithms to quantize the visual information. Visual codebooks are generated from the region clusters of low level visual features. These codebooks are then, matched with the words of the text document related to the image, in various ways.

In this thesis, we propose a new image annotation technique, which improves the representation and quantization of the visual information by employing the available but unused information, called side information, which is hidden in the system. This side information is used to semi-supervise the clustering process which creates the visterms. The selection of side information depends on the visual image content, the annotation words and the relationship between them. Although there may be many different ways of defining and selecting side information, in this thesis, three types of side information are proposed. The first one is the hidden topic probability information obtained automatically from the text document associated with the image. The second one is the orientation and the third one is the color information around interest points that correspond to critical locations in the image. The side information provides a set of constraints in a semi-supervised K-means region clustering algorithm. Consequently, in generation of the visual terms from the regions, not only low level features are clustered, but also side information is used to complement the visual information,
called visterms. This complementary information is expected to close the semantic gap between the low level features extracted from each region and the high level textual information. Therefore, a better match between visual codebook and the annotation words is obtained. Moreover, a speedup is obtained in the modified K-means algorithm because of the constraints brought by the side information. The proposed algorithm is implemented in a high performance parallel computation environment.

http://etd.lib.metu.edu.tr/upload/3/12611251/index.pdf

QA General 15707

Identifer	oai:union.ndltd.org:METU/oai:etd.lib.metu.edu.tr:http://etd.lib.metu.edu.tr/upload/3/12611251/index.pdf
Date	01 December 2009
Creators	Sayar, Ahmet
Contributors	Yarman Vural, Fatos T.
Publisher	METU
Source Sets	Middle East Technical Univ.
Language	English
Detected Language	English
Type	Ph.D. Thesis
Format	text/pdf
Rights	Access forbidden for 1 year

Page generated in 0.0018 seconds

Image Annotation With Semi-supervised Clustering

Description

Links & Downloads

Tags

Additional Fields