Spelling suggestions: "subject:"ctructural similarity"" "subject:"1structural similarity""
1 |
Complex-Wavelet Structural Similarity Based Image ClassificationGao, Yang January 2012 (has links)
Complex wavelet structural similarity (CW-SSIM) index has been recognized as a novel image similarity measure of broad potential applications due to its robustness to small geometric distortions such as translation, scaling and rotation of images. Nevertheless, how to make the best use of it in image classification problems has not been deeply investi- gated. In this study, we introduce a series of novel image classification algorithms based on CW-SSIM and use handwritten digit and face image recognition as examples for demonstration, including CW-SSIM based nearest neighbor method, CW-SSIM based k means method, CW-SSIM based support vector machine method (SVM) and CW-SSIM based SVM using affinity propagation. Among the proposed approaches, the best compromise between accuracy and complexity is obtained by the CW-SSIM support vector machine algorithm, which combines an unsupervised clustering method to divide the training images into clusters with representative images and a supervised learning method based on support vector machines to maximize the classification accuracy. Our experiments show that such a conceptually simple image classification method, which does not involve any registration, intensity normalization or sophisticated feature extraction processes, and does not rely on any modeling of the image patterns or distortion processes, achieves competitive performance with reduced computational cost.
|
2 |
DEVELOPMENTAL FMRI STUDY: FACE AND OBJECT RECOGNITIONGathers, Ann D. 01 January 2005 (has links)
Visual processing, though seemingly automatic, is complex. Typical humansprocess objects and faces routinely. Yet, when a disease or disorder disrupts face andobject recognition, the effects are profound. Because of its importance and complexity,visual processing has been the subject of many adult functional imaging studies.However, relatively little is known about the development of the neural organization andunderlying cognitive mechanisms of face and object recognition. The current projectused functional magnetic resonance imaging (fMRI) to identify maturational changes inthe neural substrates of face and object recognition in 5-8 year olds, 9-11 year olds, andadults. A passive face and object viewing task revealed cortical shifts in the faceresponsiveloci of the ventral processing stream (VPS), an inferior occipito-temporalregion known to function in higher visual processing. Older children and adults recruitedmore anterior regions of the ventral processing stream than younger children. Toinvestigate the potential cognitive basis for these developmental changes, researchersimplemented a shape-matching task with parametric variations of shape overlap,structural similarity (SS), in stimulus pairs. VPS regions sensitive to high SS emerged inolder children and adults. Younger children recruited no structurally-sensitive regions inthe VPS. Two right hemisphere VPS regions were sensitive to maturational changes inSS. A comparison of face-responsive regions from the passive viewing task and the VPSSS regions did not reveal overlap. Though SS drives organization of the VPS, it did notexplain the cortical shifts in the neural substrates for face processing. In addition to VPSregions, results indicated additional maturational SS changes in frontal, parietal, andcerebellar regions. Based on these findings, further analyses were conducted to quantifyand qualify maturational changes in face and object processing throughout the brain.Results indicated developmental changes in activation extent, signal magnitude, andlateralization of face and object recognition networks. Collectively, this project supportsa developmental change in visual processing between 5-8 years and 9-11 years of age.Chapters Four through Six provide an in-depth discussion of the implications of thesefindings.
|
3 |
Complex-Wavelet Structural Similarity Based Image ClassificationGao, Yang January 2012 (has links)
Complex wavelet structural similarity (CW-SSIM) index has been recognized as a novel image similarity measure of broad potential applications due to its robustness to small geometric distortions such as translation, scaling and rotation of images. Nevertheless, how to make the best use of it in image classification problems has not been deeply investi- gated. In this study, we introduce a series of novel image classification algorithms based on CW-SSIM and use handwritten digit and face image recognition as examples for demonstration, including CW-SSIM based nearest neighbor method, CW-SSIM based k means method, CW-SSIM based support vector machine method (SVM) and CW-SSIM based SVM using affinity propagation. Among the proposed approaches, the best compromise between accuracy and complexity is obtained by the CW-SSIM support vector machine algorithm, which combines an unsupervised clustering method to divide the training images into clusters with representative images and a supervised learning method based on support vector machines to maximize the classification accuracy. Our experiments show that such a conceptually simple image classification method, which does not involve any registration, intensity normalization or sophisticated feature extraction processes, and does not rely on any modeling of the image patterns or distortion processes, achieves competitive performance with reduced computational cost.
|
4 |
2D SPECTRAL SUBTRACTION FOR NOISE SUPPRESSION IN FINGERPRINT IMAGESDandu, Sai Venkata Satya Siva Kumar, Kadimisetti, Sujit January 2017 (has links)
Human fingerprints are rich in details called the minutiae, which can be used as identification marks for fingerprint verification. To get the details, the fingerprint capturing techniques are to be improved. Since when we the fingerprint is captured, the noise from outside adds to it. The goal of this thesis is to remove the noise present in the fingerprint image. To achieve a good quality fingerprint image, this noise has to be removed or suppressed and here it is done by using an algorithm or technique called ’Spectral Subtraction’, where the algorithm is based on subtraction of estimated noise spectrum from noisy signal spectrum. The performance of the algorithm is assessed by comparing the original fingerprint image and image obtained after spectral subtraction several parameters like PSNR, SSIM and also for different fingerprints on the database. Finally, performance matching was done using NIST matching software, and the obtained results were presented in the form of Receiver Operating Characteristics (ROC)graphs, using MATLAB, and the experimental results were presented.
|
5 |
The mind as a predictive modelling engine : generative models, structural similarity, and mental representationWilliams, Daniel George January 2018 (has links)
I outline and defend a theory of mental representation based on three ideas that I extract from the work of the mid-twentieth century philosopher, psychologist, and cybernetician Kenneth Craik: first, an account of mental representation in terms of idealised models that capitalize on structural similarity to their targets; second, an appreciation of prediction as the core function of such models; and third, a regulatory understanding of brain function. I clarify and elaborate on each of these ideas, relate them to contemporary advances in neuroscience and machine learning, and favourably contrast a predictive model-based theory of mental representation with other prominent accounts of the nature, importance, and functions of mental representations in cognitive science and philosophy.
|
6 |
Digital Video Watermarking Robust to Geometric Attacks and CompressionsLiu, Yan 03 October 2011 (has links)
This thesis focuses on video watermarking robust against geometric attacks and
video compressions. In addition to the requirements for an image watermarking algorithm,
a digital video watermarking algorithm has to be robust against advanced
video compressions, frame loss, frame swapping, aspect ratio change, frame rate change,
intra- and inter-frame filtering, etc. Video compression, especially, the most efficient
compression standard, H.264, and geometric attacks, such as rotation and cropping,
frame aspect ratio change, and translation, are considered the most challenging attacks
for video watermarking algorithms.
In this thesis, we first review typical watermarking algorithms robust against geometric
attacks and video compressions, and point out their advantages and disadvantages.
Then, we propose our robust video watermarking algorithms against Rotation,
Scaling and Translation (RST) attacks and MPEG-2 compression based on the logpolar
mapping and the phase-only filtering method. Rotation or scaling transformation
in the spatial domain results in vertical or horizontal shift in the log-polar mapping
(LPM) of the magnitude of the Fourier spectrum of the target frame. Translation has
no effect in this domain. This method is very robust to RST attacks and MPEG-2
compression. We also demonstrate that this method can be used as a RST parameters
detector to work with other watermarking algorithms to improve their robustness to
RST attacks.
Furthermore, we propose a new video watermarking algorithm based on the 1D
DFT (one-dimensional Discrete Fourier Transform) and 1D projection. This algorithm
enhances the robustness to video compression and is able to resist the most advanced video compression, H.264. The 1D DFT for a video sequence along the temporal domain
generates an ideal domain, in which the spatial information is still kept and the
temporal information is obtained. With detailed analysis and calculation, we choose
the frames with highest temporal frequencies to embed the fence-shaped watermark
pattern in the Radon transform domain of the selected frames. The performance of the
proposed algorithm is evaluated by video compression standards MPEG-2 and H.264;
geometric attacks such as rotation, translation, and aspect-ratio changes; and other
video processing. The most important advantages of this video watermarking algorithm
are its simplicity, practicality and robustness.
|
7 |
Digital Video Watermarking Robust to Geometric Attacks and CompressionsLiu, Yan 03 October 2011 (has links)
This thesis focuses on video watermarking robust against geometric attacks and
video compressions. In addition to the requirements for an image watermarking algorithm,
a digital video watermarking algorithm has to be robust against advanced
video compressions, frame loss, frame swapping, aspect ratio change, frame rate change,
intra- and inter-frame filtering, etc. Video compression, especially, the most efficient
compression standard, H.264, and geometric attacks, such as rotation and cropping,
frame aspect ratio change, and translation, are considered the most challenging attacks
for video watermarking algorithms.
In this thesis, we first review typical watermarking algorithms robust against geometric
attacks and video compressions, and point out their advantages and disadvantages.
Then, we propose our robust video watermarking algorithms against Rotation,
Scaling and Translation (RST) attacks and MPEG-2 compression based on the logpolar
mapping and the phase-only filtering method. Rotation or scaling transformation
in the spatial domain results in vertical or horizontal shift in the log-polar mapping
(LPM) of the magnitude of the Fourier spectrum of the target frame. Translation has
no effect in this domain. This method is very robust to RST attacks and MPEG-2
compression. We also demonstrate that this method can be used as a RST parameters
detector to work with other watermarking algorithms to improve their robustness to
RST attacks.
Furthermore, we propose a new video watermarking algorithm based on the 1D
DFT (one-dimensional Discrete Fourier Transform) and 1D projection. This algorithm
enhances the robustness to video compression and is able to resist the most advanced video compression, H.264. The 1D DFT for a video sequence along the temporal domain
generates an ideal domain, in which the spatial information is still kept and the
temporal information is obtained. With detailed analysis and calculation, we choose
the frames with highest temporal frequencies to embed the fence-shaped watermark
pattern in the Radon transform domain of the selected frames. The performance of the
proposed algorithm is evaluated by video compression standards MPEG-2 and H.264;
geometric attacks such as rotation, translation, and aspect-ratio changes; and other
video processing. The most important advantages of this video watermarking algorithm
are its simplicity, practicality and robustness.
|
8 |
A Study of the Structural Similarity Image Quality Measure with Applications to Image ProcessingBrunet, Dominique 02 August 2012 (has links)
Since its introduction in 2004, the Structural Similarity (SSIM) index has gained widespread popularity as an image quality assessment measure. SSIM is currently recognized to be one of the most powerful methods of assessing the visual closeness of images. That being said, the Mean Squared Error (MSE), which performs very poorly from a perceptual point of view, still remains the most common optimization criterion in image processing applications because of its relative simplicity along with a number of other properties that are deemed important. In this thesis, some necessary tools to assist in the design of SSIM-optimal algorithms are developed. This work combines theoretical developments with experimental research and practical algorithms.
The description of the mathematical properties of the SSIM index represents the principal theoretical achievement in this thesis. Indeed, it is demonstrated how the SSIM index can be transformed into a distance metric. Local convexity, quasi-convexity, symmetries and invariance properties are also proved. The study of the SSIM index is also generalized to a family of metrics called normalized (or M-relative) metrics.
Various analytical techniques for different kinds of SSIM-based optimization are then devised. For example, the best approximation according to the SSIM is described for orthogonal and redundant basis sets. SSIM-geodesic paths with arclength parameterization are also traced between images. Finally, formulas for SSIM-optimal point estimators are obtained.
On the experimental side of the research, the structural self-similarity of images is studied. This leads to the confirmation of the hypothesis that the main source of self-similarity of images lies in their regions of low variance.
On the practical side, an implementation of local statistical tests on the image residual is proposed for the assessment of denoised images. Also, heuristic estimations of the SSIM index and the MSE are developed.
The research performed in this thesis should lead to the development of state-of-the-art image denoising algorithms. A better comprehension of the mathematical properties of the SSIM index represents another step toward the replacement of the MSE with SSIM in image processing applications.
|
9 |
Selecting Web Services by Problem SimilarityYan, Shih-hua 11 February 2009 (has links)
The recent development of the service-oriented architecture (SOA) has provided an opportunity to apply this new technology to support model management. This is particularly critical when more and more decision models are delivered as web services. A web-services-based approach to model management is useful in providing effective decision support.
When a decision model is implemented as a web service, it is called a model-based web service. In model management, selecting a proper model-based web service is an important issue. Most current research on selecting such web service relies on matching inputs and outputs of the model, which is oversimplified. The incorporation of more semantic knowledge may be necessary to make the selection of model-based web services more effective.
In this research, we propose a new mechanism that represents the semantics associated with a problem and then use the similarity of semantic information between a new problem description and existing web services to find the most suitable web services for solving the new problem. The paper defines the concept of entity similarity, attribute similarity, and functional similarity for problem matching. The web service that has the highest similarity is chosen as a base for constructing the new web services. The identified mapping is converted into BPEL4WS codes for utilizing the web services. To verify the feasibility of the proposed method, a prototype system has been implemented in JAVA.
|
10 |
Spatial pooling strategies for image quality assessmentMoorthy, Anush Krishna 03 September 2009 (has links)
Recent image quality assessment (IQA) metrics achieve high correlation with human perception of image quality. Naturally, it is of interest to produce even better results. One promising method is to weight image quality measurements by visual importance. To this end, we describe three strategies - visual fixation-based weighting, quality-based weighting and weighting based on distribution of local quality scores about the mean. By contrast with some prior studies we find that these strategies can improve the correlations with subjective judgment significantly. We demonstrate improvements on the SSIM index in both its multi-scale and single-scale versions, using the LIVE database as a test-bed. / text
|
Page generated in 0.0723 seconds