Global ETD Search

11	Saliency-weighted graphs for efficient visual content description and their applications in real-time image retrieval systems Ahmad, J., Sajjad, M., Mehmood, Irfan, Rho, S., Baik, S.W. 18 July 2019 (has links) Yes / The exponential growth in the volume of digital image databases is making it increasingly difficult to retrieve relevant information from them. Efficient retrieval systems require distinctive features extracted from visually rich contents, represented semantically in a human perception-oriented manner. This paper presents an efficient framework to model image contents as an undirected attributed relational graph, exploiting color, texture, layout, and saliency information. The proposed method encodes salient features into this rich representative model without requiring any segmentation or clustering procedures, reducing the computational complexity. In addition, an efficient graph-matching procedure implemented on specialized hardware makes it more suitable for real-time retrieval applications. The proposed framework has been tested on three publicly available datasets, and the results prove its superiority in terms of both effectiveness and efficiency in comparison with other state-of-the-art schemes. / Supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2013R1A1A2012904). Attributed relational graph Image representation Content-based image retrieval Saliency map Real-time retrieval
12	Towards a Quantitative Evaluation of Layout Using Graphic Design Principles Mosora, Daniel J. 15 May 2012 (has links) No description available. Cognitive Psychology Computer Science Design Psychology Quantitative Psychology graphic design psychophysics automatic layout visualization cognitive psychology saliency map image analysis
13	Modulation noradrénergique et ajustement des processus attentionnels chez le singe / Noradrenergic modulation and adjustement of attentional processes in monkeys Reynaud, Amélie 31 October 2019 (has links) L'attention est une fonction au cœur de la cognition qui, à tout moment, nous permet de sélectionner les informations pertinentes à traiter, tout en ignorant les autres. Cette sélection de l’information qui s’opère à la fois dans l'espace et dans le temps résulte de l’intégration des informations sensorielles et d’un contrôle de "haut niveau" en fonction de nos buts. Cette fonction dépend d’un réseau cérébral incluant le système fronto-pariétal et est sous l’influence de différents neuromodulateurs, en particulier la noradrénaline, dont l’action reste encore mal connue. Mon travail de thèse consistait à comprendre le rôle de la noradrénaline sur les processus attentionnels. Mes objectifs étaient d’une part de vérifier notre hypothèse selon laquelle la noradrénaline modulerait les différentes facettes de l’attention (attention spatiale et attention soutenue) et d’autre part d’élucider les mécanismes d’action par lesquelles la noradrénaline exercerait ces effets. Pour répondre à ces questions, nous avons testé l’impact d’une augmentation de la transmission noradrénergique (administration intramusculaire d'atomoxétine) chez le singe, dans des tâches comportementales nécessitant une sélection de l’information visuelle soit dans l’espace (tâche d'attention avec indice et exploration spontanée d'images) soit au cours du temps (tâche de discrimination go/nogo). Nos résultats démontrent que l’atomoxétine facilite les processus attentionnels à la fois dans l’espace et au cours du temps. Dans l’espace, l’atomoxétine module l’orientation de l’attention visuo-spatiale en fonction du contexte, en ajustant le taux d’accumulation sensorielle ou l’impact de la saillance des images sur l’orientation de l’attention. Au cours du temps, l’atomoxétine ajuste la relation entre la sensibilité à discriminer la cible parmi des distracteurs et le biais de réponse des animaux. En résumé, mes résultats démontrent que la noradrénaline influence les deux facettes, spatiale et temporelle de l’attention et suggèrent une action via un ajustement des processus de traitement de l’information sensorielle et un ajustement du contrôle de l’attention au contexte / Attention is a function at the heart of cognition that, at any given moment, enables us to select some information for further processing, while setting aside others. This selection of information that operates both in space and time, results from the integration of sensory information and higher-level control according to our goals. This function depends on a cerebral network including the fronto-parietal system. It is also under the influence of different neuromodulators, in particular norepinephrine, the action of which is still poorly understood.The aim of my PhD work was to understand the role of norepinephrine on attentional processes. My objectives were, on the one hand, to test our hypothesis that norepinephrine is capable of acting on the different facets of attention (spatial attention and sustained attention) and, on the other hand, to elucidate the mechanisms of action by which noradrenaline exerts its action. To answer these questions, we tested the impact of an increase in noradrenergic transmission (intramuscular administration of atomoxetine) in monkeys, using behavioral tasks requiring a selection of visual information in space (cued attentional task and spontaneous image exploration) or over time (go/nogo discrimination task). Our results demonstrate that atomoxetine facilitates attentional processes both in space and over time. In space, atomoxetine modulates the orientation of visuospatial attention according to the context, adjusting the rate of sensory accumulation or the impact of image saliency on attention orientation. Over time, atomoxetine adjusts the relationship between the sensitivity to discriminate a target among distractors and the animal’s response bias.In summary, my results demonstrate that norepinephrine influences both the spatial and temporal facets of attention and suggests an action through an adjustment of sensory information processing and an adjustment of attention control to the context Noradrénaline Attention Singe Atomoxétine Modèle LATER Carte de saillance Théorie de détection du signal Norepinephrine Attention Monkey Atomoxetine LATER model Saliency map Signal detection theory 610
14	Visual saliency extraction from compressed streams / Extraction de la saillance visuelle à partir de flux compressés Ammar, Marwa 15 June 2017 (has links) Les fondements théoriques pour la saillance visuelle ont été dressés, il y a 35 ans, par Treisman qui a proposé "feature-integration theory" pour le système visuel humain: dans n’importe quel contenu visuel, certaines régions sont saillantes en raison de la différence entre leurs caractéristiques (intensité, couleur, texture, et mouvement) et leur voisinage. Notre thèse offre un cadre méthodologique et expérimental compréhensif pour extraire les régions saillantes directement des flux compressés (MPEG-4 AVC et HEVC), tout en minimisant les opérations de décodage. L’extraction de la saillance visuelle à partir du flux compressé est à priori une contradiction conceptuelle. D’une part, comme suggéré par Treisman, dans un contenu vidéo, la saillance est donnée par des singularités visuelles. D’autre part, afin d’éliminer la redondance visuelle, les flux compressés ne devraient plus préserver des singularités. La thèse souligne également l’avantage pratique de l’extraction de la saillance dans le domaine compressé. Dans ce cas, nous avons démontré que, intégrée dans une application de tatouage robuste de la vidéo compressée, la carte saillance agit comme un outil d’optimisation, ce qui permet d’augmenter la transparence (pour une quantité d’informations insérées et une robustesse contre les attaques prescrites) tout en diminuant la complexité globale du calcul. On peut conclure que la thèse démontre aussi bien méthodologiquement que expérimentalement que même si les normes MPEG-4 AVC et HEVC ne dépendent pas explicitement d’aucun principe de saillance visuelle, leurs flux préservent cette propriété remarquable reliant la représentation numérique de la vidéo au mécanisme psycho-cognitifs humains / The theoretical ground for visual saliency was established some 35 years ago by Treisman who advanced the integration theory for the human visual system: in any visual content, some regions are salient (appealing) because of the discrepancy between their features (intensity, color, texture, motion) and the features of their surrounding areas. This present thesis offers a comprehensive methodological and experimental framework for extracting the salient regions directly from video compressed streams (namely MPEG-4 AVC and HEVC), with minimal decoding operations. Note that saliency extraction from compressed domain is a priori a conceptual contradiction. On the one hand, as suggested by Treisman, saliency is given by visual singularities in the video content. On the other hand, in order to eliminate the visual redundancy, the compressed streams are no longer expected to feature singularities. The thesis also brings to light the practical benefit of the compressed domain saliency extraction. In this respect, the case of robust video watermarking is targeted and it is demonstrated that the saliency acts as an optimization tool, allowing the transparency to be increased (for prescribed quantity of inserted information and robustness against attacks) while decreasing the overall computational complexity. As an overall conclusion, the thesis methodologically and experimentally demonstrates that although the MPEG-4 AVC and the HEVC standards do not explicitly rely on any visual saliency principle, their stream syntax elements preserve this remarkable property linking the digital representation of the video to sophisticated psycho-cognitive mechanisms Système visuel humain Extraction de la saillance visuelle Domaine compressé MPEG-4 AVC HEVC Carte de saillance Carte de fixation Emplacements saccades Tatouage numérique Human visual system Visual saliency extraction Compressed stream MPEG-4 AVC HEVC Saliency map Fixation map Saccade location Watermarking
15	Self-calibrating eye tracker using imagesaliency : Självkalibrerande ögonspårare medhjälp av image saliency / Självkalibrerande ögonspårare medhjälp av image saliency : Self-calibrating eye tracker using imagesaliency Vega, Gabriel January 2022 (has links) Self-calibrating eye tracker using image saliency. / Självkalibrerande ögonspårare med hjälp av image saliency. Eye tracker image saliency saliency map fovea centralis fovea offset static saliency detection simpleblobdetector parameters fovea hypothesis. Ögonspårning image saliency kartor fovea centralis fovea offset static saliency detection simpleblobdetector parameterar fovea hypotes. Computer Engineering Datorteknik
16	Objective assessment of stereoscopic video quality of 3DTV / Évaluation objective de la qualité vidéo en TV 3D relief Khaustova, Darya 30 January 2015 (has links) Le niveau d'exigence minimum pour tout système 3D (images stéréoscopiques) est de garantir le confort visuel des utilisateurs. Le confort visuel est un des trois axes perceptuels de la qualité d'expérience (QoE) 3D qui peut être directement lié aux paramètres techniques du système 3D. Par conséquent, le but de cette thèse est de caractériser objectivement l'impact de ces paramètres sur la perception humaine afin de contrôler la qualité stéréoscopique. La première partie de la thèse examine l'intérêt de prendre en compte l'attention visuelle des spectateurs dans la conception d'une mesure objective de qualité 3D. Premièrement, l'attention visuelle en 2D et 3D sont comparées en utilisant des stimuli simples. Les conclusions de cette première expérience sont validées en utilisant des scènes complexes avec des disparités croisées et décroisées. De plus, nous explorons l'impact de l'inconfort visuel causé par des disparités excessives sur l'attention visuelle. La seconde partie de la thèse est dédiée à la conception d'un modèle objectif de QoE pour des vidéos 3D, basé sur les seuils perceptuels humains et le niveau d'acceptabilité. De plus nous explorons la possibilité d'utiliser la modèle proposé comme une nouvelle échelle subjective. Pour la validation de ce modèle, des expériences subjectives sont conduites présentant aux sujets des images stéréoscopiques fixes et animées avec différents niveaux d'asymétrie. La performance est évaluée en comparant des prédictions objectives avec des notes subjectives pour différents niveaux d'asymétrie qui pourraient provoquer un inconfort visuel. / The minimum requirement for any 3D (stereoscopic images) system is to guarantee visual comfort of viewers. Visual comfort is one of the three primary perceptual attributes of 3D QoE, which can be linked directly with technical parameters of a 3D system. Therefore, the goal of this thesis is to characterize objectively the impact of these parameters on human perception for stereoscopic quality monitoring. The first part of the thesis investigates whether visual attention of the viewers should be considered when designing an objective 3D quality metrics. First, the visual attention in 2D and 3D is compared using simple test patterns. The conclusions of this first experiment are validated using complex stimuli with crossed and uncrossed disparities. In addition, we explore the impact of visual discomfort caused by excessive disparities on visual attention. The second part of the thesis is dedicated to the design of an objective model of 3D video QoE, which is based on human perceptual thresholds and acceptability level. Additionally we explore the possibility to use the proposed model as a new subjective scale. For the validation of proposed model, subjective experiments with fully controlled still and moving stereoscopic images with different types of view asymmetries are conducted. The performance is evaluated by comparing objective predictions with subjective scores for various levels of view discrepancies which might provoke visual discomfort. Vidéo 3D Métrique objective Échelle de couleur Évaluation subjective Acceptabilité Gêne visuelle Évaluation de la qualité vidéo Asymétrie entre les vues Confort visuel Attention visuelle Profondeur Texture Carte de saillance Mouvement saccade Temps de fixationdisparité 3D video Objective metric Color scale Subjective assessment Acceptability Visual annoyance Views asymmetry Visual comfort Video quality assessment Human factors Visual attention Depth Texture Saliency map Depth map Saccade length Fixation duration Disparity
17	Self-Organizing Neural Visual Models to Learn Feature Detectors and Motion Tracking Behaviour by Exposure to Real-World Data Yogeswaran, Arjun January 2018 (has links) Advances in unsupervised learning and deep neural networks have led to increased performance in a number of domains, and to the ability to draw strong comparisons between the biological method of self-organization conducted by the brain and computational mechanisms. This thesis aims to use real-world data to tackle two areas in the domain of computer vision which have biological equivalents: feature detection and motion tracking. The aforementioned advances have allowed efficient learning of feature representations directly from large sets of unlabeled data instead of using traditional handcrafted features. The first part of this thesis evaluates such representations by comparing regularization and preprocessing methods which incorporate local neighbouring information during training on a single-layer neural network. The networks are trained and tested on the Hollywood2 video dataset, as well as the static CIFAR-10, STL-10, COIL-100, and MNIST image datasets. The induction of topography or simple image blurring via Gaussian filters during training produces better discriminative features as evidenced by the consistent and notable increase in classification results that they produce. In the visual domain, invariant features are desirable such that objects can be classified despite transformations. It is found that most of the compared methods produce more invariant features, however, classification accuracy does not correlate to invariance. The second, and paramount, contribution of this thesis is a biologically-inspired model to explain the emergence of motion tracking behaviour in early development using unsupervised learning. The model’s self-organization is biased by an original concept called retinal constancy, which measures how similar visual contents are between successive frames. In the proposed two-layer deep network, when exposed to real-world video, the first layer learns to encode visual motion, and the second layer learns to relate that motion to gaze movements, which it perceives and creates through bi-directional nodes. This is unique because it uses general machine learning algorithms, and their inherent generative properties, to learn from real-world data. It also implements a biological theory and learns in a fully unsupervised manner. An analysis of its parameters and limitations is conducted, and its tracking performance is evaluated. Results show that this model is able to successfully follow targets in real-world video, despite being trained without supervision on real-world video. restricted Boltzmann machine self-organization deep learning deep belief network unsupervised learning smooth pursuit saccade motion tracking feature learning invariance Gaussian filter neural network Hebbian learning real-world data biologically-inspired image classification feature extraction visual attention retinal slip saliency map

Page generated in 0.0543 seconds