Global ETD Search

91	Détection de primitives par une approche discrète et non linéaire : application à la détection et la caractérisation de points d'intérêt dans les maillages 3D / Primitives detection by a discrete and non linear approach : application to the detection and caracterization of interest points for 3D meshes Walter, Nicolas 26 August 2010 (has links) Ce manuscrit est dédié à la détection et la caractérisation de points d'intérêt dans les maillages. Nous montrons tout d'abord les limitations de la mesure de courbure sur des contours francs, mesure habituellement utilisée dans le domaine de l'analyse de maillages. Nous présentons ensuite une généralisation de l'opérateur SUSAN pour les maillages, nommé SUSAN-3D. La mesure de saillance proposée quantifie les variations locales de la surface et classe directement les points analysés en cinq catégories : saillant, crête, plat, vallée et creux. Les maillages considérés sont à variété uniforme avec ou sans bords et peuvent être réguliers ou irréguliers, denses ou non et bruités ou non. Nous étudions ensuite les performances de SUSAN-3D en les comparant à celles de deux opérateurs de courbure : l'opérateur de Meyer et l'opérateur de Stokely. Deux méthodes de comparaison des mesures de saillance et courbure sont proposées et utilisées sur deux types d’objets : des sphères et des cubes. Les sphères permettent l'étude de la précision sur des surfaces différentiables et les cubes sur deux types de contours non-différentiables : les arêtes et les coins. Nous montrons au travers de ces études les avantages de notre méthode qui sont une forte répétabilité de la mesure, une faible sensibilité au bruit et la capacité d'analyser les surfaces peu denses. Enfin, nous présentons une extension multi-échelle et une automatisation de la détermination des échelles d'analyse qui font de SUSAN-3D un opérateur générique et autonome d’analyse et de caractérisation pour les maillages / This manuscript is dedicated to the detection and caracterization of interest points for 3D meshes. First of all, we show the limitations of the curvature measure on sharp edges, the measure usually used for the analysis of meshes. Then, we present a generalization of the SUSAN operator for meshes, named SUSAN-3D. The saliency measure proposed quantify the local variation of the surface and classify directly the analysed vertices in five classes: salient, crest, flat, valley and cavity. The meshes under consideration are manifolds and can be closed or non-closed, regulars or irregulars, dense or not and noised or not. The accuracy of the SUSAN-3D operator is compared to two curvature operators: the Meyer's operator and the Stokely's operator. Two comparison methods of saliency and curvature measures are described and used on two types of objects: spheres and cubes. The spheres allow the study of the accuracy for differentiable surfaces and the cubes for two types of sharp edges: crests and corners. Through these studies, we show the benefits of our method that are a strong repeatability of the measure, high robustness to noise and capacity to analyse non dense meshes. Finally, we present a multi-scale scheme and automation of the determination of the analysis scales that allow SUSAN-3D to be a general and autonomous operator for the analysis and caracterization of meshes Saillance Détection Caractérisation Points d'intérêt Points saillants Maillages irréguliers Analyse multi-échelle Saliency Detection Caracterization Interest points Salient points Irregular meshes Multi-scale analysis 006.6 516
92	La programmation des saccades oculaires chez l'homme : rôle et décours temporel des traitements visuels élémentaires / Saccade programming in humans : Influence and time course of elementary visual processes Massendari, Delphine 23 April 2015 (has links) Notre environnement visuel est riche en lumière, couleurs, traits, textures et formes. Pour appréhender cette richesse, nous déplaçons nos yeux tous les quarts de seconde à l'aide de mouvements très rapides appelés saccades. Une telle vision dite active a fait l’objet de multiples recherches, mais les interactions entre les systèmes visuel et oculomoteur ne sont pas clairement établies. Cette thèse vise à préciser ces interactions en étudiant si les délais temporels associés au traitement d'informations visuelles de plus en plus élaborées contraignent où et quand nos yeux bougent. Trois séries d'études comportementales menées chez l'homme et utilisant des paradigmes novateurs ont été réalisées. Elles nous ont permis de mettre en évidence que le traitement des contrastes d'orientation, tout comme le traitement des contrastes de luminance sont intégrés par le système saccadique. En effet, un stimulus (distracteur) différant d'un fond texturé par sa luminance ou son orientation dévie le regard de sa cible dans la même mesure, et ce, quelle que soit la latence des saccades. Néanmoins, le contraste de luminance conserve un rôle prédominant. Premièrement, il conduit au déclenchement plus précoce des saccades en comparaison avec le contraste d’orientation. Deuxièmement, dès lors qu'il entre en compétition avec des informations plus élaborées comme le contour, il suffit à déterminer la métrique des saccades. Ainsi, en accord avec l'architecture des systèmes visuel et oculomoteur, les traitements visuels influencent la programmation des saccades de manière ordonnée. / Our environment is rich in light, color, features, textures, and shapes. To extract this information, we move our eyes four times per second with rapid eye movements called saccades. This so-called active vision has been studied extensively, but the interactions between the visual and oculomotor systems have not been fully characterized yet. This thesis aims to clarify these interactions by investigating whether the delays in processing visual information of increasing complexity determine where and when our eyes move. The present work focuses on three types of basic visual processing for which the neural substrates are well established and predict a specific order in the programming of saccades at the level of the superior colliculus. We conducted three series of behavioral studies with human participants using novel experimental paradigms. These studies showed that orientation-contrast processing as well as early luminance-contrast processing are integrated in the saccadic system to the same extent. When aiming for a target stimulus, the eyes deviate toward a distractor stimulus in equal measure, irrespective of whether the distractor differed in luminance or orientation from a texture background and irrespective of saccade latency. However, the role of luminance contrast remains dominant. Firstly, luminance contrast triggers faster saccades than orientation contrast. Secondly, when luminance contrast competes with more complex information such as contour, solely luminance contrast determines saccade metrics. Therefore, visual processes influence saccade programming in a specific order that is consistent with the architecture of the visual and oculomotor systems. Saccades Contrastes de luminance Contrastes d'orientation Contour Effet global Décours temporel Saillance Choix forcé Saccades Luminance contrast Orientation contrast Contour Global effet Time course Saliency Forced choice
93	CONTENT UNDERSTANDING FOR IMAGING SYSTEMS: PAGE CLASSIFICATION, FADING DETECTION, EMOTION RECOGNITION, AND SALIENCY BASED IMAGE QUALITY ASSESSMENT AND CROPPING Shaoyuan Xu (9116033) 12 October 2021 (has links) <div>This thesis consists of four sections which are related with four research projects.</div><div><br></div><div>The first section is about Page Classification. In this section, we extend our previous approach which could classify 3 classes of pages: Text, Picture and Mixed, to 5 classes which are: Text, Picture, Mixed, Receipt and Highlight. We first design new features to define those two new classes and then use DAG-SVM to classify those 5 classes of images. Based on the results, our algorithm performs well and is able to classify 5 types of pages.</div><div><br></div><div>The second section is about Fading Detection. In this section, we develop an algorithm that can automatically detect fading for both text and non-text region. For text region, we first do global alignment and then perform local alignment. After that, we create a 3D color node system, assign each connected component to a color node and get the color difference between raster page connected component and scanned page connected. For non-text region, after global alignment, we divide the page into "super pixels" and get the color difference between raster super pixels and testing super pixels. Compared with the traditional method that uses a diagnostic page, our method is more efficient and effective.</div><div><br></div><div>The third section is about CNN Based Emotion Recognition. In this section, we build our own emotion recognition classification and regression system from scratch. It includes data set collection, data preprocessing, model training and testing. We extend the model to real-time video application and it performs accurately and smoothly. We also try another approach of solving the emotion recognition problem using Facial Action Unit detection. By extracting Facial Land Mark features and adopting SVM training framework, the Facial Action Unit approach achieves comparable accuracy to the CNN based approach.</div><div><br></div><div>The forth section is about Saliency Based Image Quality Assessment and Cropping. In this section, we propose a method of doing image quality assessment and recomposition with the help of image saliency information. Saliency is the remarkable region of an image that attracts people's attention easily and naturally. By showing everyday examples as well as our experimental results, we demonstrate the fact that, utilizing the saliency information will be beneficial for both tasks.</div> Fading Detection Image Classification SVM Classifiers Emotion Recognition CNN facial action unit (AU) detection saliency Image Quality Assessment Image Cropping
94	Modulation noradrénergique et ajustement des processus attentionnels chez le singe / Noradrenergic modulation and adjustement of attentional processes in monkeys Reynaud, Amélie 31 October 2019 (has links) L'attention est une fonction au cœur de la cognition qui, à tout moment, nous permet de sélectionner les informations pertinentes à traiter, tout en ignorant les autres. Cette sélection de l’information qui s’opère à la fois dans l'espace et dans le temps résulte de l’intégration des informations sensorielles et d’un contrôle de "haut niveau" en fonction de nos buts. Cette fonction dépend d’un réseau cérébral incluant le système fronto-pariétal et est sous l’influence de différents neuromodulateurs, en particulier la noradrénaline, dont l’action reste encore mal connue. Mon travail de thèse consistait à comprendre le rôle de la noradrénaline sur les processus attentionnels. Mes objectifs étaient d’une part de vérifier notre hypothèse selon laquelle la noradrénaline modulerait les différentes facettes de l’attention (attention spatiale et attention soutenue) et d’autre part d’élucider les mécanismes d’action par lesquelles la noradrénaline exercerait ces effets. Pour répondre à ces questions, nous avons testé l’impact d’une augmentation de la transmission noradrénergique (administration intramusculaire d'atomoxétine) chez le singe, dans des tâches comportementales nécessitant une sélection de l’information visuelle soit dans l’espace (tâche d'attention avec indice et exploration spontanée d'images) soit au cours du temps (tâche de discrimination go/nogo). Nos résultats démontrent que l’atomoxétine facilite les processus attentionnels à la fois dans l’espace et au cours du temps. Dans l’espace, l’atomoxétine module l’orientation de l’attention visuo-spatiale en fonction du contexte, en ajustant le taux d’accumulation sensorielle ou l’impact de la saillance des images sur l’orientation de l’attention. Au cours du temps, l’atomoxétine ajuste la relation entre la sensibilité à discriminer la cible parmi des distracteurs et le biais de réponse des animaux. En résumé, mes résultats démontrent que la noradrénaline influence les deux facettes, spatiale et temporelle de l’attention et suggèrent une action via un ajustement des processus de traitement de l’information sensorielle et un ajustement du contrôle de l’attention au contexte / Attention is a function at the heart of cognition that, at any given moment, enables us to select some information for further processing, while setting aside others. This selection of information that operates both in space and time, results from the integration of sensory information and higher-level control according to our goals. This function depends on a cerebral network including the fronto-parietal system. It is also under the influence of different neuromodulators, in particular norepinephrine, the action of which is still poorly understood.The aim of my PhD work was to understand the role of norepinephrine on attentional processes. My objectives were, on the one hand, to test our hypothesis that norepinephrine is capable of acting on the different facets of attention (spatial attention and sustained attention) and, on the other hand, to elucidate the mechanisms of action by which noradrenaline exerts its action. To answer these questions, we tested the impact of an increase in noradrenergic transmission (intramuscular administration of atomoxetine) in monkeys, using behavioral tasks requiring a selection of visual information in space (cued attentional task and spontaneous image exploration) or over time (go/nogo discrimination task). Our results demonstrate that atomoxetine facilitates attentional processes both in space and over time. In space, atomoxetine modulates the orientation of visuospatial attention according to the context, adjusting the rate of sensory accumulation or the impact of image saliency on attention orientation. Over time, atomoxetine adjusts the relationship between the sensitivity to discriminate a target among distractors and the animal’s response bias.In summary, my results demonstrate that norepinephrine influences both the spatial and temporal facets of attention and suggests an action through an adjustment of sensory information processing and an adjustment of attention control to the context Noradrénaline Attention Singe Atomoxétine Modèle LATER Carte de saillance Théorie de détection du signal Norepinephrine Attention Monkey Atomoxetine LATER model Saliency map Signal detection theory 610
95	The Antecedents of Work-School Conflict and Work-School Enrichment Robertson, Katelyn 26 February 2021 (has links) The cost of higher education is rapidly increasing on both a global scale (Creed, French & Hood, 2015), and in the local South African context (Calitz & Fourie, 2016). This rise in costs has seen a commensurate increase in the number of university students who work, largely as a means to fund the increasing cost of their higher education (Butler, 2007; Cinamon, 2016; Owen, Kavanagh & Dollard, 2018). These working students are frequently referred to as non-traditional students in the academic literature. The psychological experiences of non-traditional students who work is a pertinent and expanding area of interest for multiple stakeholders (Owen et al., 2018). These experiences can be classified through the constructs of Work-School Conflict (WSC) and Work-School Enrichment (WSE), which refer, respectively, to the negative and positive aspects of the work-school interface (Butler, 2007). The antecedents of WSC and WSE experiences amongst nontraditional working students have to date not received any empirical attention in the South African research literature. This study aims to address this gap by contributing to the national body of knowledge in this area. The measures used were secondary self-report survey data completed by post-graduate university students who are simultaneously engaged in paid work (N=330). Multiple regression analyses indicated that time demands, job demands and social support from work explained a significant proportion of WSC; whilst job-school congruence and social support within the work context were statistically significant predictors of WSE. Moderation analyses revealed that social support at work influenced the relationship between job demands and WSC, whilst employee role saliency significantly interacted with job-school congruence to influence WSE. The results of this study are aligned to international work-school research findings, which support the additive model of job characteristics as antecedents to WSC and WSE. These results also provide deeper insight into the less explored moderation effects of work resources and demands interacting to influence WSC and WSE. Theoretical, management and educational implications of these findings are considered in relation to the existing literature. Organisational Psychology Work-school conflict work-school enrichment antecedents time demands job demands role saliency social support from work job-school congruence job control
96	Intrinsic motivation mecanisms for incremental learning of visual saliency / Apprentissage incrémental de la saillance visuelle par des mécanismes de motivation intrinsèque Craye, Céline 03 April 2017 (has links) La conception de systèmes de perception autonomes, tels que des robots capables d’accomplir un ensemble de tâches de manière sûre et sans assistance humaine, est l’un des grands déﬁs de notre siècle. Pour ce faire, la robotique développementale propose de concevoir des robots qui, comme des enfants, auraient la faculté d’apprendre directement par interaction avec leur environnement. Nous avons dans cette thèse exploré de telles possibilités en se limitant à l’apprentissage de la localisation des objets d’intérêt (ou objets saillants) dans l’environnement du robot.Pour ce faire, nous présentons dans ces travaux un mécanisme capable d’apprendre la saillance visuelle directement sur un robot, puis d’utiliser le modèle appris de la sorte pour localiser des objets saillants dans son environnement. Cette méthode a l’avantage de permettre la création de modèles spécialisés pour l’environnement du robot et les tâches qu’il doit accomplir, tout en restant ﬂexible à d’éventuelles nouveautés ou modiﬁcations de l’environnement.De plus, aﬁn de permettre un apprentissage efﬁcace et de qualité, nous avons développé des stratégies d’explorations basées sur les motivations intrinsèques, très utilisées en robotique développementale. Nous avons notamment adapté l’algorithme IAC à l’apprentissage de la saillance visuelle, et en avons conçu une extension, RL-IAC, pour permettre une exploration efﬁcace sur un robot mobile. Aﬁn de vériﬁer et d’analyser les performances de nos algorithmes, nous avons réalisé des évaluations sur plusieurs plateformes robotiques dont une plateforme fovéale et un robot mobile, ainsi que sur des bases de données publiques. / Conceiving autonomous perceptual systems, such as robots able to accomplish a set of tasks in a safe way, without any human assistance, is one of the biggest challenge of the century. To this end, the developmental robotics suggests to conceive robots able to learn by interacting directly with their environment, just like children would. This thesis is exploring such possibility while restricting the problem to the one of localizing objects of interest (or salient objects) within the robot’s environment.For that, we present in this work a mechanism able to learn visual saliency directly on a robot, then to use the learned model so as to localize salient objects within their environment. The advantage of this method is the creation of models dedicated to the robot’s environment and tasks it should be asked to accomplish, while remaining flexible to any change or novelty in the environment.Furthermore, we have developed exploration strategies based on intrinsic motivations, widely used in developmental robotics, to enable efficient learning of good quality. In particular, we adapted the IAC algorithm to visual saliency leanring, and proposed an extension, RL-IAC to allow an efficient exploration on mobile robots.In order to verify and analyze the performance of our algorithms, we have carried out various experiments on several robotics platforms, including a foveated system and a mobile robot, as well as publicly available datasets. Saillance visuelle Robotique mobile Robotique cognitive Bio-inspiration Motivation intrinsèque Localisation d'objets Visual saliency Mobile robotics Cognitive robotics Bio-inspiration Intrinsic motivation Object localization 006
97	Mesure sans référence de la qualité des vidéos haute déﬁnition diffusées avec des pertes de transmission / No-Reference Video Quality Assessment of High Deﬁnition Video Streams Delivered with Losses Boujut, Hugo 24 September 2012 (has links) Les objectifs de ce travail de thèse ont été: d’une part de détecter automatique-ment les images gelées dans des vidéos télédiffusées; et d’autre part de mesurer sans référencela qualité des vidéos télédiffusées (IP et DVB-T). Ces travaux ont été effectués dans le cadred’un projet de recherche mené conjointement par le LaBRI et la société Audemat WorldCastSystems.Pour la détection d’images gelées, trois méthodes ont été proposées: MV (basée vecteurde mouvement), DC (basée sur les coefﬁcients DC de la DCT) et SURF (basée sur les pointscaractéristiques SURF). Les deux premières méthodes ne nécessitent qu’un décodage partieldu ﬂux vidéo.Le second objectif était de mesurer sans référence la qualité des vidéos télédiffusées (IP etDVB-T). Une métrique a été développée pour mesurer la qualité perçue lorsque le ﬂux vidéoa été altéré par des pertes de transmission. Cette métrique "Weighted Macro-Block ErrorRate" (WMBER) est fondée sur la mesure de la saillance visuelle et la détection des macro-blocs endommagés. Le rôle de la saillance visuelle est de pondérer l’importance des erreursdétectées. Certaines améliorations ont été apportées à la construction des cartes de saillancespatio-temporelle. En particulier, la fusion des cartes de saillance spatiale et temporelle aété améliorée par rapport à l’état de l’art. Par ailleurs, plusieurs études ont montré que lasémantique d’une scène visuelle avait une inﬂuence sur le comportement du système visuelhumain. Il apparaît que ce sont surtout les visages humains qui attirent le regard. C’est laraison pour laquelle nous avons ajouté une dimension sémantique aux cartes de saillancespatio-temporelle. Cette dimension sémantique est essentiellement basée sur le détecteurde visage de Viola Jones. Pour prédire la qualité perçue par les utilisateurs, nous avonsutilisé une méthode par apprentissage supervisé. Cette méthode offre ainsi la possibilité deprédire la métrique subjective "Mean Opinion Score" (MOS) à partir de mesures objectivestelles que le WMBER, PSNR ou SSIM. Une expérience psycho-visuelle a été menée avec 50sujets pour évaluer ces travaux. Cette base de données vidéo Haute-Déﬁnition est en coursde transfert à l’action COST Qualinet. Ces travaux ont également été évalués sur une autrebase de données vidéo (en déﬁnition standard) provenant de l’IRCCyN / The goal of this Ph.D thesis is to design a no-reference video quality assessment method for lossy net-works. This Ph.D thesis is conducted in collaboration with the Audemat Worldcast Systemscompany.Our ﬁrst no-reference video quality assessment indicator is the frozen frame detection.Frozen frame detection was a research topic which was well studied in the past decades.However, the challenge is to embed a frozen frame detection method in the GoldenEagleAudemat equipment. This equipment has low computation resources that not allow real-time HD video decoding. Two methods are proposed: one based on the compressed videostream motion vectors (MV-method) and another one based on the DC coefﬁcients from thedct transform (DC-method). Both methods only require the partial decoding of the com-pressed video stream which allows for real-time analysis on the GoldenEagle equipment.The evaluation shows that results are better than the frame difference base-line method.Nevertheless, the MV and the DC methods are only suitable with for MPEG2 and H.264video streams. So a third method based on SURF points is proposed.As a second step on the way to a no-reference video quality assessment metric, we areinterested in the visual perception of transmission impairments. We propose a full-referencemetric based on saliency maps. This metric, Weighted Mean Squared Error (WMSE), is theMSE metric weighted by the saliency map. The saliency map role is to distinguish betweennoticeable and unnoticeable transmission impairments. Therefore this spatio-temporal saliencymaps is computed on the impaired frame. Thus the pixel difference in the MSE computationis emphasized or diminished with regard to the pixel saliency. According to the state of theart, several improvements are brought to the saliency map computation process. Especially,new spatio-temporal saliency map fusion strategies are designed.After our successful attempt to assess the video quality with saliency maps, we develop ano-reference quality metric. This metric, Weighted Macro-Block Error Rate (WMBER), relies on the saliency map and the macro-block error detection. The macro-block error detectionprovides the impaired macro-blocks location in the frame. However, the impaired macro-blocks are concealed with more or less success during the decoding process. So the saliencymap provides the user perceived impairment strength for each macro-block.Several psycho-visual studies have shown that semantics play an important role in visualscene perception. These studies conclude that faces and text are the most attractive. Toimprove the spatio-temporal saliency model a semantic dimension is added. This semanticsaliency is based on the Viola & Jones face detector.To predict the Mean Opinion Score (MOS) from objective metric values like WMBER,WMSE, PSNR or SSIM, we propose to use a supervised learning approach. This approach iscalled Similarity Weighted Average (SWA). Several improvements are brought to the originalSWA.For the metrics evaluation a psycho-visual experiment with 50 subjects has been carriedout. To measure the saliency map models accuracy, a psycho-visual experiment with aneye-tracker has also been carried out. These two experiments habe been conducted in col-laboration with the Ben Gurion University, Israel. WMBER and WMSE performances arecompared with reference metrics like SSIM and PSNR. The proposed metrics are also testedon a database provided by IRCCyN research laboratory. Qualité vidéo Sans référence H.264 Haute-Définition Carte de saillance Image gelée Apprentissage supervisé Video quality assessment No reference H.264 High Definition Saliency maps Frozen frames Supervised learning
98	Shaft Transducerless Vector Control Of The Interior Permanent Magnet Motor With Speed And Position Estimation Using High Frequency Signal Injection And Flux Observer Methods Goksu, Omer 01 May 2008 (has links) (PDF) In this thesis, shaft transducerless vector control of Interior Permanent Magnet (IPM) motor with speed and position estimation using saliency based high frequency signal injection and fundamental model based flux observer methods will be investigated. The magnetic saliency characteristic of a 2.2-kW IPM motor will be experimentally extracted by means of high frequency signal injection. High frequency signal injection method will be used to estimate the speed and position at zero and low speed based on the magnetic saliency of the IPM motor. At high speed, fundamental model based flux observer method will be utilized for speed and position estimation. Seamless transition between the two estimation methods will be provided. Using the estimated speed and position information, the motor will be closed loop vector controlled and the drive motion performance over wide speed and load range will be investigated. The IPM motor drive and the estimation/control algorithms will be modeled and their performance will be demonstrated by detailed computer simulations. A three-phase voltage source inverter and a motor test bench will be built, and the estimation/control algorithms will be implemented on a DSP based motor control platform. The IPM motor drive system will be tested in the laboratory and the theory and simulation results will be verified by the experiments.
99	Implementation and evaluation of content-aware video retargeting techniques / Implementation och utvärdering av innehållsstyrd omformatering av videosekvenser Holmer, Stefan January 2008 (has links) <p>The purpose of this master thesis was to study different content-aware video retargeting techniques, concentrating on a generalization of seam carving for video. Focus have also been put on the possibility to combine different techniques to achieve better retargeting of both multi-shot video and single-shot video. This also involved significant studies of automatic cut detection and different measures of video content. The work resulted in a prototype application for semi-automatic video retargeting, developed in Matlab. Three different retargeting techniques, seam carving, automated pan & scan and subsampling using bi-cubic interpolation, have been implemented in the prototype. The techniques have been evaluated and compared to each other from a content preservation perspective and a perceived quality perspective.</p> / <p>Syftet med examensarbetet har varit att studera tekniker för ändring av bredd/höjd-förhållandet i videosekvenser, där hänsyn tas till innehållet i bilderna. Fokus har lagts på en generalisering av "seam carving" för video och möjligheterna att kombinera olika tekniker för att nå bättre kvalitet både för videosekvenser som består av endast ett, eller flera, klipp. Detta innefattade således också omfattande studier av automatisk klippdetektering och olika mått av videoinnehåll. Arbetet har resulterat i en prototypapplikation utvecklad i Matlab för halvautomatisk förändring av bildförhållande där hänsyn tas till innehållet i sekvenserna. I prototypen finns tre metoder implementerade, "seam carving", automatiserad "pan & scan" och nedsampling med bi-kubisk interpolering. Dessa metoder har utvärderats och jämförts med varandra från ett innehållsbevarande perspektiv och ett kvalitetsperspektiv.</p> seam carving retargeting aspect ratio scaling saliency cut shot detection widescreen fullscreen processing optical flow sampling pan scan energy Image analysis Bildanalys Electrical engineering Elektroteknik Computer engineering Datorteknik Signal processing Signalbehandling TECHNOLOGY TEKNIKVETENSKAP
100	Traitement des objets 3D et images par les méthodes numériques sur graphes / 3D object processing and Image processing by numerical methods El Sayed, Abdul Rahman 24 October 2018 (has links) La détection de peau consiste à détecter les pixels correspondant à une peau humaine dans une image couleur. Les visages constituent une catégorie de stimulus importante par la richesse des informations qu’ils véhiculent car avant de reconnaître n’importe quelle personne il est indispensable de localiser et reconnaître son visage. La plupart des applications liées à la sécurité et à la biométrie reposent sur la détection de régions de peau telles que la détection de visages, le filtrage d'objets 3D pour adultes et la reconnaissance de gestes. En outre, la détection de la saillance des mailles 3D est une phase de prétraitement importante pour de nombreuses applications de vision par ordinateur. La segmentation d'objets 3D basée sur des régions saillantes a été largement utilisée dans de nombreuses applications de vision par ordinateur telles que la correspondance de formes 3D, les alignements d'objets, le lissage de nuages de points 3D, la recherche des images sur le web, l’indexation des images par le contenu, la segmentation de la vidéo et la détection et la reconnaissance de visages. La détection de peau est une tâche très difficile pour différentes raisons liées en général à la variabilité de la forme et la couleur à détecter (teintes différentes d’une personne à une autre, orientation et tailles quelconques, conditions d’éclairage) et surtout pour les images issues du web capturées sous différentes conditions de lumière. Il existe plusieurs approches connues pour la détection de peau : les approches basées sur la géométrie et l’extraction de traits caractéristiques, les approches basées sur le mouvement (la soustraction de l’arrière-plan (SAP), différence entre deux images consécutives, calcul du flot optique) et les approches basées sur la couleur. Dans cette thèse, nous proposons des méthodes d'optimisation numérique pour la détection de régions de couleurs de peaux et de régions saillantes sur des maillages 3D et des nuages de points 3D en utilisant un graphe pondéré. En se basant sur ces méthodes, nous proposons des approches de détection de visage 3D à l'aide de la programmation linéaire et de fouille de données (Data Mining). En outre, nous avons adapté nos méthodes proposées pour résoudre le problème de la simplification des nuages de points 3D et de la correspondance des objets 3D. En plus, nous montrons la robustesse et l’efficacité de nos méthodes proposées à travers de différents résultats expérimentaux réalisés. Enfin, nous montrons la stabilité et la robustesse de nos méthodes par rapport au bruit. / Skin detection involves detecting pixels corresponding to human skin in a color image. The faces constitute a category of stimulus important by the wealth of information that they convey because before recognizing any person it is essential to locate and recognize his face. Most security and biometrics applications rely on the detection of skin regions such as face detection, 3D adult object filtering, and gesture recognition. In addition, saliency detection of 3D mesh is an important pretreatment phase for many computer vision applications. 3D segmentation based on salient regions has been widely used in many computer vision applications such as 3D shape matching, object alignments, 3D point-point smoothing, searching images on the web, image indexing by content, video segmentation and face detection and recognition. The detection of skin is a very difficult task for various reasons generally related to the variability of the shape and the color to be detected (different hues from one person to another, orientation and different sizes, lighting conditions) and especially for images from the web captured under different light conditions. There are several known approaches to skin detection: approaches based on geometry and feature extraction, motion-based approaches (background subtraction (SAP), difference between two consecutive images, optical flow calculation) and color-based approaches. In this thesis, we propose numerical optimization methods for the detection of skins color and salient regions on 3D meshes and 3D point clouds using a weighted graph. Based on these methods, we provide 3D face detection approaches using Linear Programming and Data Mining. In addition, we adapted our proposed methods to solve the problem of simplifying 3D point clouds and matching 3D objects. In addition, we show the robustness and efficiency of our proposed methods through different experimental results. Finally, we show the stability and robustness of our methods with respect to noise. Nuages de points 3D Détection faciale Détection de la peau Exploration de données Programmation linéaire Détection de saillance Correspondance de maillage 3D 3D point clouds Face detection Skin detection Data mining Linear programming Saliency detection 3D Mesh matching Point clouds simplification

Search results