Global ETD Search

371	Navigability estimation for autonomous vehicles using machine learning / Estimação de navegabilidade para veículos autônomos usando aprendizado de máquina Caio César Teodoro Mendes 08 June 2017 (has links) Autonomous navigation in outdoor, unstructured environments is one of the major challenges presents in the robotics field. One of its applications, intelligent autonomous vehicles, has the potential to decrease the number of accidents on roads and highways, increase the efficiency of traffic on major cities and contribute to the mobility of the disabled and elderly. For a robot/vehicle to safely navigate, accurate detection of navigable areas is essential. In this work, we address the task of visual road detection where, given an image, the objective is to classify its pixels into road or non-road. Instead of trying to manually derive an analytical solution for the task, we have used machine learning (ML) to learn it from a set of manually created samples. We have applied both traditional (shallow) and deep ML models to the task. Our main contribution regarding traditional ML models is an efficient and versatile way to aggregate spatially distant features, effectively providing a spatial context to such models. As for deep learning models, we have proposed a new neural network architecture focused on processing time and a new neural network layer called the semi-global layer, which efficiently provides a global context for the model. All the proposed methodology has been evaluated in the Karlsruhe Institute of Technology (KIT) road detection benchmark, achieving, in all cases, competitive results. / A navegação autônoma em ambientes externos não estruturados é um dos maiores desafios no campo da robótica. Uma das suas aplicações, os veículos inteligentes autônomos, tem o potencial de diminuir o número de acidentes nas estradas e rodovias, aumentar a eficiência do tráfego nas grandes cidades e contribuir para melhoria da mobilidade de deficientes e idosos. Para que um robô/veículo navegue com segurança, uma detecção precisa de áreas navegáveis é essencial. Neste trabalho, abordamos a tarefa de detecção visual de ruas onde, dada uma imagem, o objetivo é classificar cada um de seus pixels em rua ou não-rua. Ao invés de tentar derivar manualmente uma solução analítica para a tarefa, usamos aprendizado de máquina (AM) para aprendê-la a partir de um conjunto de amostras criadas manualmente. Nós utilizamos tanto modelos tradicionais (superficiais) quanto modelos profundos para a tarefa. A nossa principal contribuição em relação aos modelos tradicionais é uma forma eficiente e versátil de agregar características espacialmente distantes, fornecendo efetivamente um contexto espacial para esses modelos. Quanto aos modelos de aprendizagem profunda, propusemos uma nova arquitetura de rede neural focada no tempo de processamento e uma nova camada de rede neural, chamada camada semi-global, que fornece eficientemente um contexto global ao modelo. Toda a metodologia proposta foi avaliada no benchmark de detecção de ruas do Instituto de Tecnologia de Karlsruhe, alcançando, em todos os casos, resultados competitivos. Aprendizado de máquina Aprendizagem profunda Detecção de obstáculos Estimação de rua Visão estéreo Deep learning Machine learning Obstacle detection Road detection Stereo vision
372	Evaluation of Aerial Image Stereo Matching Methods for Forest Variable Estimation Svensk, Joakim January 2017 (has links) This work investigates the landscape of aerial image stereo matching (AISM) methods suitable for large scale forest variable estimation. AISM methods are an important source of remotely collected information used in modern forestry to keep track of a growing forest's condition. A total of 17 AISM methods are investigated, out of which 4 are evaluated by processing a test data set consisting of three aerial images. The test area is located in southern Sweden, consisting of mainly Norway Spruce and Scots Pine. From the resulting point clouds and height raster images, a total of 30 different metrics of both height and density types are derived. Linear regression is used to fit functions from metrics derived from AISM data to a set of forest variables including tree height (HBW), tree diameter (DBW), basal area, volume. As ground truth, data collected by dense airborne laser scanning is used. Results are presented as RMSE and standard deviation concluded from the linear regression. For tree height, tree diameter, basal area, volume the RMSE ranged from 7.442% to 10.11%, 11.58% to 13.96%, 32.01% to 35.10% and 34.01% to 38.26% respectively. The results concluded that all four tested methods achieved comparable estimation quality although showing small differences among them. Keystone and SURE performed somewhat better while MicMac placed third and Photoscan achieved the less accurate result.
373	A Prototype For An Interactive And Dynamic Image-Based Relief Rendering System / En prototyp för ett interaktivt och dynamisktbildbaserat relief renderingssystem Bakos, Niklas January 2002 (has links) In the research of developing arbitrary and unique virtual views from a real- world scene, a prototype of an interactive relief texture mapping system capable of processing video using dynamic image-based rendering, is developed in this master thesis. The process of deriving depth from recorded video using binocular stereopsis is presented, together with how the depth information is adjusted to be able to manipulate the orientation of the original scene. When the scene depth is known, the recorded organic and dynamic objects can be seen from viewpoints not available in the original video. Datorteknik Dynamic Image-Based Rendering Background Extraction Silhouette Depth Approximation Interactive Application Stereo Vision Virtual Views Datorteknik Computer Engineering Datorteknik
374	Detecting and Tracking Players in Football Using Stereo Vision Borg, Johan January 2007 (has links) The objective of this thesis is to investigate if it is possible to use stereo vision to find and track the players and the ball during a football game. The thesis shows that it is possible to detect all players that isn’t too occluded by another player. Situations when a player is occluded by another player is solved by tracking the players from frame to frame. The ball is also detected in most frames by looking for ball-like features. As with the players the ball is tracked from frame to frame so that when the ball is occluded, the positions is estimated by the tracker. Stereo vision Football Disparity estimation Camera alibration Image rectification
375	Développement et validation d’une méthode d’évaluation destranslations glénohumérales 3d sous EOS(mc) / Development and validation of a method for evaluating 3dglenohumeral translations with eos™ Lagacé, Pierre-yves 17 December 2012 (has links) L'analyse morpho-fonctionnelle précise de l'épaule, notamment au niveau de l'articulation glénohumérale (GH), permettrait d'améliorer notre compréhension des dysfonctions de l'épaule associées aux ruptures de la coiffe des rotateurs. Cependant, des difficultés techniques compliquent l'analyse de la morphologie et du mouvement de l'épaule dans un contexte clinique. Dans le cadre de la présente thèse, une méthode d'analyse 3D des translations GH basée sur l'acquisition de séquences de radiographies biplanes sous EOS™ a été développée. Une étude pilote sur 10 sujets sains a d'abord permis de vérifier la facilité d'application d'un protocole d'acquisition des images et d'évaluer la répétabilité de l'identification des repères anatomiques de l'épaule sur les radiographies. La méthode proposée en dernier lieu permet, suite à l'acquisition de radiographies, de reconstruire un modèle personnalisé morpho-réaliste de la scapula et un modèle simplifié de l'humérus. Ces modèles sont ensuite recalés interactivement sur les images acquises à différentes positions du bras et utilisés pour décrire les translations GH à ces positions. La méthode a été validée en termes de précision et de répétabilité sur des images acquises d'épaules cadavériques et de patients ayant des ruptures de la coiffe des rotateurs. La dernière partie du travail présenté ici consiste à appliquer la méthode développée à 30 sujets (25 patients et 5 sujets sains) pour lesquels l'état de la coiffe des rotateurs et la fonction de l'épaule ont été évalués, puis à étudier les corrélations entre les translations GH et la fonction de l'épaule. / Precise morpho-functional analysis of the shoulder, including of the glenohumeral (GH) joint, would allow improving our understanding of shoulder dysfunction associated to rotator cuff tears. However, technical difficulties make analysis of shoulder morphology and motion difficult in a clinical setting. The work carried out during this thesis allowed developing a method for 3D analysis of GH translations relying on the acquisition of sequences of biplanar radiographs with the EOS™ system. A pilot study carried out on 10 healthy subjects allowed verifying the ease of application of an image acquisition protocol and evaluating the repeatability of shoulder landmark identification on the radiographs. The final proposed method allows, following the acquisition of radiographs, obtaining a personalized morphologically realistic reconstruction of the scapula and a simplified model of the humerus. These models are then interactively registered to the images acquired at different arm positions and used to describe GH translations for these positions. The proposed method was validated in terms of precision and repeatability on images acquired of cadaveric shoulders and of patients with rotator cuff tears. The last part of the work presented here consists in applying the developed method to 30 subjects (25 patients and 5 asymptomatic subjects) for whom rotator cuff condition and shoulder function were assessed, and to study correlations between GH translations and shoulder function. Épaule Gléno-humérale Stéréo radiographie Pseudo-cinématique Tête humérale Shoulder Gleno-humeral joint Stereo radiography Pseudo kinematics Humeral head
376	Towards hardware accelerated rectification of high speed stereo image streams Bankarusamy, Sudhangathan January 2017 (has links) The process of combining two views of a scene in order to obtain depth information is called stereo vision. When the same is done using a computer it is then called computer stereo vision. Stereo vision is used in robotic application where depth of an object plays a role. Two cameras mounted on a rig is called a stereo camera system. Such a system is able to capture two views and enable robotic application to use the depth information to complete tasks. Anomalies are bound to occur in such a stereo rig, when both the cameras are not parallel to each other. Mounting of the cameras on a rig accurately has physical alignment limitations. Images taken from such a rig has inaccurate depth information and has to be rectified. Therefore rectification is a pre-requisite to computer stereo vision. One such a stereo rig used in this thesis is the GIMME2 stereo camera system. The system has two 10 mega-pixel cameras with on-board FPGA, RAM, processor running Linux operating system, multiple Ethernet ports and an SD card feature amongst others. Stereo rectification on memory constrained hardware is a challenging task as the process itself requires both the images to be stored in the memory. The FPGA on the GIMME2 systems must be used in order to achieve the best possible speed. Programming a system that does not have a display and for used for a specific purpose is called embedded programming. The purpose of this system is distance estimation and working with such a system falls in the Embedded Systems program. This thesis presents a method that makes rectification a step ahead for this particular system. The functionality of the algorithm is shown in MATLAB and using VHDL and is compared to available tools and systems. Rectification Image FPGA GIMME2 Distortion Stereo Vision VHDL MATLAB Camera Computer Systems Datorsystem Embedded Systems Inbäddad systemteknik
377	On precise three-dimensional environment modeling via UAV-based photogrammetric systems / Modélisation tridimensionnelle précise de l'environnement à l’aide des systèmes de photogrammétrie embarqués sur drones Shahbazi, Mozhdeh January 2016 (has links) Abstract : Images acquired from unmanned aerial vehicles (UAVs) can provide data with unprecedented spatial and temporal resolution for three-dimensional (3D) modeling. Solutions developed for this purpose are mainly operating based on photogrammetry concepts, namely UAV-Photogrammetry Systems (UAV-PS). Such systems are used in applications where both geospatial and visual information of the environment is required. These applications include, but are not limited to, natural resource management such as precision agriculture, military and police-related services such as traffic-law enforcement, precision engineering such as infrastructure inspection, and health services such as epidemic emergency management. UAV-photogrammetry systems can be differentiated based on their spatial characteristics in terms of accuracy and resolution. That is some applications, such as precision engineering, require high-resolution and high-accuracy information of the environment (e.g. 3D modeling with less than one centimeter accuracy and resolution). In other applications, lower levels of accuracy might be sufficient, (e.g. wildlife management needing few decimeters of resolution). However, even in those applications, the specific characteristics of UAV-PSs should be well considered in the steps of both system development and application in order to yield satisfying results. In this regard, this thesis presents a comprehensive review of the applications of unmanned aerial imagery, where the objective was to determine the challenges that remote-sensing applications of UAV systems currently face. This review also allowed recognizing the specific characteristics and requirements of UAV-PSs, which are mostly ignored or not thoroughly assessed in recent studies. Accordingly, the focus of the first part of this thesis is on exploring the methodological and experimental aspects of implementing a UAV-PS. The developed system was extensively evaluated for precise modeling of an open-pit gravel mine and performing volumetric-change measurements. This application was selected for two main reasons. Firstly, this case study provided a challenging environment for 3D modeling, in terms of scale changes, terrain relief variations as well as structure and texture diversities. Secondly, open-pit-mine monitoring demands high levels of accuracy, which justifies our efforts to improve the developed UAV-PS to its maximum capacities. The hardware of the system consisted of an electric-powered helicopter, a high-resolution digital camera, and an inertial navigation system. The software of the system included the in-house programs specifically designed for camera calibration, platform calibration, system integration, onboard data acquisition, flight planning and ground control point (GCP) detection. The detailed features of the system are discussed in the thesis, and solutions are proposed in order to enhance the system and its photogrammetric outputs. The accuracy of the results was evaluated under various mapping conditions, including direct georeferencing and indirect georeferencing with different numbers, distributions and types of ground control points. Additionally, the effects of imaging configuration and network stability on modeling accuracy were assessed. The second part of this thesis concentrates on improving the techniques of sparse and dense reconstruction. The proposed solutions are alternatives to traditional aerial photogrammetry techniques, properly adapted to specific characteristics of unmanned, low-altitude imagery. Firstly, a method was developed for robust sparse matching and epipolar-geometry estimation. The main achievement of this method was its capacity to handle a very high percentage of outliers (errors among corresponding points) with remarkable computational efficiency (compared to the state-of-the-art techniques). Secondly, a block bundle adjustment (BBA) strategy was proposed based on the integration of intrinsic camera calibration parameters as pseudo-observations to Gauss-Helmert model. The principal advantage of this strategy was controlling the adverse effect of unstable imaging networks and noisy image observations on the accuracy of self-calibration. The sparse implementation of this strategy was also performed, which allowed its application to data sets containing a lot of tie points. Finally, the concepts of intrinsic curves were revisited for dense stereo matching. The proposed technique could achieve a high level of accuracy and efficiency by searching only through a small fraction of the whole disparity search space as well as internally handling occlusions and matching ambiguities. These photogrammetric solutions were extensively tested using synthetic data, close-range images and the images acquired from the gravel-pit mine. Achieving absolute 3D mapping accuracy of 11±7 mm illustrated the success of this system for high-precision modeling of the environment. / Résumé : Les images acquises à l’aide d’aéronefs sans pilote (ASP) permettent de produire des données de résolutions spatiales et temporelles uniques pour la modélisation tridimensionnelle (3D). Les solutions développées pour ce secteur d’activité sont principalement basées sur des concepts de photogrammétrie et peuvent être identifiées comme des systèmes photogrammétriques embarqués sur aéronefs sans pilote (SP-ASP). Ils sont utilisés dans plusieurs applications environnementales où l’information géospatiale et visuelle est essentielle. Ces applications incluent notamment la gestion des ressources naturelles (ex. : agriculture de précision), la sécurité publique et militaire (ex. : gestion du trafic), les services d’ingénierie (ex. : inspection de bâtiments) et les services de santé publique (ex. : épidémiologie et gestion des risques). Les SP-ASP peuvent être subdivisés en catégories selon les besoins en termes de précision et de résolution. En effet, dans certains cas, tel qu’en ingénierie, l’information sur l’environnement doit être de haute précision et de haute résolution (ex. : modélisation 3D avec une précision et une résolution inférieure à un centimètre). Pour d’autres applications, tel qu’en gestion de la faune sauvage, des niveaux de précision et de résolution moindres peut être suffisants (ex. : résolution de l’ordre de quelques décimètres). Cependant, même dans ce type d’applications les caractéristiques des SP-ASP devraient être prises en considération dans le développement des systèmes et dans leur utilisation, et ce, pour atteindre les résultats visés. À cet égard, cette thèse présente une revue exhaustive des applications de l’imagerie aérienne acquise par ASP et de déterminer les challenges les plus courants. Cette étude a également permis d’établir les caractéristiques et exigences spécifiques des SP-ASP qui sont généralement ignorées ou partiellement discutées dans les études récentes. En conséquence, la première partie de cette thèse traite des aspects méthodologiques et d’expérimentation de la mise en place d’un SP-ASP. Le système développé a été évalué pour la modélisation précise d’une gravière et utilisé pour réaliser des mesures de changement volumétrique. Cette application a été retenue pour deux raisons principales. Premièrement, ce type de milieu fournit un environnement difficile pour la modélisation, et ce, en termes de changement d’échelle, de changement de relief du terrain ainsi que la grande diversité de structures et de textures. Deuxièment, le suivi de mines à ciel ouvert exige un niveau de précision élevé, ce qui justifie les efforts déployés pour mettre au point un SP-ASP de haute précision. Les composantes matérielles du système consistent en un ASP à propulsion électrique de type hélicoptère, d’une caméra numérique à haute résolution ainsi qu’une station inertielle. La composante logicielle est composée de plusieurs programmes développés particulièrement pour calibrer la caméra et la plateforme, intégrer les systèmes, enregistrer les données, planifier les paramètres de vol et détecter automatiquement les points de contrôle au sol. Les détails complets du système sont abordés dans la thèse et des solutions sont proposées afin d’améliorer le système et la qualité des données photogrammétriques produites. La précision des résultats a été évaluée sous diverses conditions de cartographie, incluant le géoréférencement direct et indirect avec un nombre, une répartition et des types de points de contrôle variés. De plus, les effets de la configuration des images et la stabilité du réseau sur la précision de la modélisation ont été évalués. La deuxième partie de la thèse porte sur l’amélioration des techniques de reconstruction éparse et dense. Les solutions proposées sont des alternatives aux techniques de photogrammétrie aérienne traditionnelle et adaptée aux caractéristiques particulières de l’imagerie acquise à basse altitude par ASP. Tout d’abord, une méthode robuste de correspondance éparse et d’estimation de la géométrie épipolaire a été développée. L’élément clé de cette méthode est sa capacité à gérer le pourcentage très élevé des valeurs aberrantes (erreurs entre les points correspondants) avec une efficacité de calcul remarquable en comparaison avec les techniques usuelles. Ensuite, une stratégie d’ajustement de bloc basée sur l’intégration de pseudoobservations du modèle Gauss-Helmert a été proposée. Le principal avantage de cette stratégie consistait à contrôler les effets négatifs du réseau d’images instable et des images bruitées sur la précision de l’autocalibration. Une implémentation éparse de cette stratégie a aussi été réalisée, ce qui a permis de traiter des jeux de données contenant des millions de points de liaison. Finalement, les concepts de courbes intrinsèques ont été revisités pour l’appariement stéréo dense. La technique proposée pourrait atteindre un haut niveau de précision et d’efficacité en recherchant uniquement dans une petite portion de l’espace de recherche des disparités ainsi qu’en traitant les occlusions et les ambigüités d’appariement. Ces solutions photogrammétriques ont été largement testées à l’aide de données synthétiques, d’images à courte portée ainsi que celles acquises sur le site de la gravière. Le système a démontré sa capacité a modélisation dense de l’environnement avec une très haute exactitude en atteignant une précision 3D absolue de l’ordre de 11±7 mm. Photogrammetry Computer vision Unmanned aerial systems Dense stereo matching Robust epipolar-geometry estimation Bundle adjustment Sensor integration
378	Study on Assessing QoE of 3DTV Using Subjective Methods ananth, Indirajith Vijai January 2013 (has links) The ever increasing popularity and enormous growth in 3D movie industry is the stimulating phenomenon for the penetration of 3D services into home entertainment systems. Providing a third dimension gives intense visual experience to the viewers. Being a new eld, there are several researches going on to measure the end user's viewing experience. Research groups including 3D TV manufacturers, service providers and standards organizations are interested to improve user experience. Recent research in 3D video quality measurements have revealed uncertain issues as well as more well known results. Measuring the perceptual stereoscopic video quality by subjective testing can provide practical results. This thesis studies and investigate three di erent rating scales (Video Quality, Visual Discomfort and Sense of Presence) and compares them by subjective testing, combined with two viewing distances at 3H and 5H, where H is the hight of display screen. This thesis work shows that single rating scale produces the same result as three di erent scales and viewing distance has very less or no impact on Quality of Experience (QoE) of 3DTV for 3H and 5H distances for symmetric coding impairments. / indirajithv@gmail.com 3DTV Stereo Quality of Experience Video Quality Sense of Presence Visual Discomfort Computer Sciences Datavetenskap (datalogi) Media and Communications Medie- och kommunikationsvetenskap Telecommunications Telekommunikation
379	Modélisation de l'interaction lumière/matière pour l'analyse de surfaces rugueuses texturées par stéréo photométrie / Light-matter interaction modelling for analysing textured rough surfaces by photometric stereo Bony, Alexandre 02 December 2013 (has links) Les techniques de reconstruction 3d sont devenues incontournables pour des applicationstelles que la caractérisation et l'analyse de surfaces. Les travaux réalisés au coursde cette thèse ont pour objectif d'améliorer la qualité des reconstructions 3d par stéréophotométrie.Cette méthode repose sur deux principes, l'inversion d'un modèle d'interactionlumière/matière (BRDF) et la configuration d'un système d'éclairage et de prises de vues.Pour des surfaces diffuses, la stéréo-photométrie est réalisée à partir d'un minimum detrois images acquises d'un point de vue fixe pour des directions d'éclairages différentes.Son avantage est d'extraire simultanément les propriétés géométriques et colorimétriquesdes surfaces analysées même en cas de forte rugosité. Néanmoins, son application exige laformulation de plusieurs hypothèses qui sont difficilement respectables dans un contexteréel. Ceci génère des erreurs significatives dans les reconstructions. Pour les réduire, nousproposons différentes contributions qui s'articulent autour de la prise en compte globale de lachaine d'acquisition. Les apports de nos travaux se situent aux niveaux de la caractérisationet de la modélisation du système d'éclairage, du capteur d'acquisition et de l'améliorationde la qualité des images. Nous nous sommes aussi intéressés à l'optimisation des protocolesde prises de vues dans le cas de spécularité surfacique ou d'ombrage dus à la présence derugosité. Les résultats obtenus montrent que la prise en compte de ces caractéristiques dansl'inversion d'un modèle de BRDF permet une nette amélioration des reconstructions et offrela possibilité de réduire la taille des systèmes d'acquisition. / Tridimensional reconstruction method has become essential for applications suchas the characterization and analysis of surfaces. In this thesis, aims are to increase the qualityof 3d reconstructions by photometric stereo. This method is based on two principles, reversinglight-matter interaction model and configuration of a lighting system. With diffuse surfaces,the photometric stereo use three captured images from a fixed point of view for differentillumination directions. Its main advantage is to extract the color and geometric propertiesfor the textured rough surfaces. However, its application requires to make assumptions thatare not credible in real cases. This problem generates significant errors in the reconstructions.To reduce them, we offer various solutions around the overall consideration of the acquisitionchain. Our contribution focuses on the characterization and modeling of the lighting system,the acquisition sensor and improved image quality. We are also interested to optimize acquisitionprotocol in the case of specular surface or shading due to the surface geometry. Ourresults show that the inclusion of these features in the inversion of a BRDF model allowsan improvement of 3d reconstructions as well as the possibility of reducing the size of theacquisition systems. Reconstruction 3d Acquisition d'image Photométrie Stéréo-photométrie Brdf Imagerie 3d reconstruction Image acquisition Photometry Photometric stereo Brdf Imaging 006.6
380	Automatic Volume Estimation of Timber from Multi-View Stereo 3D Reconstruction Rundgren, Emil January 2017 (has links) The ability to automatically estimate the volume of timber is becoming increasingly important within the timber industry. The large number of timber trucks arriving each day at Swedish timber terminals fortifies the need for a volume estimation performed in real-time and on-the-go as the trucks arrive. This thesis investigates if a volumetric integration of disparity maps acquired from a Multi-View Stereo (MVS) system is a suitable approach for automatic volume estimation of timber loads. As real-time execution is preferred, efforts were made to provide a scalable method. The proposed method was quantitatively evaluated on datasets containing two geometric objects of known volume. A qualitative comparison to manual volume estimates of timber loads was also made on datasets recorded at a Swedish timber terminal. The proposed method is shown to be both accurate and precise under specific circumstances. However, robustness is poor to varying weather conditions, although a more thorough evaluation of this aspect needs to be performed. The method is also parallelizable, which means that future efforts can be made to significantly decrease execution time. 3D Reconstruction Multi-View Stereo Automatic Volume Estimation Signed Distance Function Computer Vision Signal Processing Image Processing Signal Processing Signalbehandling

Search results