41

Modern Stereo Correspondence Algorithms : Investigation and Evaluation

Olofsson, Anders January 2010 (has links)
Many different approaches have been taken towards solving the stereo correspondence problem, and great progress has been made within the field during the last decade. This is mainly thanks to newly evolved global optimization techniques and better ways to compute pixel dissimilarity between views. The most successful algorithms are based on approaches that explicitly model smoothness assumptions made about the physical world, with image segmentation and plane fitting being two frequently used techniques.

Within the project, a survey of state-of-the-art stereo algorithms was conducted and the theory behind them is explained. Techniques found interesting were implemented for experimental trials, and an algorithm aiming to achieve state-of-the-art performance was implemented and evaluated. For several cases, state-of-the-art performance was reached.

To keep down the computational complexity, an algorithm relying on local winner-take-all optimization, image segmentation and plane fitting was compared against minimizing a global energy function formulated at the pixel level. Experiments show that the local approach can in several cases match the global approach, but that problems sometimes arise, especially when large areas that lack texture are present. Such problematic areas are better handled by the explicit modeling of smoothness in global energy minimization.

Lastly, disparity estimation for image sequences was explored, and some ideas on how to use temporal information were implemented and tried. The ideas mainly relied on motion detection to determine which parts of a sequence of frames are static. Stereo correspondence for sequences is a rather new research field, and there is still a lot of work to be done.
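The local winner-take-all approach mentioned in this abstract can be illustrated with a minimal block-matching sketch: for each pixel, the disparity whose sum-of-absolute-differences (SAD) cost over a small window is lowest wins. This is a toy illustration only (the function name and parameters are invented here), not the segmentation- and plane-fitting-based algorithm from the thesis.

```python
import numpy as np

def wta_disparity(left, right, max_disp=8, window=3):
    """Local winner-take-all stereo: for each pixel in the left image,
    pick the disparity whose SAD matching cost over a small window
    against the right image is lowest. No smoothness modeling at all,
    which is why such local methods struggle in untextured regions."""
    h, w = left.shape
    half = window // 2
    disp = np.zeros((h, w), dtype=np.int32)
    L = left.astype(np.float32)
    R = right.astype(np.float32)
    for y in range(half, h - half):
        for x in range(half, w - half):
            patch = L[y - half:y + half + 1, x - half:x + half + 1]
            best_cost, best_d = np.inf, 0
            # only disparities whose candidate window stays inside the image
            for d in range(min(max_disp, x - half) + 1):
                cand = R[y - half:y + half + 1, x - d - half:x - d + half + 1]
                cost = np.abs(patch - cand).sum()
                if cost < best_cost:
                    best_cost, best_d = cost, d
            disp[y, x] = best_d
    return disp
```

On a synthetic pair where the left view is the right view shifted by two pixels, the interior of the recovered disparity map is uniformly 2, as expected.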
42

Road Surface Preview Estimation Using a Monocular Camera

Ekström, Marcus January 2018 (has links)
Recently, sensors such as radars and cameras have been widely used in the automotive industry, especially in Advanced Driver-Assistance Systems (ADAS), to collect information about the vehicle's surroundings. Stereo cameras are very popular, as they can be used passively to construct a 3D representation of the scene in front of the car. This has enabled the development of several ADAS algorithms that need 3D information to perform their tasks. One interesting application is Road Surface Preview (RSP), where the task is to estimate the road height along the future path of the vehicle. An active suspension control unit can then use this information to regulate the suspension, improving driving comfort, extending the durability of the vehicle and warning the driver about potential risks on the road surface. Stereo cameras have been successfully used in RSP and have demonstrated very good performance. However, their main disadvantages are high production cost and high power consumption, which limits installing several ADAS features in economy-class vehicles. Monocular cameras are a less expensive alternative, with significantly lower cost and power consumption. This thesis therefore investigates the possibility of solving the Road Surface Preview task using a monocular camera. We try two different approaches: structure-from-motion and Convolutional Neural Networks. The proposed methods are evaluated against the stereo-based system. Experiments show that both structure-from-motion and CNNs have good potential for solving the problem, but they are not yet reliable enough to be a complete solution to the RSP task and be used in an active suspension control unit.
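The reason a passive stereo pair yields the 3D representation this abstract refers to is the classic pinhole triangulation relation: a scene point's depth Z is inversely proportional to its disparity d between the two views, Z = f·B/d, with f the focal length in pixels and B the baseline. A small sketch with illustrative numbers (not values from the thesis):

```python
def depth_from_disparity(f_px, baseline_m, disparity_px):
    """Pinhole stereo triangulation: Z = f * B / d.
    f_px: focal length in pixels, baseline_m: camera separation in
    metres, disparity_px: horizontal pixel shift between the views."""
    return f_px * baseline_m / disparity_px

# Example: f = 700 px, baseline = 0.5 m, disparity = 35 px -> Z = 10 m
```

The relation also shows why a single (monocular) camera cannot measure depth this way: with no baseline there is no disparity, which is what motivates the structure-from-motion and CNN approaches investigated in the thesis.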
43

Um método para determinação da profundidade combinando visão estéreo e autocalibração para aplicação em robótica móvel / A Method for Depth Determination Combining Stereo Vision and Self-Calibration for Mobile Robotics

Sousa Segundo, José Sávio Alves de 30 April 2007 (has links)
This work proposes a method to determine the depth of objects in a scene using a combination of stereo vision and self-calibration techniques. By determining the relative distance between visualized objects and a robot equipped with a stereo head, it is possible to navigate in unknown environments. Stereo vision techniques supply a depth measure through the combination of two or more images of the same scene. To obtain depth estimates of the objects in the scene, a reconstruction of the scene geometry is necessary. Such a reconstruction requires relating the three-dimensional world coordinates to the two-dimensional image coordinates. Once the cameras' intrinsic parameters are known, this relationship between coordinate systems can be established. These parameters can be obtained through geometric camera calibration, which is generally done by correlating image features of a calibration pattern with known dimensions. Camera self-calibration allows the intrinsic parameters to be obtained without using a known calibration pattern, making it possible to compute and update them while the robot moves through an unknown environment. This work presents a self-calibration method based on three-dimensional polar coordinates to represent image features. This representation is determined by relating image features to the horizontal and vertical opening angles of the cameras. Using the polar coordinates, it is possible to geometrically reconstruct the scene. Through the proposed combination of techniques, it is possible to compute a depth estimate of the objects in a scene, allowing robot navigation in an unknown environment.
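The angle-based (polar) representation in this abstract suggests triangulating depth directly from viewing angles: for two cameras with parallel optical axes separated by baseline B, a feature seen at horizontal angles a_L and a_R (measured from each camera's optical axis, positive to the right) lies at depth Z = B / (tan a_L − tan a_R). A geometric sketch under those assumptions, not the thesis's exact formulation:

```python
import math

def triangulate_depth(baseline_m, angle_left_rad, angle_right_rad):
    """Depth of a feature from the horizontal viewing angles under which
    two cameras with parallel optical axes see it. With the left camera
    at the origin and the right camera offset by baseline_m, a point
    (X, Z) satisfies tan(a_L) = X/Z and tan(a_R) = (X - B)/Z, so
    Z = B / (tan(a_L) - tan(a_R))."""
    return baseline_m / (math.tan(angle_left_rad) - math.tan(angle_right_rad))

# A point at X = 1 m, Z = 4 m with a 0.5 m baseline is seen at
# a_L = atan(1/4) and a_R = atan(0.5/4); triangulation recovers Z = 4 m.
```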
44

Odhad hloubky pomocí konvolučních neuronových sítí / Depth Estimation by Convolutional Neural Networks

Ivanecký, Ján January 2016 (has links)
This thesis deals with depth estimation using convolutional neural networks. I propose a three-part model as a solution to this problem. The model contains a global context network, which estimates the coarse depth structure of the scene; a gradient network, which estimates depth gradients; and a refining network, which utilizes the outputs of the previous two networks to produce the final depth map. Additionally, I present a normalized loss function for training neural networks. Applying the normalized loss function results in better estimates of the scene's relative depth structure; however, it loses information about the absolute scale of the scene.
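One way to make a depth loss scale-invariant, in the spirit of the normalized loss described above, is to divide each depth map by its own mean before comparing; the loss then measures only relative depth structure and, as the abstract notes, discards the scene's absolute scale. A guess at the idea (the thesis's exact formulation may differ):

```python
import numpy as np

def normalized_depth_loss(pred, target, eps=1e-6):
    """L2 loss on mean-normalized depth maps. Dividing each map by its
    own mean makes the loss invariant to a global scaling of the scene,
    so the network is judged only on relative depth structure."""
    p = pred / (pred.mean() + eps)
    t = target / (target.mean() + eps)
    return float(np.mean((p - t) ** 2))
```

Scaling the prediction by any constant leaves the loss (nearly) zero, while adding a constant offset, which changes relative structure, is still penalized.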
45

Calibrage de caméra fisheye et estimation de la profondeur pour la navigation autonome

Brousseau, Pierre-André 08 1900 (has links)
This thesis focuses on the problems of calibrating wide-angle cameras and estimating depth from a single camera, stationary or in motion. The work carried out lies at the intersection of traditional 3D vision and new deep learning methods in the field of autonomous navigation. It aims to allow the detection of obstacles by a moving drone equipped with a single camera with a very wide field of view.
First, a new calibration method is proposed for fisheye cameras with a very large field of view, based on planar calibration with dense correspondences obtained by structured light, which can be modelled by a set of central virtual generic cameras. We demonstrate that this approach allows direct modeling of axial cameras, and validate it on synthetic and real data. Then, a method is proposed to estimate depth from a single image, using only the strong depth cues, the T-junctions. We demonstrate that deep learning methods are likely to learn the biases of their data sets and show a lack of invariance. Finally, we propose a method to estimate depth from a camera in free 6-DoF motion. This involves calibrating the fisheye camera on the drone, visual odometry, and solving for depth. The proposed methods allow obstacle detection for a drone.
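Why very wide fields of view need dedicated camera models: the pinhole projection maps a ray at incidence angle θ to image radius r = f·tan θ, which diverges as θ approaches 90°, while classic fisheye models such as the equidistant one use r = f·θ and stay finite across the whole hemisphere. These are standard textbook models shown for context only; they are not the generic virtual-camera calibration proposed in the thesis.

```python
import math

def project_radius(f_px, theta_rad, model="equidistant"):
    """Radial distance (in pixels) from the principal point at which a
    ray with incidence angle theta lands on the image plane.
    Pinhole: r = f * tan(theta), unusable near theta = 90 degrees.
    Equidistant fisheye: r = f * theta, finite over the hemisphere."""
    if model == "pinhole":
        return f_px * math.tan(theta_rad)
    return f_px * theta_rad

# At theta = 45 deg with f = 300 px, the pinhole model gives r = 300 px;
# the equidistant model still gives a finite radius even at theta = 90 deg.
```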
46

3D Object Detection based on Unsupervised Depth Estimation

Manoharan, Shanmugapriyan 25 January 2022 (has links)
Estimating depth and detecting object instances in 3D space are fundamental to autonomous navigation, localization and mapping, robotic object manipulation, and augmented reality. RGB-D images and LiDAR point clouds are the most illustrative formats of depth information. However, depth sensors have many shortcomings, such as low effective spatial resolution and capturing a scene from only a single perspective. This thesis focuses on reproducing a denser and more comprehensive 3D scene structure for given monocular RGB images using depth estimation and 3D object detection. The first contribution of this thesis is a pipeline for depth estimation based on an unsupervised learning framework. The thesis proposes two architectures to analyze structure-from-motion and 3D geometric constraint methods. The proposed architectures are trained and evaluated using only RGB images and no ground-truth depth data, and achieve better results than state-of-the-art methods. The second contribution is the application of the estimated depth map, which comprises two algorithms: point cloud generation and collision avoidance. The predicted depth map and the RGB image are used to generate point cloud data with the proposed point cloud algorithm. The collision avoidance algorithm predicts the possibility of collision and provides a collision warning message by decoding the color in the estimated depth map. This algorithm design is adaptable to different color maps with slight changes and perceives collision information across a sequence of frames. The third contribution is a two-stage pipeline to detect 3D objects from a monocular image. The first stage detects 2D objects and crops the corresponding image patch, which is provided as input to the second stage. In the second stage, a 3D regression network is trained to estimate the 3D bounding boxes of the target objects. Two architectures are proposed for this 3D regression network. This approach achieves better average precision than the state of the art for truncation of 15% or fully visible objects, and lower but comparable results for truncation of more than 30% or partly/fully occluded objects.
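The point cloud generation step described in this abstract typically amounts to back-projecting each pixel through the pinhole intrinsics: X = (u − cx)·Z/fx, Y = (v − cy)·Z/fy, with Z taken from the estimated depth map. A standard back-projection sketch (the thesis's own algorithm may differ in its details):

```python
import numpy as np

def depth_to_point_cloud(depth, fx, fy, cx, cy):
    """Back-project a dense depth map (H x W, metric depth per pixel)
    into an N x 3 point cloud using pinhole intrinsics:
        X = (u - cx) * Z / fx,  Y = (v - cy) * Z / fy,  Z = depth[v, u].
    Returns points in row-major pixel order."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # pixel coordinates
    z = depth
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=-1).reshape(-1, 3)
```

With a constant 2 m depth map and unit focal lengths, each pixel back-projects to a point whose X and Y grow linearly with its pixel offset from the principal point.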
47

Operational data extraction using visual perception

Shunmugam, Nagarajan January 2021 (has links)
The information era has led truck manufacturers and logistics solution providers to move towards software-as-a-service (SaaS) based solutions. With advancements in software technologies like artificial intelligence and deep learning, the domain of computer vision has achieved performance boosts significant enough to compete with hardware-based solutions. Firstly, data is collected from a large number of sensors, which increases production costs and the carbon footprint. Secondly, certain useful physical quantities/variables are impossible to measure, or measuring them turns out to be very expensive. In this dissertation, we therefore investigate the feasibility of providing a similar solution using a single sensor (the dashboard camera) to measure multiple variables, which remains sustainable even when scaled up to huge fleets. The video frames collected from the visual perception of the truck (i.e. its on-board camera) are processed with deep learning techniques, and operational data can be extracted. Techniques such as image classification and semantic segmentation were experimented with, and their outputs show potential to replace costly hardware counterparts such as LiDAR- or radar-based solutions.
49

Feature extraction on faces : from landmark localization to depth estimation

Honari, Sina 12 1900 (has links)
No description available.
50

Deep Convolutional Neural Networks for Real-Time Single Frame Monocular Depth Estimation

Schennings, Jacob January 2017 (has links)
Vision-based active safety systems have become increasingly common in modern vehicles, estimating the depth of the objects ahead for autonomous driving (AD) and advanced driver-assistance systems (ADAS). In this thesis, a lightweight deep convolutional neural network performing real-time depth estimation on single monocular images is implemented and evaluated. Many of the vision-based automatic brake systems in modern vehicles only detect pre-trained object types such as pedestrians and vehicles; they fail to detect general objects such as road debris and roadside obstacles. In stereo vision systems the problem is resolved by calculating a disparity image from the stereo image pair to extract depth information. The distance to an object can also be determined using radar and LiDAR systems. Using this depth information, the system performs the actions necessary to avoid collisions with objects that are determined to be too close. However, these systems are more expensive than a regular mono camera system and are therefore not very common in the average consumer car. By implementing robust depth estimation in mono vision systems, the benefits of active safety systems could be extended to a larger segment of the vehicle fleet. This could drastically reduce traffic accidents related to human error and possibly save many lives. The network architecture evaluated in this thesis is more lightweight than other CNN architectures previously used for monocular depth estimation, and is therefore preferable on computationally constrained systems. The network solves a supervised regression problem during training in order to produce a pixel-wise depth estimation map. It was trained using sparse ground-truth images with spatially incoherent and discontinuous data, and outputs a dense, spatially coherent and continuous depth map prediction. The spatially incoherent ground truth posed a problem of discontinuity that was addressed by a masked loss function with regularization. The network was able to predict a dense depth estimate on the KITTI dataset with close to state-of-the-art performance.
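The masked loss mentioned above can be sketched as follows: the squared error is summed only over pixels where the sparse ground truth is valid, so missing LiDAR returns do not penalize the dense prediction. A minimal sketch of the idea; the thesis additionally uses a regularization term, which is omitted here.

```python
import numpy as np

def masked_l2_loss(pred, target, valid_mask):
    """Mean squared error over valid ground-truth pixels only.
    valid_mask is 1 where sparse ground truth exists, 0 elsewhere;
    masked-out pixels contribute nothing to the loss, so the network
    is free to densify the prediction there."""
    diff = (pred - target) ** 2
    n = valid_mask.sum()
    return float((diff * valid_mask).sum() / max(n, 1))
```

With the invalid pixels masked out, a prediction that matches the ground truth wherever it exists incurs zero loss, however wrong it may be on the unobserved pixels; an unmasked loss would penalize those pixels too.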
