Global ETD Search

1	Pedestrian Detection Based on Data and Decision Fusion Using Stereo Vision and Thermal Imaging Sun, Roy 25 April 2016 (has links) Pedestrian detection is a canonical instance of object detection that remains a popular topic of research and a key problem in computer vision due to its diverse applications. These applications have the potential to positively improve the quality of life. In recent years, the number of approaches to detecting pedestrians in monocular and binocular images has grown steadily. However, the use of multispectral imaging is still uncommon. This thesis work presents a novel approach to data and feature fusion of a multispectral imaging system for pedestrian detection. It also includes the design and building of a test rig which allows for quick data collection of real-world driving. An application of the mathematical theory of trifocal tensor is used to post process this data. This allows for pixel level data fusion across a multispectral set of data. Performance results based on commonly used SVM classification architectures are evaluated against the collected data set. Lastly, a novel cascaded SVM architecture used in both classification and detection is discussed. Performance improvements through the use of feature fusion is demonstrated. pedestrian detection data fusion trifocal tensor feature fusion decision fusion stereo vision thermal vision
2	Uncalibrated Vision-Based Control and Motion Planning of Robotic Arms in Unstructured Environments Shademan, Azad Unknown Date No description available. Robotics Visual Servoing Motion Planning Robust Statistics Vision-Based Control Uncalibrated Trifocal Tensor Three-View Geometry
3	Avancements dans l'estimation de pose et la reconstruction 3D de scènes à 2 et 3 vues / Advances on Pose Estimation and 3D Resconstruction of 2 and 3-View Scenes Fernandez Julia, Laura 13 December 2018 (has links) L'étude des caméras et des images a été un sujet prédominant depuis le début de la vision par ordinateur, l'un des principaux axes étant l'estimation de la pose et la reconstruction 3D. Le but de cette thèse est d'aborder et d'étudier certains problèmes et méthodes spécifiques du pipeline de la structure-from-motion afin d'améliorer la précision, de réaliser de vastes études pour comprendre les avantages et les inconvénients des modèles existants et de créer des outils mis à la disposition du public. Plus spécifiquement, nous concentrons notre attention sur les pairs stéréoscopiques et les triplets d'images et nous explorons certaines des méthodes et modèles capables de fournir une estimation de la pose et une reconstruction 3D de la scène.Tout d'abord, nous abordons la tâche d'estimation de la profondeur pour les pairs stéréoscopiques à l'aide de la correspondance de blocs. Cette approche suppose implicitement que tous les pixels du patch ont la même profondeur, ce qui produit l'artefact commun dénommé "foreground-fattening effect". Afin de trouver un support plus approprié, Yoon et Kweon ont introduit l'utilisation de poids basés sur la similarité des couleurs et la distance spatiale, analogues à ceux utilisés dans le filtre bilatéral. Nous présentons la théorie de cette méthode et l'implémentation que nous avons développée avec quelques améliorations. Nous discutons de quelques variantes de la méthode et analysons ses paramètres et ses performances.Deuxièmement, nous considérons l'ajout d'une troisième vue et étudions le tenseur trifocal, qui décrit les contraintes géométriques reliant les trois vues. Nous explorons les avantages offerts par cet opérateur dans la tâche d'estimation de pose d'un triplet de caméras par opposition au calcul des poses relatives paire par paire en utilisant la matrice fondamentale. De plus, nous présentons une étude et l’implémentation de plusieurs paramétrisations du tenseur. Nous montrons que l'amélioration initiale de la précision du tenseur trifocal n'est pas suffisante pour avoir un impact remarquable sur l'estimation de la pose après ajustement de faisceau et que l'utilisation de la matrice fondamentale avec des triplets d'image reste pertinente.Enfin, nous proposons d'utiliser un modèle de projection différent de celui de la caméra à sténopé pour l'estimation de la pose des caméras en perspective. Nous présentons une méthode basée sur la factorisation matricielle due à Tomasi et Kanade qui repose sur la projection orthographique. Cette méthode peut être utilisée dans des configurations où d'autres méthodes échouent, en particulier lorsque l'on utilise des caméras avec des objectifs à longue distance focale. La performance de notre implémentation de cette méthode est comparée à celle des méthodes basées sur la perspective, nous considérons que l'exactitude obtenue et la robustesse démontré en font un élément à considérer dans toute procédure de la SfM / The study of cameras and images has been a prominent subject since the beginning of computer vision, one of the main focus being the pose estimation and 3D reconstruction. The goal of this thesis is to tackle and study some specific problems and methods of the structure-from-motion pipeline in order to provide improvements in accuracy, broad studies to comprehend the advantages and disadvantages of the state-of-the-art models and useful implementations made available to the public. More specifically, we center our attention to stereo pairs and triplets of images and discuss some of the methods and models able to provide pose estimation and 3D reconstruction of the scene.First, we address the depth estimation task for stereo pairs using block-matching. This approach implicitly assumes that all pixels in the patch have the same depth producing the common artifact known as the ``foreground fattening effect''. In order to find a more appropriate support, Yoon and Kweon introduced the use of weights based on color similarity and spatial distance, analogous to those used in the bilateral filter. We present the theory of this method and the implementation we have developed with some improvements. We discuss some variants of the method and analyze its parameters and performance.Secondly, we consider the addition of a third view and study the trifocal tensor, which describes the geometric constraints linking the three views. We explore the advantages offered by this operator in the pose estimation task of a triplet of cameras as opposed to computing the relative poses pair by pair using the fundamental matrix. In addition, we present a study and implementation of several parameterizations of the tensor. We show that the initial improvement in accuracy of the trifocal tensor is not enough to have a remarkable impact on the pose estimation after bundle adjustment and that using the fundamental matrix with image triplets remains relevant.Finally, we propose using a different projection model than the pinhole camera for the pose estimation of perspective cameras. We present a method based on the matrix factorization due to Tomasi and Kanade that relies on the orthographic projection. This method can be used in configurations where other methods fail, in particular, when using cameras with long focal length lenses. The performance of our implementation of this method is compared to that given by the perspective-based methods, we consider that the accuracy achieved and its robustness make it worth considering in any SfM procedure Tenseur trifocal Reconstruction 3D Projection orthographique Stereovision Estimation de pose Stereovision Pose estimation Orthographic projection 3D reconstruction Trifocal tensor
4	Rekonstrukce 3D objektů z více pohledů / Structure From Motion From Multiple Views Mrkvička, Daniel January 2019 (has links) This thesis deals with the reconstruction of the scene using two or more images. It describes the whole reconstruction process consisting of detecting points in images, finding the appropriate geometry between images and resulting projection of these points into the space of scene. The thesis also includes a description of the application, which demonstrates the described methods.
5	Fundamental numerical schemes for parameter estimation in computer vision. Scoleri, Tony January 2008 (has links) An important research area in computer vision is parameter estimation. Given a mathematical model and a sample of image measurement data, key parameters are sought to encapsulate geometric properties of a relevant entity. An optimisation problem is often formulated in order to find these parameters. This thesis presents an elaboration of fundamental numerical algorithms for estimating parameters of multi-objective models of importance in computer vision applications. The work examines ways to solve unconstrained and constrained minimisation problems from the view points of theory, computational methods, and numerical performance. The research starts by considering a particular form of multi-equation constraint function that characterises a wide class of unconstrained optimisation tasks. Increasingly sophisticated cost functions are developed within a consistent framework, ultimately resulting in the creation of a new iterative estimation method. The scheme operates in a maximum likelihood setting and yields near-optimal estimate of the parameters. Salient features of themethod are that it has simple update rules and exhibits fast convergence. Then, to accommodate models with functional dependencies, two variant of this initial algorithm are proposed. These methods are improved again by reshaping the objective function in a way that presents the original estimation problem in a reduced form. This procedure leads to a novel algorithm with enhanced stability and convergence properties. To extend the capacity of these schemes to deal with constrained optimisation problems, several a posteriori correction techniques are proposed to impose the so-called ancillary constraints. This work culminates by giving two methods which can tackle ill-conditioned constrained functions. The combination of the previous unconstrained methods with these post-hoc correction schemes provides an array of powerful constrained algorithms. The practicality and performance of themethods are evaluated on two specific applications. One is planar homography matrix computation and the other trifocal tensor estimation. In the case of fitting a homography to image data, only the unconstrained algorithms are necessary. For the problem of estimating a trifocal tensor, significant work is done first on expressing sets of usable constraints, especially the ancillary constraints which are critical to ensure that the computed object conforms to the underlying geometry. Evidently here, the post-correction schemes must be incorporated in the computational mechanism. For both of these example problems, the performance of the unconstrained and constrained algorithms is compared to existing methods. Experiments reveal that the new methods perform with high accuracy to match a state-of-the-art technique but surpass it in execution speed. / Thesis (Ph.D.) - University of Adelaide, School of Mathemtical Sciences, Discipline of Pure Mathematics, 2008 Computer vision. Computer vision -- Mathematical models. Parameter estimation -- Data processing. Tensor algebra.
6	三焦張量在多視角幾何中的計算與應用 / Computation and Applications of Trifocal Tensor in Multiple View Geometry 李紹暐, Li, Shau Wei Unknown Date (has links) 電腦視覺三維建模的精確度，仰賴影像中對應點的準確性。以前的研究大多採取兩張影像，透過極線轉換(epipolar transfer)取得影像間基礎矩陣(fundamental matrix)的關係，然後進行比對或過濾不良的對應點以求取精確的對應點。然極線轉換存在退化的問題，如何避免此退化問題以及降低兩張影像之間轉換錯誤的累積，成為求取精確三維建模中極待解決的課題。本論文中，我們提出一套機制，透過三焦張量(trifocal tensor)的觀念來過濾影像間不良的對應點，提高整體對應點的準確度，從而能計算較精確的投影矩陣進行三維建模。我們由多視角影像出發，先透過Bundler求取對應點，然後採用三焦張量過濾Bundler產生的對應點，並輔以最小中值平方法(LMedS)提升選點之準確率，再透過權重以及重複過濾等機制來調節並過濾對應點，從而取得精確度較高的對應點組合，最後求取投影矩陣進行電腦視覺中的各項應用。實作中，我們測詴了三組資料，包含一組以3ds Max自行建置的資料與兩組網路中取得的資料。我們先從三張影像驗證三焦張量的幾何特性與其過濾對應點的可行性，再將此方法延伸至多張影像，同樣也能證實透過三焦張量確實能提升對應點的準確度，甚至可以過濾出輸入資料中較不符合彼此間幾何性的影像。 / The accuracy of 3D model constructions in computer vision depends on the accuracy of the corresponding points extracted from the images. Previous studies in this area mostly use two images and compute the fundamental matrix through the use of the epipolar geometry and then proceed for corresponding point matching and filtering out the outliers in order to get accurate corresponding points. However, the epipoler transform suffers from the degenerate problems and, also, the accumulated conversion errors during the corresponding matches both will degrade the model accuracy. Solving these problems become crucial in reconstructing accurate 3D models from multiple images. In this thesis, we proposed a mechanism to obtain accurate corresponding points for 3D model reconstruction from multiple images. The concept of trifocal tensor is used to remove the outliers in order to improve the overall accuracy of the corresponding points. We first use Bundler to search the corresponding points in the feature points extracted from multiple view images. Then we use trifocal tensor to determine and remove the outliers in the corresponding points generated by Bundler. LMedS is used in these processes to improve the accuracy of the selected points. One can also improve the accuracy of the corresponding points through the use of weighting function as well as repeated filtering mechanism. With these high precision corresponding points, we can compute more accurate fundamental matrix in order to reconstruct the 3D models and other applications in computer vision. We have tested three sets of data, one of that is self-constructed data using the 3ds Max and the other two are downloaded from the internet. We started by demonstrating the geometric properties of trifocal tensor associated with three images and showed that it can be used to filter out the bad corresponding points. Then, we successfully extended this mechanism to more images and successfully improved the accuracy of the corresponding points among these images. 三焦張量多視角影像極線轉換最小中值平方法投影矩陣 trifocal tensor multiple view images epipolar transfer LMedS projection matrix
7	以四旋翼UAS酬載熱感測器製作數值表面溫度模型供地溫研究 / Generation of digital surface temperature model from images collected by thermal sensor on quadcopter UAS for geothermal study 謝耀震, Hsieh, Yao-Chen Unknown Date (has links) 熱像儀，能感測可見光感測器無法取得的訊息，因此若能透過熱像儀器進行環境偵測，便能得到一般可見光感測器無法獲取的資料。本研究擬以四旋翼UAS酬載熱像儀得到局部區域高解析度之地面熱資訊以便作為地溫研究之背景資料使用。而一般地溫研究區，不易佈設控制點，因此本研究除於無人機上酬載熱像儀之外，並將搭載Trimble BD970 GNSS OEM接收模組，嘗試以少量地面控制點、以及GNSS動態後處理的方式取得取像時對應的GNSS觀測量輔助熱像定位定向。本研究中針對國立政治大學旁的指南溪實驗區與陽明山國家公園的小油坑實驗區，使用AI-RIDER YJ-1000-HC四旋翼UAS分別酬載熱像儀FLIR Tau 640和巨哥XM6，並且同時搭載Trimble BD970 GNSS OEM接收模組、以及GNSS動態後處理的方式取得取像時對應的GNSS觀測量搭配少量地面控制點輔助熱像定位定向，過程中透過三焦張量剔除自動匹配之誤匹配連結點。實驗結果顯示，兩實驗區所產製之DSM於不易變動區域精度經現有資料檢核均在±1m，而指南溪實驗區產製出地面解析度11公分的數值表面模型(Digital Surface Model, DSM)與正射熱像，且正射熱像平面精度達為47公分；小油坑實驗區產製出地面解析度14公分之DSM與正射熱像，正射熱像平面精度則為67公分，雖然DSM和正射熱像精度無法符合一般常規的測量規範，但成果仍然可以證明熱像直接產製DSM以及正射熱像之可行性，兩實驗區最後皆生成數值溫度表面模型(Digital Surface Temparature Model, DSTM)，顯示本研究所提方法之可行性，所生成之成果可供後續地溫研究使用。 / Thermal infrared images show the temperature change of sensed scenes. Therefore, thermal infrared camera can sense some important information that optical digital cameras cannot do for the environment monitoring. In this study, the Quadcopter UAS for thermal image collection applied to geothermal study will be developed. FIIR Tau 640 and Magnity Eletric XM6 thermal infrared sensor will be used in this thermal image collection system separately two test areas, Zhinan River nearby NCCU and Xiaoyoukeng, in the Yangmingshan National Park. Additionally, Trimble BD970 GNSS OEM board will be carried on the Quadcopter UAS to collect dual-frequency GNSS observations for determining the flying trajectory by Post-processed kinematic (PPK) technique to support the positioning and orientating of collected thermal images, and the trifocal tensor will be used to delete wrong matching tie images points. From the tests, the differences between produced DSM and existing DSM data are ± 1 m on uneasy change ground surface in two test areas. The resolution of produced DSM and thermal orthoimages are about 11 cm in Zhinan River, and 14cm in Xiaoyoukeng area. The accuracy of thermal orthoimages is 47cm in Zhinan River and 67cm in Xiaoyoukeng area. The accuracy of thermal orthoimages may not comply with a normal surveying standard, but it proves the possibility of DSM and orthorectifed thermal images generated from thermal images directly. Digital Surface Temparature Model (DSTM) produced in both tests can be used for volcanic geothermal monitoring in the future. 無人機熱像定位定向光束法空三平差熱像儀率定三焦張量 Unmanned aircraft system Thermal images Positioning and orientation Bundle adjustment aerial triangulation Thermal camera calibration Trifocal tensor

1

Page generated in 2.6871 seconds