Global ETD Search

211	Acquiring 3D Full-body Motion from Noisy and Ambiguous Input Lou, Hui 2012 May 1900 (has links) Natural human motion is highly demanded and widely used in a variety of applications such as video games and virtual realities. However, acquisition of full-body motion remains challenging because the system must be capable of accurately capturing a wide variety of human actions and does not require a considerable amount of time and skill to assemble. For instance, commercial optical motion capture systems such as Vicon can capture human motion with high accuracy and resolution while they often require post-processing by experts, which is time-consuming and costly. Microsoft Kinect, despite its high popularity and wide applications, does not provide accurate reconstruction of complex movements when significant occlusions occur. This dissertation explores two different approaches that accurately reconstruct full-body human motion from noisy and ambiguous input data captured by commercial motion capture devices. The first approach automatically generates high-quality human motion from noisy data obtained from commercial optical motion capture systems, eliminating the need for post-processing. The second approach accurately captures a wide variety of human motion even under significant occlusions by using color/depth data captured by a single Kinect camera. The common theme that underlies two approaches is the use of prior knowledge embedded in pre-recorded motion capture database to reduce the reconstruction ambiguity caused by noisy and ambiguous input and constrain the solution to lie in the natural motion space. More specifically, the first approach constructs a series of spatial-temporal filter bases from pre-captured human motion data and employs them along with robust statistics techniques to filter noisy motion data corrupted by noise/outliers. The second approach formulates the problem in a Maximum a Posterior (MAP) framework and generates the most likely pose which explains the observations as well as consistent with the patterns embedded in the pre-recorded motion capture database. We demonstrate the effectiveness of our approaches through extensive numerical evaluations on synthetic data and comparisons against results created by commercial motion capture systems. The first approach can effectively denoise a wide variety of noisy motion data, including walking, running, jumping and swimming while the second approach is shown to be capable of accurately reconstructing a wider range of motions compared with Microsoft Kinect. motion capture, motion denoising motion tracking pose estimation data-driven priors optimization Principal Component Analysis depth camera
212	Statistical and geometric methods for visual tracking with occlusion handling and target reacquisition Lee, Jehoon 17 January 2012 (has links) Computer vision is the science that studies how machines understand scenes and automatically make decisions based on meaningful information extracted from an image or multi-dimensional data of the scene, like human vision. One common and well-studied field of computer vision is visual tracking. It is challenging and active research area in the computer vision community. Visual tracking is the task of continuously estimating the pose of an object of interest from the background in consecutive frames of an image sequence. It is a ubiquitous task and a fundamental technology of computer vision that provides low-level information used for high-level applications such as visual navigation, human-computer interaction, and surveillance system. The focus of the research in this thesis is visual tracking and its applications. More specifically, the object of this research is to design a reliable tracking algorithm for a deformable object that is robust to clutter and capable of occlusion handling and target reacquisition in realistic tracking scenarios by using statistical and geometric methods. To this end, the approaches developed in this thesis make extensive use of region-based active contours and particle filters in a variational framework. In addition, to deal with occlusions and target reacquisition problems, we exploit the benefits of coupling 2D and 3D information of an image and an object. In this thesis, first, we present an approach for tracking a moving object based on 3D range information in stereoscopic temporal imagery by combining particle filtering and geometric active contours. Range information is weighted by the proposed Gaussian weighting scheme to improve segmentation achieved by active contours. In addition, this work present an on-line shape learning method based on principal component analysis to reacquire track of an object in the event that it disappears from the field of view and reappears later. Second, we propose an approach to jointly track a rigid object in a 2D image sequence and to estimate its pose in 3D space. In this work, we take advantage of knowledge of a 3D model of an object and we employ particle filtering to generate and propagate the translation and rotation parameters in a decoupled manner. Moreover, to continuously track the object in the presence of occlusions, we propose an occlusion detection and handling scheme based on the control of the degree of dependence between predictions and measurements of the system. Third, we introduce the fast level-set based algorithm applicable to real-time applications. In this algorithm, a contour-based tracker is improved in terms of computational complexity and the tracker performs real-time curve evolution for detecting multiple windows. Lastly, we deal with rapid human motion in context of object segmentation and visual tracking. Specifically, we introduce a model-free and marker-less approach for human body tracking based on a dynamic color model and geometric information of a human body from a monocular video sequence. The contributions of this thesis are summarized as follows: 1. Reliable algorithm to track deformable objects in a sequence consisting of 3D range data by combining particle filtering and statistics-based active contour models. 2. Effective handling scheme based on object's 2D shape information for the challenging situations in which the tracked object is completely gone from the image domain during tracking. 3. Robust 2D-3D pose tracking algorithm using a 3D shape prior and particle filters on SE(3). 4. Occlusion handling scheme based on the degree of trust between predictions and measurements of the tracking system, which is controlled in an online fashion. 5. Fast level set based active contour models applicable to real-time object detection. 6. Model-free and marker-less approach for tracking of rapid human motion based on a dynamic color model and geometric information of a human body. Computer vision Visual tracking Image segmentation Pose estimation Shape analysis Particle filtering Active contours Image processing Pattern recognition systems Algorithms
213	Triangulation Based Fusion of Sonar Data with Application in Mobile Robot Mapping and Localization Wijk, Olle January 2001 (has links) No description available. mobile robots sensor fusion sonars odometry triangulation based fusion mapping occupancy grids pose tracking localization Kalman filter condensation navigation
214	Visual Perception of Objects and their Parts in Artificial Systems Schoeler, Markus 12 October 2015 (has links) No description available. 510 Object Recognition Object Segmentation Object Partitioning Function Recognition Pose Estimation Scene Segmentation Object Categorization Object Classification Informatik (PPN619939052)
215	Fusion de données visuo-inertielles pour l'estimation de pose et l'autocalibrage Scandaroli, Glauco Garcia 14 June 2013 (has links) (PDF) Les systèmes multi-capteurs exploitent les complémentarités des différentes sources sensorielles. Par exemple, le capteur visuo-inertiel permet d'estimer la pose à haute fréquence et avec une grande précision. Les méthodes de vision mesurent la pose à basse fréquence mais limitent la dérive causée par l'intégration des données inertielles. Les centrales inertielles mesurent des incréments du déplacement à haute fréquence, ce que permet d'initialiser la vision et de compenser la perte momentanée de celle-ci. Cette thèse analyse deux aspects du problème. Premièrement, nous étudions les méthodes visuelles directes pour l'estimation de pose, et proposons une nouvelle technique basée sur la corrélation entre des images et la pondération des régions et des pixels, avec une optimisation inspirée de la méthode de Newton. Notre technique estime la pose même en présence des changements d'illumination extrêmes. Deuxièmement, nous étudions la fusion des données a partir de la théorie de la commande. Nos résultats principaux concernent le développement d'observateurs pour l'estimation de pose, biais IMU et l'autocalibrage. Nous analysons la dynamique de rotation d'un point de vue non linéaire, et fournissons des observateurs stables dans le groupe des matrices de rotation. Par ailleurs, nous analysons la dynamique de translation en tant que système linéaire variant dans le temps, et proposons des conditions d'observabilité uniforme. Les analyses d'observabilité nous permettent de démontrer la stabilité uniforme des observateurs proposés. La méthode visuelle et les observateurs sont testés et comparés aux méthodes classiques avec des simulations et de vraies données visuo-inertielles. [SPI:OTHER] Engineering Sciences/Other Estimation d'état Observateurs d'état Observabilité Fonctions de Lyapunov Estimation de pose Calibrage caméra-centrale inertielle Vision par ordinateur
216	VALIDATION PLATFORM FOR ULTRASOUND-BASED MONITORING OF THERMAL ABLATION PEIKARI, HAMED 30 September 2011 (has links) PURPOSE: Thermal ablation therapy is an emerging local cancer treatment to destroy cancer tissue using heat. However variations in blood flow and energy absorption rates make it extremely challenging to monitor thermal changes. Insufficient ablation may lead to recurrence of the cancer while excessive ablation may damage adjacent healthy tissues. Ultrasound could be a convenient and inexpensive imaging modality for real-time monitoring of the ablation. For the development and optimization of these methods, it is essential to have ground truth data and a reliable and quantitative validation technique before beginning clinical trials on humans. In this dissertation, my primary focus was to solve the image-to-physical space registration problem using stereotactic fiducials that provide accurate correlation of ultrasound and pathology (ground truth) images. METHOD: A previously developed validation test-bed prototype was evaluated using phantom experiments to identify the shortcomings and limitations. In order to develop an improved validation platform, a simulator was implemented for evaluating registration methods as well as different line fiducial structures. New fiducial line structures were proposed, and new methods were implemented to overcome the limitations of the old system. The new methods were then tested using simulation results and phantom studies. Phantom experiments were conducted to improve the visibility of fiducials, as well as the quality of acquired ultrasound and pathology image datasets. RESULTS: The new system outperforms the previous one in terms of accuracy, robustness, and simplicity. The new registration method is robust to missing fiducials. I also achieved complete fiducial visibility in all images. Enhancing the tissue fixation medium improved the ultrasound data quality. The quality of pathology images were improved by a new imaging method. Simulation results show improvement in pose recovery accuracy using my proposed fiducial structure. This was validated by phantom studies reducing spatial misalignment between the US and pathology image sets. CONCLUSION: A new generation of test-bed was developed that provides a reliable and quantitative validation technique for evaluating and optimizing ablation monitoring methods. / Thesis (Master, Computing) -- Queen's University, 2011-09-29 20:31:55.159 Pathology Registration Ultrasound Line fiducial pattern Pose recovery Image-guided procedures Tumor ablation Validation platform Thermal therapy Medical imaging
217	Safe Robotic Manipulation to Extract Objects from Piles : From 3D Perception to Object Selection Mojtahedzadeh, Rasoul January 2016 (has links) This thesis is concerned with the task of autonomous selection of objects to remove (unload) them from a pile in robotic manipulation systems. Applications such as the automation of logistics processes and service robots require an ability to autonomously manipulate objects in the environment. A collapse of a pile of objects due to an inappropriate choice of the object to be removed from the pile cannot be afforded for an autonomous robotic manipulation system. This dissertation presents an indepth analysis of the problem and proposes methods and algorithms to empower robotic manipulation systems to select a safe object from a pile elaborately and autonomously. The contributions presented in this thesis are three-fold. First, a set of algorithms is proposed for extracting a minimal set of high level symbolic relations, namely, gravitational act and support relations, of physical interactions between objects composing a pile. The symbolic relations, extracted by a geometrical reasoning method and a static equilibrium analysis can be readily used by AI paradigms to analyze the stability of a pile and reason about the safest set of objects to be removed. Considering the problem of undetected objects and the uncertainty in the estimated poses as they exist in realistic perception systems, a probabilistic approach is proposed to extract the support relations and to make a probabilistic decision about the set of safest objects using notions from machine learning and decision theory. Second, an efﬁcient search based algorithm is proposed in an internal representation to automatically resolve the inter-penetrations between the shapes of objects due to errors in the poses estimated by an existing object detection module. Reﬁning the poses by resolving the inter-penetrations results in a geometrically consistent model of the environment, and was found to reduce the overall pose error of the objects. This dissertation presents the concept of minimum translation search for object pose reﬁnement and discusses a discrete search paradigm based on the concept of depth of penetration between two polyhedrons. Third, an application centric evaluation of ranging sensors for selecting a set of appropriate sensors for the task of object detection in the design process of a real-world robotics manipulation system is presented. The performance of the proposed algorithms are tested on data sets generated in simulation and from real-world scenarios. Object Selection Object Pose Refinement Gravitational Support Relation Inter-penetration Resolving 3D Ranging Sensor Evaluation Computer Science Datavetenskap (datalogi)
218	Modelování polohy hlavy pomocí stereoskopické rekonstrukce / Head pose estimation via stereoscopic reconstruction Hříbková, Veronika January 2018 (has links) The thesis deals with head pose estimation in stereo data. The theoretical part provides the basis for understanding the geometry of the camera, its parameters and the method of calibration. The following describes the principles of stereo analysis and creating of disparity maps. In the research section, the methods used for head pose modelling are presented and an analysis of selected published articles is given. In the course of the master’s thesis, a system of two cameras for stereoscopic acquisition of motion of the head was designed and several measurements were carried out. The obtained data was prepared for creation of disparity maps and further processing. Based on the detection of facial features, in particular the inner and outer corners of the eyes and corners of the mouth, and their correspondences, a simple geometric model in shape of triangle was created to illustrate the inclination of the facial plane in space. By computing the angle of inclination in three axes, the current head pose is obtained. Motion is modelled by tracking detected points during video sequences.
219	Stanovení pozice objektu / Detection of object position Baáš, Filip January 2019 (has links) Master’s thesis deals with object pose estimation using monocular camera. As an object is considered every rigid, shape fixed entity with strong edges, ideally textureless. Object position in this work is represented by transformation matrix, which describes object translation and rotation towards world coordinate system. First chapter is dedicated to explanation of theory of geometric transformations and intrinsic and extrinsic parameters of camera. This chapter also describes detection algorithm Chamfer Matching, which is used in this work. Second chapter describes all development tools used in this work. Third, fourth and fifth chapter are dedicated to practical realization of this works goal and achieved results. Last chapter describes created application, that realizes known object pose estimation in scene.
220	Ovládání desktopu pomocí senzoru Kinect / Kinect-Based Desktop Controller Czompál, Zsolt January 2012 (has links) The aim of this master's thesis was to create a user interface that allows controlling the PC using voice commands, static poses and dynamic gestures. It contains an implementation of a natural user interface that emulates the functionality of the keyboard and the mouse. In this thesis for detecting and tracking the user was used the sensor Kinect. For implementation I have chosen platform Windows and the Kinect for Windows SDK with programming language C#. During the implementation and testing of the interface, sensor Kinect for Xbox 360 was used. The result of this master's thesis is a natural user interface that allows controlling desktop applications without using keyboard or mouse, and that is comfortable and user-friendly.

Search results