Object recognition is one of the most important problems in computer vision. Traditional object recognition techniques are usually performed on optical images that are 2D projections of the 3D world. Information about the depth of objects in the scene is not provided explicitly in these images and thus, it makes 2D object recognition techniques sensitive to changes in illumination and shadowing. As surface acquisition methods such as LADAR or range scanners are becoming more popular, there is an increasing interest in the use of three-dimensional geometric data in object recognition to overcome these limitations. However, the matching of 3D free-form surfaces is also a difficult problem due to the shape and topological complexity of 3D surfaces. In addition, the problem is further complicated by other issues such as variations in surface sampling resolution, occlusion, clutter and sensor noise. The huge amount of information required to describe a 3D surface is also another challenge that 3D surface matching techniques have to deal with. This thesis investigates the problems of 3D surface matching that include 3D surface registration and object recognition from range images. It focuses on developing a novel and efficient framework for aligning 3D surfaces in different coordinate systems and from this, recognizing 3D models from scenes with high levels of occlusion and clutter using multi-scale local features. The first part of the thesis presents two different schemes for extracting salient geometric features from 3D surfaces using surface curvature measures known as the curvedness and shape index. By deriving the scale-space representation of the input surface, surface positions with high local curvature or high local shape variations are selected as features at various degrees of scale. One advantage of the proposed approaches is their applicability to both 3D meshes with connectivity information and unstructured point clouds. In the second part of the thesis, an application of the multi-scale feature extraction framework to 3D surface registration and object recognition is proposed. A Delaunay tetrahedrization is performed on the features extracted from each input range image to obtain a set of triangles. Possible correspondences are found by matching all possible pairs of triangles between the scene and model surfaces. From these correspondences, possible transformations between the two surfaces can be hypothesized and tested. In order to increase the accuracy and efficiency of the algorithm, various surface geometric and rigidity constraints are applied to prune unlikely correspondences. By finding the match that aligns the largest number of features between the two surfaces, the best transformation can be estimated. In the case of surface registration, this transformation can be used to coarse-align two different views of the same object. In the case of 3D object recognition, it provides information about the possible pose (location and orientation) of the model in the scene surface. Experimental results on a variety of 3D models and real scenes are shown to verify the effectiveness and robustness of the approach. / http://proxy.library.adelaide.edu.au/login?url= http://library.adelaide.edu.au/cgi-bin/Pwebrecon.cgi?BBID=1474505 / Thesis (M.App.Sc.) -- University of Adelaide, School of Electrical and Electronic Engineering, 2009
Identifer | oai:union.ndltd.org:ADTP/282027 |
Date | January 2009 |
Creators | Ho, Huy Tho |
Source Sets | Australiasian Digital Theses Program |
Detected Language | English |
Page generated in 0.0015 seconds