Global ETD Search

161	Design of an Aquatic Quadcopter with Optical Wireless Communications Unknown Date (has links) With a focus on dynamics and control, an aquatic quadcopter with optical wireless communications is modeled, designed, constructed, and tested. Optical transmitter and receiver circuitry is designed and discussed. By utilization of the small angle assumption, the nonlinear dynamics of quadcopter movement are linearized around an equilibrium state of zero motion. The set of equations are then tentatively employed beyond limit of the small angle assumption, as this work represents an initial explorative study. Specific constraints are enforced on the thrust output of all four rotors to reduce the multiple-input multiple-output quadcopter dynamics to a set of single-input single-output systems. Root locus and step response plots are used to analyze the roll and pitch rotations of the quadcopter. Ultimately a proportional integral derivative based control system is designed to control the pitch and roll. The vehicle’s yaw rate is similarly studied to develop a proportional controller. The prototype is then implemented via an I2C network of Arduino microcontrollers and supporting hardware. / Includes bibliography. / Thesis (M.S.)--Florida Atlantic University, 2016. / FAU Electronic Theses and Dissertations Collection Wireless communication systems. Artificial intelligence. Optical pattern recognition. Computer vision.
162	Calculating degenerate structures via convex optimization with applications in computer vision and pattern recognition. / CUHK electronic theses & dissertations collection January 2012 (has links) 在諸多電腦視覺和模式識別的問題中，採集到的圖像和視頻資料通常是高維的。直接計算這些高維資料常常面臨計算可行性和穩定性等方面的困難。然而，現實世界中的資料通常由少數物理因素產生，因而本質上存在退化的結構。例如，它們可以用子空間、子空間的集合、流形或者分層流形等模型來描述。計算並運用這些內在退化結構不僅有助於深入理解問題的本質，而且能夠幫助解決實際應用中的難題。 / 隨著近些年凸優化理論和應用的發展，一些NP難題諸如低稚矩陣的計算和稀疏表示的問題已經有了近乎完美和高效的求解方法。本論文旨在研究如何應用這些技術來計算高維資料中的退化結構，並著重研究子空間和子空間的集合這兩種結構，以及它們在現實應用方面的意義。這些應用包括:人臉圖像的配准、背景分離以及自動植物辨別。 / 在人臉圖像配准的問題中，同一人臉在不同光照下的面部圖像經過逐圖元配准後應位於一個低維的子空間中。基於此假設，我們提出了一個新的圖像配准方法，能夠對某未知人臉的多副不同光照、表情和姿態下的圖像進行聯合配准，使得每一幅面部圖像的圖元與事先訓練的一般人臉模型相匹配。其基本思想是追尋一個低維的且位於一般人臉子空間附近的仿射子空間。相比于傳統的基於外觀模型的配准方法(例如主動外觀模型)依賴于準確的外觀模型的缺點，我們提出的方法僅需要一個一般人臉模型就可以很好地對該未知人臉的多副圖像進行聯合配准，即使該人臉與訓練該模型的樣本相差很大。實驗結果表明，該方法的配准精度在某些情況下接近于理想情形，即：當該目標人臉的模型事先已知時，傳統方法所能夠達到的配准精度。 / In a wide range of computer vision and pattern recognition problems, the captured images and videos often live in high-dimensional observation spaces. Directly computing them may suffer from computational infeasibility and numerical instability. On the other hand, the data in the real world are often generated due to limited number of physical causes, and thus embed degenerate structures in the nature. For instance, they can be modeled by a low-dimensional subspace, a union of subspaces, a manifold or even a manifold stratification. Discovering and harnessing such intrinsic structures not only brings semantic insight into the problems at hand, but also provides critical information to overcome challenges encountered in the practice. / Recent years have witnessed great development in both the theory and application of convex optimization. Efficient and elegant solutions have been found for NP-hard problems such as low-rank matrix recovery and sparse representation. In this thesis, we study the problem of discovering degenerate structures of high-¬dimensional inputs using these techniques. Especially we focus ourselves on low-dimensional subspaces and their unions, and address their application in overcoming the challenges encoun-tered under three practical scenarios: face image alignment, background subtraction and automatic plant identification. / In facial image alignment, we propose a method that jointly brings multiple images of an unseen face into alignment with a pre-trained generic appearance model despite different poses, expressions and illumination conditions of the face in the images. The idea is to pursue an intrinsic affine subspace of the target face that is low-dimensional while at the same time lies close to the generic subspace. Compared with conventional appearance-based methods that rely on accurate appearance mod-els, ours works well with only a generic one and performs much better on unseen faces even if they significantly differ from those for training the generic model. The result is approximately good as that in an idealistic case where a specific model for the target face is provided. / For background subtraction, we propose a background model that captures the changes caused by the background switching among a few configurations, like traffic lights statuses. The background is modeled as a union of low-dimensional subspaces, each characterizing one configuration of the background, and the proposed algorithm automatically switches among them and identifies violating elements as foreground pixels. Moreover, we propose a robust learning approach that can work with foreground-present training samples at the background modeling stage it builds a correct background model with outlying foreground pixels automatically pruned out. This is practically important when foreground-free training samples are difficult to obtain in scenarios such as traffic monitoring. / For automatic plant identification, we propose a novel and practical method that recognizes plants based on leaf shapes extracted from photographs. Different from existing studies that are mostly focused on simple leaves, the proposed method is de-signed to recognize both simple and compound leaves. The key to that is, instead of either measuring geometric features or matching shape features as in conventional methods, we describe leaves by counting on them the numbers of certain shape patterns. The patterns are learned in a way that they form a degenerate polytope (a spe-cial union of affine subspaces) in the feature space, and can simulate, to some extent, the "keys" used by botanists - each pattern reflects a common feature of several dif-ferent species and all the patterns together can form a discriminative rule for recog-nition. Experiments conducted on a variety of datasets show that our algorithm sig-nificantly outperforms the state-of-art methods in terms of recognition accuracy, ef-ficiency and storage, and thus has a good promise for practicing. / In conclusion, our performed studies show that: 1) the visual data with semantic meanings are often not random - although they can be high-dimensional, they typically embed degenerate structures in the observation space. 2) With appropriate assumptions made and clever computational tools developed, these structures can be efficiently and stably calculated. 3) The employment of these intrinsic structures helps overcoming practical challenges and is critical for computer vision and pattern recognition algorithms to achieve good performance. / Detailed summary in vernacular field only. / Detailed summary in vernacular field only. / Detailed summary in vernacular field only. / 在背景分離的問題中，靜態場景在不同光照情形下的背景可以被描述為一個線性子空間。然而在實際應用中，背景的局部和突然的變化有可能違背此假設，尤其是當背景在幾個狀態之間切換的情形下，例如交通燈在不同組合狀態之間切換。為了解決該問題，本論文中提出了一個新的背景模型，它將背景描述為一些子空間的集合，每個子空間對應一個背景狀態。我們將背景分離的問題轉化為稀疏逼近的問題，因此演算法能夠自動在多個狀態中切換並成功檢測出前景物體。此外，本論文提出了一個魯棒的字典學習方法。在訓練背景模型的過程中，它能夠處理含有前景物體的圖像，並在訓練過程中自動將前景部分去掉。這個優點在難以收集完整背景訓練樣本的應用情形(譬如交通監視等)下有明顯的優勢。 / 在植物種類自動辨別的問題中，本論文中提出了一個新的有效方法，它通過提取和對比植物葉片的輪廓對植物進行識別和分類。不同于傳統的基於測量幾何特徵或者在形狀特徵之間配對的方法，我們提出使用葉子上某些外形模式的數量來表達樹葉。這些模式在特徵空間中形成一個退化的多面體結構(一種特殊的仿射空間的集合)，而且在某種程度上能夠類比植物學中使用的分類檢索表每個模式都反映了一些不同植物的某個共性，例如某種邊緣、某種形狀、某種子葉的佈局等等;而所有模式組合在一起能夠形成具有很高區分度的分類準則。通過對演算法在四個數據庫上的測試，我們發現本論文提出的方法無論在識別精度還是在效率和存儲方面都相比于目前主流方法有顯著提高，因此具有很好的應用性。 / 總之，我們進行的一些列研究說明:(1) 有意義的視覺資料通常是內在相關的，儘管它們的維度可能很高，但是它們通常都具有某種退化的結構。(2) 合理的假設和運用計算工具可以高效、穩健地發現這些結構。(3) 利用這些結構有助於解決實際應用中的難題，且能夠使得電腦視覺和模式識別演算法達到好的性能。 / Zhao, Cong. / Thesis (Ph.D.)--Chinese University of Hong Kong, 2012. / Includes bibliographical references (leaves 107-121). / Electronic reproduction. Hong Kong : Chinese University of Hong Kong, [2012] System requirements: Adobe Acrobat Reader. Available via World Wide Web. / Abstract also in Chinese. / Dedication --- p.i / Acknowledgements --- p.ii / Abstract --- p.v / Abstract (in Chinese) --- p.viii / Publication List --- p.xi / Nomenclature --- p.xii / Contents --- p.xiv / List of Figures --- p.xviii / Chapter Chapter 1 --- Introduction --- p.1 / Chapter 1.1 --- Motivation --- p.1 / Chapter 1.2 --- Background --- p.2 / Chapter 1.2.1 --- Subspaces --- p.3 / Chapter 1.2.2 --- Unions of Subspaces --- p.6 / Chapter 1.2.3 --- Manifolds and Stratifications --- p.8 / Chapter 1.3 --- Thesis Outline --- p.10 / Chapter Chapter 2 --- Joint Face Image Alignment --- p.13 / Chapter 2.1 --- Introduction --- p.14 / Chapter 2.2 --- Related Works --- p.16 / Chapter 2.3 --- Background --- p.18 / Chapter 2.3.1 --- Active Appearance Model --- p.18 / Chapter 2.3.2 --- Multi-Image Alignment using AAM --- p.20 / Chapter 2.3.3 --- Limitations in Practice --- p.21 / Chapter 2.4 --- The Proposed Method --- p.23 / Chapter 2.4.1 --- Two Important Assumptions --- p.23 / Chapter 2.4.2 --- The Subspace Pursuit Problem --- p.27 / Chapter 2.4.3 --- Reformulation --- p.27 / Chapter 2.4.4 --- Efficient Solution --- p.30 / Chapter 2.4.5 --- Discussions --- p.32 / Chapter 2.5 --- Experiments --- p.34 / Chapter 2.5.1 --- Settings --- p.34 / Chapter 2.5.2 --- Results and Discussions --- p.36 / Chapter 2.6 --- Summary --- p.38 / Chapter Chapter 3 --- Background Subtraction --- p.40 / Chapter 3.1 --- Introduction --- p.41 / Chapter 3.2 --- Related Works --- p.43 / Chapter 3.3 --- The Proposed Method --- p.48 / Chapter 3.3.1 --- Background Modeling --- p.48 / Chapter 3.3.2 --- Background Subtraction --- p.49 / Chapter 3.3.3 --- Foreground Object Detection --- p.52 / Chapter 3.3.4 --- Background Modeling by Dictionary Learning --- p.53 / Chapter 3.4 --- Robust Dictionary Learning --- p.54 / Chapter 3.4.1 --- Robust Sparse Coding --- p.56 / Chapter 3.4.2 --- Robust Dictionary Update --- p.57 / Chapter 3.5 --- Experimentation --- p.59 / Chapter 3.5.1 --- Local and Sudden Changes --- p.59 / Chapter 3.5.2 --- Non-structured High-frequency Changes --- p.62 / Chapter 3.5.3 --- Discussions --- p.65 / Chapter 3.6 --- Summary --- p.66 / Chapter Chapter 4 --- Plant Identification using Leaves --- p.67 / Chapter 4.1 --- Introduction --- p.68 / Chapter 4.2 --- Related Works --- p.70 / Chapter 4.3 --- Review of IDSC Feature --- p.71 / Chapter 4.4 --- The Proposed Method --- p.73 / Chapter 4.4.1 --- Independent-IDSC Feature --- p.75 / Chapter 4.4.2 --- Common Shape Patterns --- p.77 / Chapter 4.4.3 --- Leaf Representation by Counts --- p.80 / Chapter 4.4.4 --- Leaf Recognition by NN Classifier --- p.82 / Chapter 4.5 --- Experiments --- p.82 / Chapter 4.5.1 --- Settings --- p.82 / Chapter 4.5.2 --- Performance --- p.83 / Chapter 4.5.3 --- Shared Dictionaries v.s. Shared Features --- p.88 / Chapter 4.5.4 --- Pooling --- p.89 / Chapter 4.6 --- Discussions --- p.90 / Chapter 4.6.1 --- Time Complexity --- p.90 / Chapter 4.6.2 --- Space Complexity --- p.91 / Chapter 4.6.3 --- System Description --- p.92 / Chapter 4.7 --- Summary --- p.92 / Chapter 4.8 --- Acknowledgement --- p.94 / Chapter Chapter 5 --- Conclusion and Future Work --- p.95 / Chapter 5.1 --- Thesis Contributions --- p.95 / Chapter 5.2 --- Future Work --- p.97 / Chapter 5.2.1 --- Theory Side --- p.98 / Chapter 5.2.2 --- Practice Side --- p.98 / Chapter Appendix-I --- Joint Face Alignment Results --- p.100 / Bibliography --- p.107 Computer vision--Mathematical models Pattern recognition systems Optical pattern recognition Mathematical optimization Convex functions Plants--Identification--Data processing
163	High-level, part-based features for fine-grained visual categorization Berg, Thomas January 2017 (has links) Object recognition--"What is in this image?"--is one of the basic problems of computer vision. Most work in this area has been on finding basic-level object categories such as plant, car, and bird, but recently there has been an increasing amount of work in fine-grained visual categorization, in which the task is to recognize subcategories of a basic-level category, such as blue jay and bluebird. Experimental psychology has found that while basic-level categories are distinguished by the presence or absence of parts (a bird has a beak but car does not), subcategories are more often distinguished by the characteristics of their parts (a starling has a narrow, yellow beak while a cardinal has a wide, red beak). In this thesis we tackle fine-grained visual categorization, guided by this observation. We develop alignment procedures that let us compare corresponding parts, build classifiers tailored to finding the interclass differences at each part, and then combine the per-part classifiers to build subcategory classifiers. Using this approach, we outperform previous work in several fine-grained categorization settings: bird species identification, face recognition, and face attribute classification. In addition, the construction of subcategory classifiers from part classifiers allows us to automatically determine which parts are most relevant when distinguishing between any two subcategories. We can use this to generate illustrations of the differences between subcategories. To demonstrate this, we have built a digital field guide to North American birds which includes automatically generated images highlighting the key differences between visually similar species. This guide, "Birdsnap," also identifies bird species in users' uploaded photos using our subcategory classifiers. We have released Birdsnap as a web site and iPhone application. Optical pattern recognition Optical data processing Computer vision Computer vision--Computer programs Visual perception--Data processing Computer science
164	Study Of Shear In Dry Granular Flows Through Vertical Channels Moka, Sudheshna 01 1900 (has links) (PDF) No description available. Flow Visualization Optical Pattern Recognition Granular Flows Shear Layers Image Processing Vertical Channels Particle Tracking Velocimetry (PVC) Applied Mechanics
165	Global Optimizing Flows for Active Contours Sundaramoorthi, Ganesh 09 July 2007 (has links) This thesis makes significant contributions to the object detection problem in computer vision. The object detection problem is, given a digital image of a scene, to detect the relevant object in the image. One technique for performing object detection, called ``active contours,' optimizes a constructed energy that is defined on contours (closed curves) and is tailored to image features. An optimization method can be used to perform the optimization of the energy, and thereby deform an initially placed contour to the relevant object. The typical optimization technique used in almost every active contour paper is evolving the contour by the energy's gradient descent flow, i.e., the steepest descent flow, in order to drive the initial contour to (hopefully) the minimum curve. The problem with this technique is that often times the contour becomes stuck in a sub-optimal and undesirable local minimum of the energy. This problem can be partially attributed to the fact that the gradient flows of these energies make use of only local image and contour information. By local, we mean that in order to evolve a point on the contour, only information local to that point is used. Therefore, in this thesis, we introduce a new class of flows that are global in that the evolution of a point on the contour depends on global information from the entire curve. These flows help avoid a number of problems with traditional flows including helping in avoiding undesirable local minima. We demonstrate practical applications of these flows for the object detection problem, including applications to both image segmentation and visual object tracking. Active contours Image segmentation Visual tracking PDE Gradient flows Image analysis Data processing Optical pattern recognition Computer vision Mathematical optimization
166	Statistical and geometric methods for shape-driven segmentation and tracking Dambreville, Samuel 05 March 2008 (has links) Computer Vision aims at developing techniques to extract and exploit information from images. The successful applications of computer vision approaches are multiple and have benefited diverse fields such as manufacturing, medicine or defense. Some of the most challenging tasks performed by computer vision systems are arguably segmentation and tracking. Segmentation can be defined as the partitioning of an image into homogeneous or meaningful regions. Tracking also aims at extracting meaning or information from images, however, it is a dynamic task that operates on temporal (video) sequences. Active contours have been proven to be quite valuable at performing the two aforementioned tasks. The active contours framework is an example of variational approaches, in which a problem is compactly (and elegantly) described and solved in terms of energy functionals. The objective of the proposed research is to develop statistical and shape-based tools inspired from or completing the geometric active contours methodology. These tools are designed to perform segmentation and tracking. The approaches developed in the thesis make an extensive use of partial differential equations and differential geometry to address the problems at hand. Most of the proposed approaches are cast into a variational framework. The contributions of the thesis can be summarized as follows: 1. An algorithm is presented that allows one to robustly track the position and the shape of a deformable object. 2. A variational segmentation algorithm is proposed that adopts a shape-driven point of view. 3. Diverse frameworks are introduced for including prior knowledge on shapes in the geometric active contour framework. 4. A framework is proposed that combines statistical information extracted from images with shape information learned a priori from examples 5. A technique is developed to jointly segment a 3D object of arbitrary shape in a 2D image and estimate its 3D pose with respect to a referential attached to a unique calibrated camera. 6. A methodology for the non-deterministic evolution of curves is presented, based on the theory of interacting particles systems. Active contours Computer vision Variational methods Image analysis Optical pattern recognition Computer vision Automatic tracking Geometry, Differential Algorithms
167	Pixel and patch based texture synthesis using image segmentation Tran, Minh Tue January 2010 (has links) [Truncated abstract] Texture exists all around us and serves as an important visual cue for the human visual system. Captured within an image, we identify texture by its recognisable visual pattern. It carries extensive information and plays an important role in our interpretation of a visual scene. The subject of this thesis is texture synthesis, which is de ned as the creation of a new texture that shares the fundamental visual characteristics of an existing texture such that the new image and the original are perceptually similar. Textures are used in computer graphics, computer-aided design, image processing and visualisation to produce realistic recreations of what we see in the world. For example, the texture on an object communicates its shape and surface properties in a 3D scene. Humans can discriminate between two textures and decide on their similarity in an instant, yet, achieving this algorithmically is not a simple process. Textures range in complexity and developing an approach that consistently synthe- sises this immense range is a dfficult problem to solve and motivates this research. Typically, texture synthesis methods aim to replicate texture by transferring the recognisable repeated patterns from the sample texture to synthesised output. Feature transferal can be achieved by matching pixels or patches from the sample to the output. As a result, two main approaches, pixel-based and patch-based, have es- tablished themselves in the active eld of texture synthesis. This thesis contributes to the present knowledge by introducing two novel texture synthesis methods. Both methods use image segmentation to improve synthesis results. ... The sample is segmented and the boundaries of the middle patch are confined to follow segment boundaries. This prevents texture features from being cut o prematurely, a common artifact of patch-based results, and eliminates the need for patch boundary comparisons that most other patch- based synthesis methods employ. Since no user input is required, this method is simple and straight-forward to run. The tiling of pre-computed tile pairs allows outputs that are relatively large to the sample size to be generated quickly. Output results show great success for textures with stochastic and semi-stochastic clustered features but future work is needed to suit more highly structured textures. Lastly these two texture synthesis methods are applied to the areas of image restoration and image replacement. These two areas of image processing involve replacing parts of an image with synthesised texture and are often referred to as constrained texture synthesis. Images can contain a large amount of complex information, therefore replacing parts of an image while maintaining image fidelity is a difficult problem to solve. The texture synthesis approaches and constrained synthesis implementations proposed in this thesis achieve successful results comparable with present methods. Computer vision Computer-aided design Image processing -- Digital techniques Optical pattern recognition Visual perception Texture synthesis Image segmentation
168	Facial expression recognition for multi-player on-line games Zhan, Ce. January 2008 (has links) Thesis (M.Comp.Sc.)--University of Wollongong, 2008. / Typescript. Includes bibliographical references: leaf 88-98.
169	3-D face recognition Eriksson, Anders 12 1900 (has links) Thesis (MEng) -- Stellenbosch University , 1999. / ENGLISH ABSTRACT: In recent years face recognition has been a focus of intensive research but has still not achieved its full potential, mainly due to the limited abilities of existing systems to cope with varying pose and illumination. The most popular techniques to overcome this problem are the use of 3-D models or stereo information as this provides a system with the necessary information about the human face to ensure good recognition performance on faces with largely varying poses. In this thesis we present a novel approach to view-invariant face recognition that utilizes stereo information extracted from calibrated stereo image pairs. The method is invariant of scaling, rotation and variations in illumination. For each of the training image pairs a number of facial feature points are located in both images using Gabor wavelets. From this, along with the camera calibration information, a sparse 3-D mesh of the face can be constructed. This mesh is then stored along with the Gabor wavelet coefficients at each feature point, resulting in a model that contains both the geometric information of the face as well as its texture, described by the wavelet coefficients. The recognition is then conducted by filtering the test image pair with a Gabor filter bank, projecting the stored models feature points onto the image pairs and comparing the Gabor coefficients from the filtered image pairs with the ones stored in the model. The fit is optimised by rotating and translating the 3-D mesh. With this method reliable recognition results were obtained on a database with large variations in pose and illumination. / AFRIKAANSE OPSOMMING: Alhoewel gesigsherkenning die afgelope paar jaar intensief ondersoek is, het dit nog nie sy volle potensiaal bereik nie. Dit kan hoofsaaklik toegeskryf word aan die feit dat huidige stelsels nie aanpasbaar is om verskillende beligting en posisie van die onderwerp te hanteer nie. Die bekendste tegniek om hiervoor te kompenseer is die gebruik van 3-D modelle of stereo inligting. Dit stel die stelsel instaat om akkurate gesigsherkenning te doen op gesigte met groot posisionele variansie. Hierdie werk beskryf 'n nuwe metode om posisie-onafhanklike gesigsherkenning te doen deur gebruik te maak van stereo beeldpare. Die metode is invariant vir skalering, rotasie en veranderinge in beligting. 'n Aantal gesigspatrone word gevind in elke beeldpaar van die oplei-data deur gebruik te maak van Gabor filters. Hierdie patrone en kamera kalibrasie inligting word gebruik om 'n 3-D raamwerk van die gesig te konstrueer. Die gesigmodel wat gebruik word om toetsbeelde te klassifiseer bestaan uit die gesigraamwerk en die Gabor filter koeffisiente by elke patroonpunt. Klassifisering van 'n toetsbeeldpaar word gedoen deur die toetsbeelde te filter met 'n Gabor filterbank. Die gestoorde modelpatroonpunte word dan geprojekteer op die beeldpaar en die Gabor koeffisiente van die gefilterde beelde word dan vergelyk met die koeffisiente wat gestoor is in die model. Die passing word geoptimeer deur rotosie en translasie van die 3-D raamwerk. Die studie het getoon dat hierdie metode akkurate resultate verskaf vir 'n databasis met 'n groot variansie in posisie en beligting. Computer vision Neural networks (Computer science) Optical pattern recognition Dissertations -- Electronic engineering
170	The mathematics of object recognition in machine and human vision Kim, Sunyoung 01 January 2003 (has links) The purpose of this project was to see why projective geometry is related to the sort of sensors that machines and humans use for vision. Optical pattern recognition Orthographic projection Image processing Pattern recognition systems Polynomials Algebra Applied Mathematics

Search results