Spelling suggestions: "subject:"emplate catching"" "subject:"emplate batching""
51 |
A Novel Approach for Continuous Speech Tracking and Dynamic Time Warping. Adaptive Framing Based Continuous Speech Similarity Measure and Dynamic Time Warping using Kalman Filter and Dynamic State ModelKhan, Wasiq January 2014 (has links)
Dynamic speech properties such as time warping, silence removal and background noise interference are the most challenging issues in continuous speech signal matching. Among all of them, the time warped speech signal matching is of great interest and has been a tough challenge for the researchers. An adaptive framing based continuous speech tracking and similarity measurement approach is introduced in this work following a comprehensive research conducted in the diverse areas of speech processing. A dynamic state model is introduced based on system of linear motion equations which models the input (test) speech signal frame as a unidirectional moving object along the template speech signal. The most similar corresponding frame position in the template speech is estimated which is fused with a feature based similarity observation and the noise variances using a Kalman filter. The Kalman filter provides the final estimated frame position in the template speech at current time which is further used for prediction of a new frame size for the next step. In addition, a keyword spotting approach is proposed by introducing wavelet decomposition based dynamic noise filter and combination of beliefs. The Dempster’s theory of belief combination is deployed for the first time in relation to keyword spotting task. Performances for both; speech tracking and keyword spotting approaches are evaluated using the statistical metrics and gold standards for the binary classification. Experimental results proved the superiority of the proposed approaches over the existing methods. / The appendices files are not available online.
|
52 |
Development of a framework for the design of expanded metal facades : Using artificial intelligence to streamline pre-production workLarsson, Linnéa, Ståhlbrand, Moa January 2022 (has links)
The field of design automation aims to automate repetitive tasks in a workflow in order to free up time for more productive work. In this thesis, design automation with the help of AI techniques is investigated to streamline the pre-production work of expanded metal facades. Two different problems concerning pre-production work are investigated in this thesis. The first one focuses on how to translate architectural drawings in pdf format to a bill of material. The second problem aims to develop a non-linear method for calculating the free area of the expanded metal facades. The method used for this project is an adaptation of the product development process with the inspiration of knowledge-based engineering. For the first project, the AI method template matching was successfully used. With a script using this method, most of the panels are identified, except for panels where the drawings do not provide clear lines or where lines around the panels do not exist. The line quality in the architectural drawings was shown to impact the size estimation of the panels. In the second project, a non-linear machine learning model was developed. However, it was not managed within this project to get a good enough accuracy. The main reason for this is that it is suspected that the data is not accurate enough, nor are the 78 data points enough to train the model.
|
53 |
IS-implementation : a tri-motors theory of organizational change : case study of how an IT-enabled process of organizational change because of the presence of a teleological, life-cycle, and dialectical motor unfolds within a Dutch government organizationWinkel, Geellis January 2010 (has links)
The reason for the study is that IT-enabled organizational change processes such as information system implementations have high costs and disappointing results. Studies to identify causes of the mentioned failures are mainly based on a variance approach. This study applies another approach which is not yet performed in this field of research and affects several themes. Based on a process approach data is compared with ideal-process theories to identify the generative mechanisms causing the unfolding of the process. Thus, the study identifies a recipe and not the ingredients.
|
54 |
Markov Random Field Based Road Network Extraction From High Resoulution Satellite ImagesOzturk, Mahir 01 February 2013 (has links) (PDF)
Road Networks play an important role in various applications such as urban and rural planning, infrastructure planning, transportation management, vehicle navigation. Extraction of Roads from Remote Sensed satellite images for updating road database in geographical information systems (GIS) is generally done manually by a human operator. However, manual extraction of roads is time consuming and labor intensive process. In the existing literature, there are a great number of researches published for the purpose of automating the road extraction process. However, automated processes still yield some erroneous and incomplete results and human intervention is still required.
The aim of this research is to propose a framework for road network extraction from high spatial resolution multi-spectral imagery (MSI) to improve the accuracy of road extraction systems. The proposed framework begins with a spectral classification using One-class Support Vector Machines (SVM) and Gaussian Mixture Models (GMM) classifiers. Spectral Classification exploits the spectral signature of road surfaces to classify road pixels. Then, an iterative template matching filter is proposed to refine spectral classification results. K-medians clustering algorithm is employed to detect candidate road centerline points. Final road network formation is achieved by Markov Random Fields. The extracted road network is evaluated against a reference dataset using a set of quality metrics.
|
55 |
Greitas ir tikslus objekto parametrų nustatymas mašininės regos sistemose / Fast and accurate object parameters detection in machine vision systemKazakevičius, Tadas 10 June 2011 (has links)
Objekto atpažinimas ir pozicijos nustatymas gali būti pritaikomas daugeliui pramonėje egzistuojančių uždavinių. Šio darbo pagrindinis tikslas yra sukurti mašininės regos sistemą, kuria būtų galima greitai ir tiksliai rasti objekto poziciją pagal pasirinktą objekto modelį. Šiame darbe gilinamasi į GPU veikimo principus ir privalumus apdorojant vaizdus GLSL programavimo kalba. Apžvelgiami praktikoje taikomų metodų, skirtų objekto pozicijai nustatyti, veikimo principai, jų privalumai ir trūkumai. Taip pat šiame darbe aprašomas suformuotas ir įgyvendintas realaus laiko metodas, naudojantis GPU teikiama sparta atlikti vartotojo pasirinkto modelio paiešką. Pabaigoje pateikiami pasiekti įgyvendinto metodo spartos rodikliai, privalumai ir trūkumai. Darbą sudaro: įvadas, mašininėje regoje pasitaikančių problemų tyrinėjimas, objekto paieškos metodų apžvalga, darbo su grafinėmis vaizdo plokštėmis privalumai ir trūkumai, objekto paieškos su grafine vaizdo plokšte metodas, pasiekti rezultatai, išvados ir literatūros sąrašas. Darbo apimtis – 53 p. teksto be priedų, 30 pav., 2 lent., 26 literatūros šaltiniai. / Object recognition and parameter detection could be used in many areas from electronics to food industry. One of the most important problems in laser industry is to transform laser work trajectories based on constant object model. In the real life applications model could be rotated or translated due to the fact that object must be put in laser work area. The translation and rotation of object must be found to fit user defined constant model. There are many methods for object parameters detection, but image processing tasks require a lot of computing power. Recent research on image processing with graphics processing units - GPU, shows huge performance results compared with central processing units – CPU. The purpose of this work is to find out the main fundamentals for fast and accurate object parameter detection in machine vision systems. In this work it is focused on object parameter detection with GPU. Moreover, the analysis and comparison of different object parameters detection methods are proposed. Object parameter detection system was implemented with C++ and GLSL shading language, thus the system could be adapted to different computer hardware and operating systems. Work size – 53 p. text, 30 illustrations, 2 tables, 26 bibliographic sources.
|
56 |
Automated video-based measurement of eye closure using a remote camera for detecting drowsiness and behavioural microsleepsMalla, Amol Man January 2008 (has links)
A device capable of continuously monitoring an individual’s levels of alertness in real-time is highly desirable for preventing drowsiness and lapse related accidents. This thesis presents the development of a non-intrusive and light-insensitive video-based system that uses computer-vision methods to localize face, eyes, and eyelids positions to measure level of eye closure within an image, which, in turn, can be used to identify visible facial signs associated with drowsiness and behavioural microsleeps.
The system was developed to be non-intrusive and light-insensitive to make it practical and end-user compliant. To non-intrusively monitor the subject without constraining their movement, the video was collected by placing a camera, a near-infrared (NIR) illumination source, and an NIR-pass optical filter at an eye-to-camera distance of 60 cm from the subject. The NIR-illumination source and filter make the system insensitive to lighting conditions, allowing it to operate in both ambient light and complete darkness without visually distracting the subject.
To determine the image characteristics and to quantitatively evaluate the developed methods, reference videos of nine subjects were recorded under four different lighting conditions with the subjects exhibiting several levels of eye closure, head orientations, and eye gaze. For each subject, a set of 66 frontal face reference images was selected and manually annotated with multiple face and eye features.
The eye-closure measurement system was developed using a top-down passive feature-detection approach, in which the face region of interest (fROI), eye regions of interests (eROIs), eyes, and eyelid positions were sequentially localized. The fROI was localized using an existing Haar-object detection algorithm. In addition, a Kalman filter was used to stabilize and track the fROI in the video. The left and the right eROIs were localized by scaling the fROI with corresponding proportional anthropometric constants. The position of an eye within each eROI was detected by applying a template-matching method in which a pre-formed eye-template image was cross-correlated with the sub-images derived from the eROI. Once the eye position was determined, the positions of the upper and lower eyelids were detected using a vertical integral-projection of the eROI. The detected positions of the eyelids were then used to measure eye closure.
The detection of fROI and eROI was very reliable for frontal-face images, which was considered sufficient for an alertness monitoring system as subjects are most likely facing straight ahead when they are drowsy or about to have microsleep. Estimation of the y- coordinates of the eye, upper eyelid, and lower eyelid positions showed average median errors of 1.7, 1.4, and 2.1 pixels and average 90th percentile (worst-case) errors of 3.2, 2.7, and 6.9 pixels, respectively (1 pixel 1.3 mm in reference images). The average height of a fully open eye in the reference database was 14.2 pixels. The average median and 90th percentile errors of the eye and eyelid detection methods were reasonably low except for the 90th percentile error of the lower eyelid detection method. Poor estimation of the lower eyelid was the primary limitation for accurate eye-closure measurement.
The median error of fractional eye-closure (EC) estimation (i.e., the ratio of closed portions of an eye to average height when the eye is fully open) was 0.15, which was sufficient to distinguish between the eyes being fully open, half closed, or fully closed. However, compounding errors in the facial-feature detection methods resulted in a 90th percentile EC estimation error of 0.42, which was too high to reliably determine extent of eye-closure. The eye-closure measurement system was relatively robust to variation in facial-features except for spectacles, for which reflections can saturate much of the eye-image. Therefore, in its current state, the eye-closure measurement system requires further development before it could be used with confidence for monitoring drowsiness and detecting microsleeps.
|
57 |
A General System for Supervised Biomedical Image SegmentationChen, Cheng 15 March 2013 (has links)
Image segmentation is important with applications to several problems in biology and medicine. While extensively researched, generally, current segmentation methods perform adequately in the applications for which they were designed, but often require extensive modifications or calibrations before used in a different application. We describe a system that, with few modifications, can be used in a variety of image segmentation problems. The system is based on a supervised learning strategy that utilizes intensity neighborhoods to assign each pixel in a test image its correct class based on training data. In summary, we have several innovations: (1) A general framework for such a system is proposed, where rotations and variations of intensity neighborhoods in scales are modeled, and a multi-scale classification framework is utilized to segment unknown images; (2) A fast algorithm for training data selection and pixel classification is presented, where a majority voting based criterion is proposed for selecting a small subset from raw training set. When combined with 1-nearest neighbor (1-NN) classifier, such an algorithm is able to provide descent classification accuracy within reasonable computational complexity. (3) A general deformable model for optimization of segmented regions is proposed, which takes the decision values from previous pixel classification process as input, and optimize the segmented regions in a partial differential equation (PDE) framework. We show that the performance of this system in several different biomedical applications, such as tissue segmentation tasks in magnetic resonance and histopathology microscopy images, as well as nuclei segmentation from fluorescence microscopy images, is similar or better than several algorithms specifically designed for each of these applications.
In addition, we describe another general segmentation system for biomedical applications where a strong prior on shape is available (e.g. cells, nuclei). The idea is based on template matching and supervised learning, and we show the examples of segmenting cells and nuclei from microscopy images. The method uses examples selected by a user for building a statistical model which captures the texture and shape variations of the nuclear structures from a given data set to be segmented. Segmentation of subsequent, unlabeled, images is then performed by finding the model instance that best matches (in the normalized cross correlation sense) local neighborhood in the input image. We demonstrate the application of our method to segmenting cells and nuclei from a variety of imaging modalities, and quantitatively compare our results to several other methods. Quantitative results using both simulated and real image data show that, while certain methods may work well for certain imaging modalities, our software is able to obtain high accuracy across several imaging modalities studied. Results also demonstrate that, relative to several existing methods, the template based method we propose presents increased robustness in the sense of better handling variations in illumination, variations in texture from different imaging modalities, providing more smooth and accurate segmentation borders, as well as handling better cluttered cells and nuclei.
|
58 |
Lightweight User Agents / Användaragenter med små avtyckEstgren, Martin January 2016 (has links)
The unit for information security and IT architecture at The Swedish Defence Research Agency (FOI) conducts work with a cyber range called CRATE (Cyber Range and Training Environment). Currently, simulation of user activity involves scripts inside the simulated network. This solution is not ideal because of the traces it leaves in the system and the general lack of standardised GUI API between different operating systems. FOI are interested in testing the use of artificial user agent located outside the virtual environment using computer vision and the virtualisation API to execute actions and extract information from the system. This paper focuses on analysing the reliability of template matching, a computer vision algorithm used to localise objects in images using already identified images of said object as templates. The analysis will evaluate both the reliability of localising objects and the algorithms ability to correctly identify if an object is present in the virtual environment. Analysis of template matching is performed by first creating a prototype of the agent's sensory system and then simulate scenarios which the agent might encounter. By simulating the environment, testing parameters can be manipulated and monitored in a reliable way. The parameters manipulated involves both the amount and type of image noise in the template and screenshot, the agent’s discrimination threshold for what constitutes a positive match, and information about the template such as template generality. This paper presents the performance and reliability of the agent in regards to what type of image noise affects the result, the amount of correctly identified objects given different discrimination thresholds, and computational time of template matching when different image filters are applied. Furthermore the best cases for each study are presented as comparison for the other results. In the end of the thesis we present how for screenshots with objects very similar to the templates used by the agent, template matching can result in a high degree of accuracy in both object localization and object identification and that a small reduction of similarity between template and screenshot to reduce the agent's ability to reliably identifying specific objects in the environment.
|
59 |
A Comparison of Image Processing Techniques for Bird DetectionReyes, Elsa 01 June 2014 (has links)
Orchard fruits and vegetable crops are vulnerable to wild birds and animals. These wild birds and animals can cause critical damage to the produce. Traditional methods of scaring away birds such as scarecrows are not long-term solutions but short-term solutions. This is a huge problem especially near areas like San Luis Obispo where there are vineyards. Bird damage can be as high as 50% for grapes being grown in vineyards. The total estimated revenue lost annually in the 10 counties in California due to bird and rodent damage to 22 selected crops ranged from $168 million to $504 million (in 2009 dollars). A more effective and permanent system needs to be put into place. Monitoring systems in agricultural settings could potentially provide a lot of data for image processing. Most current monitoring systems however don’t focus on image processing but instead really heavily on sensors. Just having sensors for certain systems work, but for birds, monitoring it is not an option because they are not domesticated like pigs, cows etc. in which most these agricultural monitoring systems work on. Birds can fly in and out of the area whereas domesticated animals can be confined to certain physical regions. The most crucial step in a smart scarecrow system would be how a threat would v be detected. Image processing methods can be effectively applied to detecting items in video footage. This paper will focus on bird detection and will analyze motion detection with image subtraction, bird detection with template matching, and bird detection with the Viola-Jones Algorithm. Of the methods considered, bird detection with the Viola-Jones Algorithm had the highest accuracy (87%) with a somewhat low false positive rate. This image processing step would ideally be incorporated with hardware (such as a microcontroller or FPGA, sensors, a camera etc.) to form a smart scarecrow system.
|
60 |
Detekce tváří v obraze / Face recognitionŠkrobák, Dalibor January 2008 (has links)
This thesis is focused on face detection in static picture. Theoretical part contains color spaces (RGB, HSI, YCbCr), methods for skin detection (explicit, parametric or non-parametric methods), image metric, edge detection, mathematical morphology, methods for classification faces (appearance-based methods, feature invariant approaches, knowledge-based methods, template matching methods). Practical part of this thesis contains concept and practical realization two algorithms for segmentation skin in static image (simple method based on Cr chroma components and statistical method). Practical part contains concept and practical realization two algorithms for classification face (appearance-based method and template matching method) too.
|
Page generated in 0.1382 seconds