Global ETD Search

241	Sledování a rozpoznávání lidí na videu / Tracking and Recognition of People in Video Šajboch, Antonín January 2016 (has links) The master's thesis deals with detecting and tracking people in the video. To get optimal recognition was used convolution neural network, which extracts vector features from the enclosed frame the face. The extracted vector is further classified. Recognition process must take place in a real time and also with respect are selected optimal methods. There is a new dataset faces, which was obtained from a video record at the faculty area. Videos and dataset were used for experiments to verify the accuracy of the created system. The recognition accuracy is about 85% . The proposed system can be used, for example, to register people, counting passages or to report the occurrence of an unknown person in a building.
242	Identifikace osob pomocí hlubokých neuronových sítí / Deep Neural Networks for Person Identification Duban, Michal January 2016 (has links) This master's thesis deals with design and implementation of convolutional neural networks used in person re-identification. Implemented convolutional neural networks were tested on two datasets CUHK01 a CUHK03. Results, comparable with state of the art methods were acheved on these datasets. Designed networks were implemented in Caffe framework.
243	Air Reconnaissance Analysis using Convolutional Neural Network-based Object Detection Fasth, Niklas, Hallblad, Rasmus January 2020 (has links) The Swedish armed forces use the Single Source Intelligent Cell (SSIC), developed by Saab, for analysis of aerial reconnaissance video and report generation. The analysis can be time-consuming and demanding for a human operator. In the analysis workflow, identifying vehicles is an important part of the work. Artificial Intelligence is widely used for analysis in many industries to aid or replace a human worker. In this paper, the possibility to aid the human operator with air reconnaissance data analysis is investigated, specifically, object detection for finding cars in aerial images. Many state-of-the-art object detection models for vehicle detection in aerial images are based on a Convolutional Neural Network (CNN) architecture. The Faster R-CNN- and SSD-based models are both based on this architecture and are implemented. Comprehensive experiments are conducted using the models on two different datasets, the open Video Verification of Identity (VIVID) dataset and a confidential dataset provided by Saab. The datasets are similar, both consisting of aerial images with vehicles. The initial experiments are conducted to find suitable configurations for the proposed models. Finally, an experiment is conducted to compare the performance of a human operator and a machine. The results from this work prove that object detection can be used to supporting the work of air reconnaissance image analysis regarding inference time. The current performance of the object detectors makes applications, where speed is more important than accuracy, most suitable. Deep Learning Object detection Convolutional neural network Faster R-CNN Single Shot MultiBox Detector Aerial images Data annotation Engineering and Technology Teknik och teknologier
244	Vyhledávání v obrázkových kolekcích na základě lokálních regionů a reprezentací z hlubokých neuronových sítí / Searching Image Collections Using Deep Representations of Local Regions Bátoryová, Jana January 2020 (has links) In a known-item search task (KIS), the goal is to find a previously seen image in a multimedia collection. In this thesis, we discuss two different approaches based on the visual description of the image. In the first one, the user creates a collage of images (using images from an external search engine), based on which we provide the most similar results from the dataset. Our results show that preprocessing the images in the dataset by splitting them into several parts is a better way to work with the spatial information contained in the user input. We compared the approach to a baseline, which does not utilize this spatial information and an approach that alters a layer in a deep neural network. We also present an alternative approach to the KIS task, search by faces. In this approach, we work with the faces extracted from the images. We investigate face representation for the ability to sort the faces based on their similarity. Then we present a structure that allows easy exploration of the set of faces. We provide a demo, implementing all presented techniques.
245	Sequence classification on gamified behavior data from a learning management system : Predicting student outcome using neural networks and Markov chain Elmäng, Niclas January 2020 (has links) This study has investigated whether it is possible to classify time series data originating from a gamified learning management system. By using the school data provided by the gamification company Insert Coin AB, the aim was to distribute the teacher’s supervision more efficiently among students who are more likely to fail. Motivating this is the possibility that the student retention and completion rate can be increased. This was done by using Long short-term memory and convolutional neural networks and Markov chain to classify time series of event data. Since the classes are balanced the classification was evaluated using only the accuracy metric. The results for the neural networks show positive results but overfitting seems to occur strongly for the convolutional network and less so for the Long short-term memory network. The Markov chain show potential but further work is needed to mitigate the problem of a strong correlation between sequence length and likelihood. Long Short-term Memory Convolutional neural network Markov Chain Time series Classification Gamification Information Systems, Social aspects
246	Hyperparameters impact in a convolutional neural network Bylund, Andreas, Erikssen, Anton, Mazalica, Drazen January 2020 (has links) Machine learning and image recognition is a big and growing subject in today's society. Therefore the aim of this thesis is to compare convolutional neural networks with different hyperparameter settings and see how the hyperparameters affect the networks test accuracy in identifying images of traffic signs. The reason why traffic signs are chosen as objects to evaluate hyperparameters is due to the author's previous experience in the domain. The object itself that is used for image recognition does not matter. Any dataset with images can be used to see the hyperparameters affect. Grid search is used to create a large amount of models with different width and depth, learning rate and momentum. Convolution layers, activation functions and batch size are all tested separately. These experiments make it possible to evaluate how the hyperparameters affect the networks in their performance of recognizing images of traffic signs. The models are created using Keras API and then trained and tested on the dataset Traffic Signs Preprocessed. The results show that hyperparameters affect test accuracy, some affect more than others. Configuring learning rate and momentum can in some cases result in disastrous results if they are set too high or too low. Activation function also show to be a crucial hyperparameter where it in some cases produce terrible results. Machine learning Convolutional neural network Traffic signs preprocessed Hyperparameters Supervised learning Keras Information Systems, Social aspects
247	Evaluation of the CNN Based Architectures on the Problem of Wide Baseline Stereo Matching / Utvärdering av system för stereomatchning som är baserade på neurala nätverk med faltning Li, Vladimir January 2016 (has links) Three-dimensional information is often used in robotics and 3D-mapping. There exist several ways to obtain a three-dimensional map. However, the time of flight used in the laser scanners or the structured light utilized by Kinect-like sensors sometimes are not sufficient. In this thesis, we investigate two CNN based stereo matching methods for obtaining 3D-information from a grayscaled pair of rectified images.While the state-of-the-art stereo matching method utilize a Siamese architecture, in this project a two-channel and a two stream network are trained in an attempt to outperform the state-of-the-art. A set of experiments were performed to achieve optimal hyperparameters. By changing one parameter at the time, the networks with architectures mentioned above are trained. After a completed training the networks are evaluated with two criteria, the error rate, and the runtime.Due to time limitations, we were not able to find optimal learning parameters. However, by using settings from [17] we train a two-channel network that performed almost on the same level as the state-of-the-art. The error rate on the test data for our best architecture is 2.64% while the error rate for the state-of-the-art Siamese network is 2.62%. We were not able to achieve better performance than the state-of-the-art, but we believe that it is possible to reduce the error rate further. On the other hand, the state-of-the-art Siamese stereo matching network is more efficient and faster during the disparity estimation. Therefore, if the time efficiency is prioritized, the Siamese based network should be considered. Stereo matching Stereo Reconstruction Computer Stereo Vision CNN Convolutional Neural Network Disparity Estimation 3D reconstruction KITTI
248	Semantic-Driven Unsupervised Image-to-Image Translation for Distinct Image Domains Ackerman, Wesley 15 September 2020 (has links) We expand the scope of image-to-image translation to include more distinct image domains, where the image sets have analogous structures, but may not share object types between them. Semantic-Driven Unsupervised Image-to-Image Translation for Distinct Image Domains (SUNIT) is built to more successfully translate images in this setting, where content from one domain is not found in the other. Our method trains an image translation model by learning encodings for semantic segmentations of images. These segmentations are translated between image domains to learn meaningful mappings between the structures in the two domains. The translated segmentations are then used as the basis for image generation. Beginning image generation with encoded segmentation information helps maintain the original structure of the image. We qualitatively and quantitatively show that SUNIT improves image translation outcomes, especially for image translation tasks where the image domains are very distinct. computer science machine learning image-to-image translation generative adversarial network deep learning unsupervised learning convolutional neural network Physical Sciences and Mathematics
249	Image Segmentation on Lymph Node Images using Machine Learning to improve Colorectal Cancer Diagnosis Ågren, Elias January 2022 (has links) In cancer diagnosis there is a goal of having the treatment being tailored to each patient. This in order to increase efficiency and reduce side effects. Using more data on each patient can help in achieving this. One such data source is histological images on tissues, such as lymph nodes. This report sets out to find a method in which such images on lymph nodes can be automatically segmented. This so that they can later be analysed and maybe tell in what stage a cancer is in. Such work is today done by hand, and this makes it a subjective process, that might differ between doctors and institutions. If there was a method done by a computer, the process would be replicable and objective. Also, a lot of time would be saved. The results show that such a method is reachable in this early stage of development. It is also quite efficient when segmenting the lymph node itself. The segmentation of smaller areas of the lymph nodes is not as efficient, but with further work in the area it might improve enough to be useful. Some issues are still had since the method relies in part on a person to decide a parameter in order to get a clean segmentation. The final conclusion is that one model is to prefer compared to the others and that further work on this might make it a useful tool in analysing histological images. Image Segmentation Lymph Node Images Machine Learning Colorectal Cancer Diagnosis Convolutional Neural Network U-Net Cancer Staging Histological Images Analysis Mathematics Matematik
250	Deep Active Learning for Image Classification using Different Sampling Strategies Saleh, Shahin January 2021 (has links) Convolutional Neural Networks (CNNs) have been proved to deliver great results in the area of computer vision, however, one fundamental bottleneck with CNNs is the fact that it is heavily dependant on the ground truth, that is, labeled training data. A labeled dataset is a group of samples that have been tagged with one or more labels. In this degree project, we mitigate the data greedy behavior of CNNs by applying deep active learning with various kinds of sampling strategies. The main focus will be on the sampling strategies random sampling, least confidence sampling, margin sampling, entropy sampling, and K- means sampling. We choose to study the random sampling strategy since it will work as a baseline to the other sampling strategies. Moreover, the least confidence sampling, margin sampling, and entropy sampling strategies are uncertainty based sampling strategies, hence, it is interesting to study how they perform in comparison with the geometrical based K- means sampling strategy. These sampling strategies will help to find the most informative/representative samples amongst all unlabeled samples, thus, allowing us to label fewer samples. Furthermore, the benchmark datasets MNIST and CIFAR10 will be used to verify the performance of the various sampling strategies. The performance will be measured in terms of accuracy and less data needed. Lastly, we concluded that by using least confidence sampling and margin sampling we reduced the number of labeled samples by 79.25% in comparison with the random sampling strategy for the MNIST dataset. Moreover, by using entropy sampling we reduced the number of labeled samples by 67.92% for the CIFAR10 dataset. / Faltningsnätverk har visat sig leverera bra resultat inom området datorseende, men en fundamental flaskhals med Faltningsnätverk är det faktum att den är starkt beroende av klassificerade datapunkter. I det här examensarbetet hanterar vi Faltningsnätverkens giriga beteende av klassificerade datapunkter genom att använda deep active learning med olika typer av urvalsstrategier. Huvudfokus kommer ligga på urvalsstrategierna slumpmässigt urval, minst tillförlitlig urval, marginal baserad urval, entropi baserad urval och K- means urval. Vi väljer att studera den slumpmässiga urvalsstrategin eftersom att den kommer användas för att mäta prestandan hos de andra urvalsstrategierna. Dessutom valde vi urvalsstrategierna minst tillförlitlig urval, marginal baserad urval, entropi baserad urval eftersom att dessa är osäkerhetsbaserade strategier som är intressanta att jämföra med den geometribaserade strategin K- means. Dessa urvalsstrategier hjälper till att hitta de mest informativa/representativa datapunkter bland alla oklassificerade datapunkter, vilket gör att vi behöver klassificera färre datapunkter. Vidare kommer standard dastaseten MNIST och CIFAR10 att användas för att verifiera prestandan för de olika urvalsstrategierna. Slutligen drog vi slutsatsen att genom att använda minst tillförlitlig urval och marginal baserad urval minskade vi mängden klassificerade datapunkter med 79, 25%, i jämförelse med den slumpmässiga urvalsstrategin, för MNIST- datasetet. Dessutom minskade vi mängden klassificerade datapunkter med 67, 92% med hjälp av entropi baserad urval för CIFAR10datasetet. Convolutional Neural Network Deep Active Learning Deep Learning Image Classification Sampling Strategies SemiSupervised Learning. Bildklassificering Faltningsnätverk Deep Active Learning Djupinlärning Semiövervakat lärande Urvalsstrategier. Computer and Information Sciences Data- och informationsvetenskap

Search results