Global ETD Search

41	Metadata Validation Using a Convolutional Neural Network : Detection and Prediction of Fashion Products Nilsson Harnert, Henrik January 2019 In the e-commerce industry, importing data from third-party clothing brands requires validation of this data. If this validation step is done manually, it is a tedious and time-consuming task. Part of this task can be replaced or assisted by using computer vision to automatically find clothing types, such as T-shirts and pants, within imported images. After a detection of clothing type is computed, it is possible to recommend the likelihood of clothing products correlating to data imported with a certain accuracy. This was done alongside a prototype interface that can be used to start training, find clothing types in an image and mask annotations of products. Annotations are areas describing different clothing types and are used to train an object detector model. A model for finding clothing types is trained using the Mask R-CNN object detector and achieves 0.49 mAP. A detection takes just above one second on an Nvidia GTX 1070 8 GB graphics card. Recommending one or several products based on a detection takes 0.5 seconds, and the algorithm used is k-nearest neighbors. If prediction is done on products that were used to build the model of the prediction algorithm, almost perfect accuracy is achieved, while products in images of other products do not achieve nearly as good results. Computer vision Object detection Fashion detection Convolutional neural network Metadata validation Computer Systems Datorsystem
42	Human Activity Recognition Based on Transfer Learning Pang, Jinyong 06 July 2018 Human activity recognition (HAR) based on time series data is the problem of classifying various patterns. Its wide applications in health care have great commercial value. With the increasing spread of smart devices, people have a strong desire for services or products customized to their characteristics. Deep learning models can handle HAR tasks with satisfactory results. However, training a deep learning model consumes a great deal of time and computational resources. Consequently, developing a HAR system efficiently becomes a challenging task. In this study, we develop a solid HAR system using a Convolutional Neural Network based on transfer learning, which can eliminate those barriers. Convolutional neural network Deep learning Human activity recognition Transfer Learning Statistics and Probability
43	Identification of Individuals from Ears in Real World Conditions Hansley, Earnest Eugene 05 April 2018 A number of researchers have shown that ear recognition is a viable alternative to more common biometrics such as fingerprint, face and iris because the ear is relatively stable over time, the ear is non-invasive to capture, the ear is expressionless, and both the ear’s geometry and shape have significant variation among individuals. Researchers have used different approaches to enhance ear recognition. Some researchers have improved upon existing algorithms, some have developed algorithms from scratch to assist with recognizing individuals by ears, and some researchers have taken algorithms tried and tested for another purpose, i.e., face recognition, and applied them to ear recognition. These approaches have resulted in a number of state-of-the-art effective methods to identify individuals by ears. However, most ear recognition research has been done using ear images that were captured in an ideal setting: ear images have near perfect lighting for image quality, ears are in the same position for each subject, and ears are without earrings, hair occlusions, or anything else that could block viewing of the entire ear. In order for ear recognition to be practical, current approaches must be improved. Ear recognition must move beyond ideal settings and demonstrate effectiveness in an unconstrained environment reflective of real world conditions. Ear recognition approaches must be scalable to handle large groups of people. And, ear recognition should demonstrate effectiveness across a diverse population. This dissertation advances ear recognition from ideal settings to real world settings. We devised an ear recognition framework that outperformed state-of-the-art recognition approaches using the most challenging sets of publicly available ear images and the most voluminous set of unconstrained ear images that we are aware of.
We developed a Convolutional Neural Network-based solution for ear normalization and description, we designed a two-stage landmark detector, and we fused learned and handcrafted descriptors. Using our framework, we identified some individuals that are wearing earrings and that have other occlusions, such as hair. The results suggest that our framework can be a gateway for identification of individuals in real world conditions. Biometrics Unconstrained Ear Recognition Handcrafted Features Learned Features Convolutional Neural Network Computer Sciences
44	Chinese Text Classification Based On Deep Learning Wang, Xutao January 2018 Text classification has always been a concern in the area of natural language processing, especially now that data volumes are growing massively with the development of the internet. The recurrent neural network (RNN) is one of the most popular methods for natural language processing due to its recurrent architecture, which gives it the ability to process serialized information. Meanwhile, the convolutional neural network (CNN) has shown its ability to extract features from visual imagery. This paper combines the advantages of RNN and CNN and proposes a model called BLSTM-C for Chinese text classification. BLSTM-C begins with a bidirectional long short-term memory (BLSTM) layer, which is a special kind of RNN, to get a sequence output based on the past context and the future context. It then feeds this sequence to a CNN layer, which is used to extract features from the previous sequence. We evaluate the BLSTM-C model on several tasks such as sentiment classification and category classification, and the results show our model’s remarkable performance on these text tasks. Text classification Recurrent neural network Convolutional neural network Computer Systems Datorsystem
45	Designing an Artificial Neural Network for state evaluation in Arimaa : Using a Convolutional Neural Network / Design av ett Artificiellt Neuralt Nätverk för evaluering av tillstånd i Arimaa Keisala, Simon January 2017 Building agents that are able to play board games such as Tic Tac Toe, Chess, Go and Arimaa has been, and still is, a major difficulty in Artificial Intelligence. For the mentioned board games, there is a certain number of legal moves a player can make in a specific board state. Tic Tac Toe has on average around 4-5 legal moves, with a total of 255168 possible games. Chess, Go and Arimaa all have a larger number of possible legal moves and an almost infinite number of possible games, making it impossible to have complete knowledge of the outcome. This thesis work has created various Neural Networks, with the purpose of evaluating the likelihood of winning a game given a certain board state. An improved evaluation function would compensate for the inability to do a deeper tree search in Arimaa, and the anticipation is to compete on equal skills against another well-performing agent (meijin) having one less search depth. The results show great potential. From a mere one hundred games against meijin, the network manages to separate good from bad positions, and after another one hundred games it is able to beat meijin with equal search depth. It seems promising that, by improving the training and by testing different sizes for the neural network, a neural network could win even with one less search depth. The huge branching factor of Arimaa makes such an improvement of the evaluation beneficial, even if the evaluation would be 10 000 times slower. neural network machine learning arimaa evaluation reinforcement learning convolutional neural network Computer Sciences Datavetenskap (datalogi)
46	Automating Text Categorization with Machine Learning : Error Responsibility in a multi-layer hierarchy Helén, Ludvig January 2017 The company Ericsson is taking steps towards embracing automation techniques and applying them to their product development cycle. Ericsson wants to apply machine learning techniques to automate the evaluation of a text categorization problem of error reports, or trouble reports (TRs). In excess of 100,000 TRs are handled annually. This thesis presents two possible solutions to the routing problem. One uses traditional classifiers (Multinomial Naive Bayes and Support Vector Machines) to decide the route through the company hierarchy where a specific TR belongs. The other utilizes a Convolutional Neural Network for translating the TRs into low-dimensional word vectors, or word embeddings, in order to classify which group within the company should be responsible for handling the TR. The traditional classifiers achieve up to 83% accuracy and the Convolutional Neural Network achieves up to 71% accuracy in the task of predicting the correct class for a specific TR. error handling cnn convolutional neural network text classification Computer Sciences Datavetenskap (datalogi)
47	A Deep Learning Approach to Autonomous Relative Terrain Navigation Campbell, Tanner January 2017 Autonomous relative terrain navigation is a problem at the forefront of many space missions involving close proximity operations to any target body. With no definitive answer, there are many techniques to help cope with this issue using both passive and active sensors, but almost all require high fidelity models of the associated dynamics in the environment. Convolutional Neural Networks (CNNs) trained with images rendered from a digital terrain map (DTM) of the body’s surface can provide a way to side-step the issue of unknown or complex dynamics while still providing reliable autonomous navigation. This is achieved by directly mapping an image to a relative position to the target body. The portability of trained CNNs allows “offline” training that can yield a matured network capable of being loaded onto a spacecraft for real-time position acquisition. In this thesis the lunar surface is used as the proving ground for this optical navigation technique, but the methods used are not unique to the Moon, and are applicable in general. Artificial Intelligence Autonomous Navigation Convolutional Neural Network Deep Neural Network Relative Terrain Navigation Spacecraft GNC
48	Learning Structured and Deep Representations for Traffic Scene Understanding Yu, Zhiding 01 December 2017 Recent advances in representation learning have led to an increasing variety of vision-based approaches in traffic scene understanding. This includes general vision problems such as object detection, depth estimation, edge/boundary/contour detection, semantic segmentation and scene classification, as well as application-driven problems such as pedestrian detection, vehicle detection, lane marker detection and road segmentation, etc. In this thesis, we approach some of these problems by exploring structured and invariant representations from the visual input. Our research is mainly motivated by two facts: 1. Traffic scenes often contain highly structured layouts. Exploring structured priors is expected to help considerably in improving the scene understanding performance. 2. A major challenge of traffic scene understanding lies in the diverse and changing nature of the contents. It is therefore important to find robust visual representations that are invariant against such variability. We start from highway scenarios where we are interested in detecting the hard road borders and estimating the drivable space before such physical boundary. To this end, we treat the task as a joint detection and tracking problem, and formulate it with structured Hough voting (SVH): A conditional random field model that explores both intra-frame geometric and interframe temporal information to generate more accurate and stable predictions. Turning from highway scenes to urban scenes, we consider dense prediction problems such as category-aware semantic edge detection and semantic segmentation. Category-aware semantic edge detection is challenging as the model is required to jointly localize object contours and classify each edge pixel to one or multiple predefined classes. We propose CASENet, a multilabel deep network with state of the art edge detection performance.
To address the label misalignment problem in edge learning, we also propose SEAL, a framework towards simultaneous edge alignment and learning. Failure across different domains has been a common bottleneck of semantic segmentation methods. In this thesis, we address the problem of adapting a segmentation model trained on a source domain to a different target domain without knowing the target domain labels, and propose a class-balanced self-training approach for such unsupervised domain adaptation. We adopt the "synthetic-to-real" setting where a model is pre-trained on GTA-5 and adapted to real world datasets such as Cityscapes and Nexar, as well as the "cross-city" setting where a model is pre-trained on Cityscapes, and adapted to unseen data from Rio, Tokyo, Rome and Taipei. Experiments show the superior performance of our method compared to state of the art methods, such as adversarial training based domain adaptation. computer vision convolutional neural network deep learning scene understanding structured prediction
49	Identifying illicit graphic in the online community using the neural network framework Vega Ezpeleta, Emilio January 2017 In this paper two convolutional neural networks are estimated to classify whether an image contains a swastika or not. The images are gathered from the gaming platform Steam and by scraping a web search engine. The architecture of the networks is kept moderate and the difference between the models is the final layer. The first model uses an average type operation while the second uses the conventional fully-connected layer at the end. The results show that the performance of the two models is similar and the test error is in the 6-9 % range. Convolutional neural network CNN Neural networks Image recognition Statistical learning Probability Theory and Statistics Sannolikhetsteori och statistik
50	Visual Perception, Prediction and Understanding with Relations January 2020 Rapid development of computer vision applications such as image recognition and object detection has been enabled by the emerging deep learning technologies. To improve the accuracy further, deeper and wider neural networks with diverse architecture are proposed for better feature extraction. Though the performance boost is impressive, only marginal improvement can be achieved with significantly increased computational overhead. One solution is to compress the exploding-sized model by dropping less important weights or channels. This is an effective solution that has been well explored. However, by utilizing the rich relation information of the data, one can also improve the accuracy with reasonable overhead. This work makes progress toward efficient and accurate visual tasks including detection, prediction and understanding by using relations. For object detection, a novel approach, Graph Assisted Reasoning (GAR), is proposed to utilize a heterogeneous graph to model object-object relations and object-scene relations. GAR fuses the features from neighboring object nodes as well as scene nodes. In this way, GAR produces better recognition than that produced from individual object nodes. Moreover, compared to previous approaches using Recurrent Neural Network (RNN), GAR's light-weight and low-coupling architecture further facilitates its integration into the object detection module. For trajectory prediction, a novel approach, namely Diverse Attention RNN (DAT-RNN), is proposed to handle the diversity of trajectories and modeling of neighboring relations. DAT-RNN integrates both temporal and spatial relations to improve the prediction under various circumstances. Last but not least, this work presents a novel relation implication-enhanced (RIE) approach that improves relation detection through relation direction and implication.
With the relation implication, the scene graph generation (SGG) model is exposed to more ground truth information and thus mitigates the overfitting problem of the biased datasets. Moreover, the enhancement with relation implication is compatible with various context encoding schemes. Comprehensive experiments on benchmarking datasets demonstrate the efficacy of the proposed approaches. / Dissertation/Thesis / Doctoral Dissertation Engineering 2020 Computer engineering computer vision convolutional neural network deep learning relation detection
