Global ETD Search

11	Detection and tracking of spruce seedlings in spatiospectral images / Detektion och följning av granplantor i spatiospektrala bilder Löwbeer, Emma, Åkesson, Erik January 2020 (has links) I projektet detekteras och följs granplantor i spatiospektrala bilder för att därefter skapa en hyperspektral datakub för av varje gran. För att detektera granarna prövas fyra metoder: manuell detektion, detektion med segmentering, detektion med SVM och detektion med neuralt nätverk. Minnesanvändning och körningstid jämförs mellan två implementationer, där hyperspektral rekonstruktion görs med olika metoder. objektdetektion neurala nätverk segmentering Faster R-CNN SVM detektion spårning hyperspektrala bilder spatiospektrala bilder granplantor datorseende Media and Communication Technology Medieteknik
12	Detection and tracking of spruce seedlings in spatiospectral images / Detektion och följning av granplantor i spatiospektrala bilder Löwbeer, Emma, Åkesson, Erik January 2020 (has links) I projektet detekteras och följs granplantor i spatiospektrala bilder för att därefter skapa en hyperspektral datakub för av varje gran. För att detektera granarna prövas fyra metoder: manuell detektion, detektion med segmentering, detektion med SVM och detektion med neuralt nätverk. Minnesanvändning och körningstid jämförs mellan två implementationer, där hyperspektral rekonstruktion görs med olika metoder. objektdetektion neurala nätverk segmentering Faster R-CNN SVM detektion spårning hyperspektrala bilder spatiospektrala bilder granplantor datorseende Media and Communication Technology Medieteknik
13	Machine visual feedback through CNN detectors : Mobile object detection for industrial application Rexhaj, Kastriot January 2019 (has links) This paper concerns itself with object detection as a possible solution to Valmet’s quest for a visual-feedback system that can help operators and other personnel to more easily interact with their machines and equipment. New advancements in deep learning, specifically CNN models, have been exploring neural networks with detection-capabilities. Object detection has historically been mostly inaccessible to the industry due the complex solutions involving various tricky image processing algorithms. In that regard, deep learning offers a more easily accessible way to create scalable object detection solutions. This study has therefore chosen to review recent literature detailing detection models with a selective focus on factors making them realizable on ARM hardware and in turn mobile devices like phones. An attempt was made to single out the most lightweight and hardware efficient model and implement it as a prototype in order to help Valmet in their decision process around future object detection products. The survey led to the choice of a SSD-MobileNetsV2 detection architecture due to promising characteristics making it suitable for performance-constrained smartphones. This CNN model was implemented on Valmet’s phone of choice, Samsung Galaxy S8, and it successfully achieved object detection functionality. Evaluation shows a mean average precision of 60 % in detecting objects and a 4.7 FPS performance on the chosen phone model. TensorFlow was used for developing, training and evaluating the model. The report concludes with recommending Valmet to pursue solutions built on-top of these kinds of models and further wishes to express an optimistic outlook on this type of technology for the future. Realizing performance of this magnitude on a mid-tier phone using deep learning (which historically is very computationally intensive) sets us up for great strides with this type of technology in the future; and along with better smartphones, great benefits are expected to both industry and consumers. / Den här rapporten behandlar objekt detektering som en möjlig lösning på Valmets efterfrågan av ett visuellt återkopplingssystem som kan hjälpa operatörer och annan personal att lättare interagera med maskiner och utrustning. Nya framsteg inom djupinlärning har dem senaste åren möjliggjort framtagande av neurala nätverksarkitekturer med detekteringsförmågor. Då industrisektorn svårare tar till sig högst specialiserade algoritmer och komplexa bildbehandlingsmetoder (som tidigare varit fallet med objekt detektering) så ger djupinlärningsmetoder istället upphov till att skapa självlärande system som är återanpassningsbara och närmast intuitiva i dem fall där sådan teknologi åberopas. Den här studien har därför valt att studera ett par sådana teknologier för att hitta möjliga implementeringar som kan realiseras på något så enkelt som en mobiltelefon. Urvalet har därför bestått i att hitta detekteringsmodeller som är hårdvarumässigt resurssnåla och implementera ett sådant system för att agera prototyp och underlag till Valmets vidare diskussioner kring objekt-detekteringsslösningar. Studien valde att implementera en SSD-MobileNetsV2 modellarkitektur då den uppvisade lovande egenskaper kring hårdvarukraven. Modellen implementerades och utvärderades på Valmets mest förekommande telefon Samsung Galaxy S8 och resultatet visade på en god förmåga för modellen att detektera objekt. Den valda modellen gav 60 % precision på utvärderingsbilderna och lyckades nå 4.7 FPS på den implementerade telefonen. TensorFlow användes för programmering och som stödjande mjukvaruverktyg för träning, utvärdering samt vidare implementering. Studien påpekar optimistiska förväntningar av denna typ av teknologi; kombinerat med bättre smarttelefoner i framtiden kan det leda till revolutionerande lösningar för både industri och konsumenter. Object detection CNN SSD Faster R-CNN YOLO TensorFlow ARM Android Deep learning Machine vision. Annan elektroteknik och elektronik
14	Detekce cizích objektů v rentgenových snímcích hrudníku s využitím metod strojového učení / Detection of foreign objects in X-ray chest images using machine learning methods Matoušková, Barbora January 2021 (has links) Foreign objects in Chest X-ray (CXR) cause complications during automatic image processing. To prevent errors caused by these foreign objects, it is necessary to automatically find them and ommit them in the analysis. These are mainly buttons, jewellery, implants, wires and tubes. At the same time, finding pacemakers and other placed devices can help with automatic processing. The aim of this work was to design a method for the detection of foreign objects in CXR. For this task, Faster R-CNN method with a pre-trained ResNet50 network for feature extraction was chosen which was trained on 4 000 images and lately tested on 1 000 images from a publicly available database. After finding the optimal learning parameters, it was managed to train the network, which achieves 75% precision, 77% recall and 76% F1 score. However, a certain part of the error is formed by non-uniform annotations of objects in the data because not all annotated foreign objects are located in the lung area, as stated in the description.
15	Air Reconnaissance Analysis using Convolutional Neural Network-based Object Detection Fasth, Niklas, Hallblad, Rasmus January 2020 (has links) The Swedish armed forces use the Single Source Intelligent Cell (SSIC), developed by Saab, for analysis of aerial reconnaissance video and report generation. The analysis can be time-consuming and demanding for a human operator. In the analysis workflow, identifying vehicles is an important part of the work. Artificial Intelligence is widely used for analysis in many industries to aid or replace a human worker. In this paper, the possibility to aid the human operator with air reconnaissance data analysis is investigated, specifically, object detection for finding cars in aerial images. Many state-of-the-art object detection models for vehicle detection in aerial images are based on a Convolutional Neural Network (CNN) architecture. The Faster R-CNN- and SSD-based models are both based on this architecture and are implemented. Comprehensive experiments are conducted using the models on two different datasets, the open Video Verification of Identity (VIVID) dataset and a confidential dataset provided by Saab. The datasets are similar, both consisting of aerial images with vehicles. The initial experiments are conducted to find suitable configurations for the proposed models. Finally, an experiment is conducted to compare the performance of a human operator and a machine. The results from this work prove that object detection can be used to supporting the work of air reconnaissance image analysis regarding inference time. The current performance of the object detectors makes applications, where speed is more important than accuracy, most suitable. Deep Learning Object detection Convolutional neural network Faster R-CNN Single Shot MultiBox Detector Aerial images Data annotation Engineering and Technology Teknik och teknologier
16	Dataset Evaluation Method for Vehicle Detection Using TensorFlow Object Detection API / Utvärderingsmetod för dataset inom fordonsigenkänning med användning avTensorFlow Object Detection API Furundzic, Bojan, Mathisson, Fabian January 2021 (has links) Recent developments in the field of object detection have highlighted a significant variation in quality between visual datasets. As a result, there is a need for a standardized approach of validating visual dataset features and their performance contribution. With a focus on vehicle detection, this thesis aims to develop an evaluation method utilized for comparing visual datasets. This method was utilized to determine the dataset that contributed to the detection model with the greatest ability to detect vehicles. The visual datasets compared in this research were BDD100K, KITTI and Udacity, each one being trained on individual models. Applying the developed evaluation method, a strong indication of BDD100K's performance superiority was determined. Further analysis and feature extraction of dataset size, label distribution and average labels per image was conducted. In addition, real-world experimental conduction was performed in order to validate the developed evaluation method. It could be determined that all features and experimental results pointed to BDD100K's superiority over the other datasets, validating the developed evaluation method. Furthermore, the TensorFlow Object Detection API's ability to improve performance gain from a visual dataset was studied. Through the use of augmentations, it was concluded that the TensorFlow Object Detection API serves as a great tool to increase performance gain for visual datasets. / Inom fältet av objektdetektering har ny utveckling demonstrerat stor kvalitetsvariation mellan visuella dataset. Till följd av detta finns det ett behov av standardiserade valideringsmetoder för att jämföra visuella dataset och deras prestationsförmåga. Detta examensarbete har, med ett fokus på fordonsigenkänning, som syfte att utveckla en pålitlig valideringsmetod som kan användas för att jämföra visuella dataset. Denna valideringsmetod användes därefter för att fastställa det dataset som bidrog till systemet med bäst förmåga att detektera fordon. De dataset som användes i denna studien var BDD100K, KITTI och Udacity, som tränades på individuella igenkänningsmodeller. Genom att applicera denna valideringsmetod, fastställdes det att BDD100K var det dataset som bidrog till systemet med bäst presterande igenkänningsförmåga. En analys av dataset storlek, etikettdistribution och genomsnittliga antalet etiketter per bild var även genomförd. Tillsammans med ett experiment som genomfördes för att testa modellerna i verkliga sammanhang, kunde det avgöras att valideringsmetoden stämde överens med de fastställda resultaten. Slutligen studerades TensorFlow Object Detection APIs förmåga att förbättra prestandan som erhålls av ett visuellt dataset. Genom användning av ett modifierat dataset, kunde det fastställas att TensorFlow Object Detection API är ett lämpligt modifieringsverktyg som kan användas för att öka prestandan av ett visuellt dataset. Deep Learning Vehicle Detection Machine Learning Dataset Evaluation Method Artificial Intelligence TensorFlow Object Detection SSD Faster R-CNN CNN Neural Networks Engineering and Technology Teknik och teknologier
17	Real Time Vehicle Detection for Intelligent Transportation Systems Shurdhaj, Elda, Christián, Ulehla January 2023 (has links) This thesis aims to analyze how object detectors perform under winter weather conditions, specifically in areas with varying degrees of snow cover. The investigation will evaluate the effectiveness of commonly used object detection methods in identifying vehicles in snowy environments, including YOLO v8, Yolo v5, and Faster R-CNN. Additionally, the study explores the method of labeling vehicle objects within a set of image frames for the purpose of high-quality annotations in terms of correctness, details, and consistency. Training data is the cornerstone upon which the development of machine learning is built. Inaccurate or inconsistent annotations can mislead the model, causing it to learn incorrect patterns and features. Data augmentation techniques like rotation, scaling, or color alteration have been applied to enhance some robustness to recognize objects under different alterations. The study aims to contribute to the field of deep learning by providing valuable insights into the challenges of detecting vehicles in snowy conditions and offering suggestions for improving the accuracy and reliability of object detection systems. Furthermore, the investigation will examine edge devices' real-time tracking and detection capabilities when applied to aerial images under these weather conditions. What drives this research is the need to delve deeper into the research gap concerning vehicle detection using drones, especially in adverse weather conditions. It highlights the scarcity of substantial datasets before Mokayed et al. published the Nordic Vehicle Dataset. Using unmanned aerial vehicles(UAVs) or drones to capture real images in different settings and under various snow cover conditions in the Nordic region contributes to expanding the existing dataset, which has previously been restricted to non-snowy weather conditions. In recent years, the leverage of drones to capture real-time data to optimize intelligent transport systems has seen a surge. The potential of drones in providing an aerial perspective efficiently collecting data over large areas to precisely and timely monitor vehicular movement is an area that is imperative to address. To a greater extent, snowy weather conditions can create an environment of limited visibility, significantly complicating data interpretation and object detection. The emphasis is set on edge devices' real-time tracking and detection capabilities, which in this study introduces the integration of edge computing in drone technologies to explore the speed and efficiency of data processing in such systems. Unmanned aerial vehicles (UAVs) vehicle detection CNN architecture snow cover NVD Yolov5s Yolov8s Faster R-CNN real-time processing edge devices Computer Sciences Datavetenskap (datalogi)
18	Détection de tableaux dans des documents : une étude de TableBank Yockell, Eugénie 04 1900 (has links) L’extraction d’information dans des documents est une nécessité, particulièrement dans notre ère actuelle où il est commun d’employer un téléphone portable pour photographier des documents ou des factures. On trouve aussi une utilisation répandue de documents PDF qui nécessite de traiter une imposante quantité de documents digitaux. Par leur nature, les données des documents PDF sont complexes à extraire, nécessitant d’être analysés comme des images. Dans cette recherche, on se concentre sur une information particulière à prélever: des tableaux. En effet, les tableaux retrouvés dans les docu- ments représentent une entité significative, car ils contiennent des informations décisives. L’utilisation de modèles neuronaux pour performer des extractions automatiques permet considérablement d’économiser du temps et des efforts. Dans ce mémoire, on définit les métriques, les modèles et les ensembles de données utilisés pour la tâche de détection de tableaux. On se concentre notamment sur l’étude des ensembles de données TableBank et PubLayNet, en soulignant les problèmes d’an- notations présents dans l’ensemble TableBank. On relève que différentes combinaisons d’ensembles d’entraînement avec TableBank et PubLayNet semblent améliorer les perfor- mances du modèle Faster R-CNN, ainsi que des méthodes d’augmentations de données. On compare aussi le modèle de Faster R-CNN avec le modèle CascadeTabNet pour la détection de tableaux où ce premier demeure supérieur. D’autre part, on soulève un enjeu qui est peu discuté dans la tâche de détection d’objets, soit qu’il existe une trop grande quantité de métriques. Cette problématique rend la comparaison de modèles ardue. On génère ainsi les résultats de modèles selon plusieurs métriques afin de démontrer qu’elles conduisent généralement vers différents modèles gagnants, soit le modèle ayant les meilleures performances. On recommande aussi les métriques les plus pertinentes à observer pour la détection de tableaux, c’est-à- dire APmedium/APmedium, Pascal AP85 ou COCO AP85 et la métrique de TableBank. / Extracting information from documents is a necessity, especially in today’s age where it is common to use a cell phone to photograph documents or invoices. There is also the widespread use of PDF documents that requires processing a large amount of digital documents. Due to their nature, the data in PDF documents are complex to retrieve, needing to be analyzed as images. In this research, we focus on a particular information to be extracted: tables. Indeed, the tables found in documents represent a significant entity, as they contain decisive information. The use of neural networks to perform automatic retrieval saves time and effort. In this research, the metrics, models and datasets used for the table detection task are defined. In particular, we focus on the study of the TableBank and PubLayNet datasets, highlighting the problems of annotations present in the TableBank set. We point out that different combinations of training sets using TableBank and PubLayNet appear to improve the performance of the Faster R-CNN model, as well as data augmentation methods. We also compare the Faster R-CNN model with the CascadeTabNet model for table detection where the former remains superior. In addition, we raise an issue that is not often discussed in the object detection task, namely that there are too many metrics. This problem makes model comparison difficult. We therefore generate results from models with several metrics in order to demonstrate the influence of these metrics in defining the best performing model. We also recommend the most relevant metrics to observe for table detection, APmedium/APmedium, Pascal AP85 or COCO AP85 and the TableBank metric. Détection d’objets Détection de tableaux TableBank PubLayNet CascadeTabNet Faster R-CNN ResNeXt Object detection Table Detection
19	Effektivisering av automatiserad igenkänning av registreringsskyltar med hjälp av artificiella neurala nätverk för användning inom smarta hem Drottsgård, Alexander, Andreassen, Jens January 2019 (has links) Konceptet automatiserad igenkänning och avläsning av registreringsskyltarhar utvecklats mycket de senaste åren och användningen av Artificiellaneurala nätverk har introducerats i liten skala med lovande resultat. Viundersökte möjligheten att använda detta i ett automatiserat system förgarageportar och implementerade en prototyp för testning. Den traditionellaprocessen för att läsa av en skylt kräver flera steg, i vissa fall upp till fem.Dessa steg ger alla en felmarginal som aggregerat kan leda till över 30% riskför ett misslyckat resultat. I denna uppsats adresseras detta problem och medhjälp av att använda oss utav Artificiella neurala nätverk utvecklades enkortare process med endast två steg för att läsa en skylt, (1) lokaliseraregistreringsskylten (2) läsa karaktärerna på registreringsskylten. Dettaminskar antalet steg till hälften av den traditionella processen samt minskarrisken för fel med 13%. Vi gjorde en Litteraturstudie för att identifiera detlämpligaste neurala nätverket för uppgiften att lokalisera registreringsskyltarmed vår miljös begränsningar samt möjligheter i åtanke. Detta ledde tillanvändandet av Faster R-CNN, en algoritm som använder ett antal artificiellaneurala nätverk. Vi har använt metoden Design och Creation för att skapa enproof of concept prototyp som använder vårt föreslagna tillvägagångssätt föratt bevisa att det är möjligt att implementera detta i en verklig miljö. / The concept of automated recognition and reading of license plates haveevolved a lot the last years and the use of Artificial neural networks have beenintroduced in a small scale with promising results. We looked into thepossibility of using this in an automated garage port system and weimplemented a prototype for testing. The traditional process for reading alicense plate requires multiple steps, sometimes up to five. These steps all givea margin of error which aggregated sometimes leads to over 30% risk forfailure. In this paper we addressed this issue and with the help of a Artificialneural network. We developed a process with only two steps for the entireprocess of reading a license plate, (1) localize license plate (2) read thecharacters on the plate. This reduced the number of steps to half of theprevious number and also reduced the risk for errors with 13%. We performeda Literature Review to find the best suited algorithm for the task oflocalization of the license plate in our specific environment. We found FasterR-CNN, a algorithm which uses multiple artificial neural networks. We usedthe method Design and Creation to implement a proof of concept prototypeusing our approach which proved that this is possible to do in a realenvironment. AI Neuralt nätverk Maskininlärning Faster R-CNN Registreringsskylt Registreringsskyltsigenkänning Nummerplåt Nummerplåtsigenkänning Artificiell intelligens Artificial Intelligence Neural Network Machine learning License Plate Recognition Number Plate Recognition Engineering and Technology Teknik och teknologier
20	Proposal networks in object detection / Förslagsnätverk för objektdetektering Grossman, Mikael January 2019 (has links) Locating and extracting useful data from images is a task that has been revolutionized in the last decade as computing power has risen to such a level to use deep neural networks with success. A type of neural network that uses the convolutional operation called convolutional neural network (CNN) is suited for image related tasks. Using the convolution operation creates opportunities for the network to learn their own ﬁlters, that previously had to be hand engineered. For locating objects in an image the state-of-the-art Faster R-CNN model predicts objects in two parts. Firstly, the region proposal network (RPN) extracts regions from the picture where it is likely to ﬁnd an object. Secondly, a detector veriﬁes the likelihood of an object being in that region.For this thesis, we review the current literature on artiﬁcial neural networks, object detection methods, proposal methods and present our new way of generating proposals. By replacing the RPN with our network, the multiscale proposal network (MPN), we increase the average precision (AP) with 12% and reduce the computation time per image by 10%. / Lokalisering av användbar data från bilder är något som har revolutionerats under det senaste decenniet när datorkraften har ökat till en nivå då man kan använda artiﬁciella neurala nätverk i praktiken. En typ av ett neuralt nätverk som använder faltning passar utmärkt till bilder eftersom det ger möjlighet för nätverket att skapa sina egna ﬁlter som tidigare skapades för hand. För lokalisering av objekt i bilder används huvudsakligen Faster R-CNN arkitekturen. Den fungerar i två steg, först skapar RPN boxar som innehåller regioner där nätverket tror det är störst sannolikhet att hitta ett objekt. Sedan är det en detektor som veriﬁerar om boxen är på ett objekt .I denna uppsats går vi igenom den nuvarande litteraturen i artiﬁciella neurala nätverk, objektdektektering, förslags metoder och presenterar ett nytt förslag att generera förslag på regioner. Vi visar att genom att byta ut RPN med vår metod (MPN) ökar vi precisionen med 12% och reducerar tiden med 10%. Deep learning Machine learning Computer vision Applied mathematics Statistics Artiﬁcial Neural Networks Object detection Faster R-CNN RPN Proposal Network Maskininlärning Neurala nätverk Objektdetektering Tillämpad matematik Matematisk statistik RPN Förslags nätverk Probability Theory and Statistics Sannolikhetsteori och statistik

Search results