• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 90
  • 12
  • 6
  • 4
  • 3
  • 2
  • 1
  • 1
  • Tagged with
  • 145
  • 145
  • 145
  • 76
  • 52
  • 51
  • 24
  • 23
  • 22
  • 21
  • 20
  • 19
  • 19
  • 19
  • 18
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
41

Localization of UAVs Using Computer Vision in a GPS-Denied Environment

Aluri, Ram Charan 05 1900 (has links)
The main objective of this thesis is to propose a localization method for a UAV using various computer vision and machine learning techniques. It plays a major role in planning the strategy for the flight, and acts as a navigational contingency method, in event of a GPS failure. The implementation of the algorithms employs high processing capabilities of the graphics processing unit, making it more efficient. The method involves the working of various neural networks, working in synergy to perform the localization. This thesis is a part of a collaborative project between The University of North Texas, Denton, USA, and the University of Windsor, Ontario, Canada. The localization has been divided into three phases namely object detection, recognition, and location estimation. Object detection and position estimation were discussed in this thesis while giving a brief understanding of the recognition. Further, future strategies to aid the UAV to complete the mission, in case of an eventuality, like the introduction of an EDGE server and wireless charging methods, was also given a brief introduction.
42

RMNv2: Reduced Mobilenet V2 an Efficient Lightweight Model for Hardware Deployment

Ayi, Maneesh 05 1900 (has links)
Indiana University-Purdue University Indianapolis (IUPUI) / Humans can visually see things and can differentiate objects easily but for computers, it is not that easy. Computer Vision is an interdisciplinary field that allows computers to comprehend, from digital videos and images, and differentiate objects. With the Introduction to CNNs/DNNs, computer vision is tremendously used in applications like ADAS, robotics and autonomous systems, etc. This thesis aims to propose an architecture, RMNv2, that is well suited for computer vision applications such as ADAS, etc. RMNv2 is inspired by its original architecture Mobilenet V2. It is a modified version of Mobilenet V2. It includes changes like disabling downsample layers, Heterogeneous kernel-based convolutions, mish activation, and auto augmentation. The proposed model is trained from scratch in the CIFAR10 dataset and produced an accuracy of 92.4% with a total number of parameters of 1.06M. The results indicate that the proposed model has a model size of 4.3MB which is like a 52.2% decrease from its original implementation. Due to its less size and competitive accuracy the proposed model can be easily deployed in resource-constrained devices like mobile and embedded devices for applications like ADAS etc. Further, the proposed model is also implemented in real-time embedded devices like NXP Bluebox 2.0 and NXP i.MX RT1060 for image classification tasks.
43

Převod prózy do poezie pomocí neuronových sítí / Converting prose into poetry using neural networks

Gokirmak, Memduh January 2021 (has links)
Title: Converting Prose into Poetry with Neural Networks Author: Memduh Gokirmak Institute: Institute of Formal and Applied Linguistics Supervisor: Martin Popel, Institute of Formal and Applied Linguistics Abstract: We present here our attempts to create a system that generates poetry based on a sequence of text provided to it by a user. We explore the use of machine translation and language model technologies based on the neural network architecture. We use different types of data across three languages in our research, and employ and develop metrics to track the quality of the output of the systems we develop. We find that combining machine translation techniques to generate training data to this end with fine-tuning of pre-trained language models provides the most satisfactory generated poetry. Keywords: poetry machine translation language models iii
44

Top-down Modulation in Human Visual Cortex / ヒト視覚皮質におけるトップダウン変調

Mohamed, Abdelhack 25 March 2019 (has links)
京都大学 / 0048 / 新制・課程博士 / 博士(情報学) / 甲第21909号 / 情博第692号 / 新制||情||119(附属図書館) / 京都大学大学院情報学研究科知能情報学専攻 / (主査)教授 神谷 之康, 教授 熊田 孝恒, 教授 西野 恒 / 学位規則第4条第1項該当 / Doctor of Informatics / Kyoto University / DGAM
45

Interpreting and Diagnosing Deep Learning Models: A Visual Analytics Approach

Wang, Junpeng 11 July 2019 (has links)
No description available.
46

Deep Neural Networks for Object Detection in Satellite Imagery

Fritsch, Frederik January 2023 (has links)
With the development of small satellites it has become easier and cheaper to deploy satellites for earth observation from space. While optical sensors capture high-resolution data, this data is traditionally sent to earth for analysis which puts a high constraint on the data link and increases the time for making data based decisions. This thesis explores the possibilities of deploying an AI model in small satellites for detecting objects in satellite imagery and therefore reduce the amount of data that needs to be transmitted. The neural network model YOLOv8 was trained on the xView and DIOR dataset and evaluated in a hardware restricted execution environment. The model achieved a mAP50 of 0.66 and could process satellite images at a speed of 309m2/s.
47

ADVANCED CHARACTERIZATION OF BATTERY CELL DYNAMICS

Messing, Marvin January 2021 (has links)
Battery Electric Vehicles (BEV) are gaining market share but still must overcome several engineering challenges related to the lithium-ion battery packs powering them. The batteries must be carefully managed to optimize safety and performance. The estimation of battery states, which cannot be measured directly, is an important part of battery management and remains an active area of research since small gains in estimation accuracy can help reduce cost and increase BEV range. This thesis presents several improvements to battery state estimation using different methods. Electrochemical Impedance Spectroscopy (EIS) is receiving increased attention from researchers as a method for state estimation and diagnostics for real-time applications. Due to battery relaxation behaviour, long rest times are commonly used before performing the EIS measurement. In this work, methods were developed to significantly shorten the required rest times, and a State of Health (SoH) estimation strategy was proposed by taking advantage of the relaxation effect as measured by EIS. This method was demonstrated to have an estimation error of below 1%. At low temperatures, the accuracy of the battery model becomes poor due to the non-linear battery response to current. By using an adaptive filter called the Interacting Multiple Model (IMM) filter, the next part of this work showed how to significantly improve low temperature State of Charge (SoC) estimation. Further reduction in estimation errors was achieved by pairing the IMM with the Smooth Variable Structure Filter (SVSF), for SoC estimation errors below 2%. The work presented in this thesis also includes the application of Deep Neural Networks (DNN) for SoC estimation from EIS data. Finally, an extensive aging study was conducted and an accelerated protocol was compared to a realistic drive cycle based protocol using EIS as a characterization tool. / Thesis / Doctor of Philosophy (PhD) / Replacing conventional gasoline/diesel powered cars with battery powered vehicles is part of a solution to the climate crisis. However, the initial costs paired with range anxiety stops many from switching to electric cars. Both cost and range are related to the battery pack. To achieve the best possible range for the lowest possible cost, battery packs must be carefully controlled by sophisticated algorithms. Unfortunately, battery range or health cannot be measured directly, but must be inferred through measurable indicators. This thesis explores battery behavior under different operating conditions and develops improved methods which can be used to determine battery health and/or range. A powerful method usually used only in laboratory settings is studied and improved to make it more suitable for implementation in electric cars. In this work it is used for accurate battery health determination. Furthermore, a strategy for improving battery range determination at low temperatures is also proposed.
48

Classification of Atypical Femur Fracture with Deep Neural Networks / Klassificering av atypisk femurfraktur med djupa neuronnät

Chen, Yupei January 2019 (has links)
Atypical Femur Fracture(AFF) is a type of stress fracture that occurs in conjunction with prolonged bisphosphonate treatment. In practice, AFF is very rarely identified from Normal Femur Fracture(NFF) correctly on the first diagnostic X-ray examination. This project aims at developing an algorithm based on deep neural networks to assist clinicians with the diagnosis of atypical femurfracture. Two diagnostic pipelines were constructed using the Convolutional Neural Network (CNN) as the core classifier. One is a fully automatic pipeline, where the X-rays image is directly input into the network with only standardized pre-processing steps. Another interactive pipeline requires the user to re-orient the femur bones above the fractures to a vertical position and move the fracture line to the image center, before the repositioned image is sent to the CNNs. Three most popular CNNs architectures, namely VGG19, InceptionV3 and ResNet50,were tested for classifying the images to either AFF or NFF. Transfer learning technique was used to pre-train these networks using images form ImageNet. The diagnosis accuracy was evaluated using 5-fold cross-validation. With the fully automatic diagnosis pipeline, we achieved diagnosis accuracy of 82.7%, 89.4%, 90.5%, with VGG19, InceptionV3 and ResNet50, respectively. With the interactive diagnostic pipeline, the diagnosis accuracy was improved to 92.2%, 93.4% and 94.4%, respectively. To further validate the results, class activation mapping is used for indicating the discriminative image regions that the neural networks learn to identify a certain class.
49

Text Analysis in Fashion : Keyphrase Extraction

Lin, Yuhao January 2020 (has links)
The ability to extract useful information from texts and present them in the form of structured attributes is an important step towards making product comparison algorithm in fashion smarter and better. Some previous work exploits statistical features like the word frequency and graph models to predict keyphrases. In recent years, deep neural networks have proved to be the state-of-the-art methods to handle language modeling. Successful examples include Long Short Term Memory (LSTM), Gated Recurrent Units (GRU), Bidirectional Encoder Representations from Transformers(BERT) and their variations. In addition, some word embedding techniques like word2vec[1] are also helpful to improve the performance. Besides these techniques, a high-quality dataset is also important to the effectiveness of models. In this project, we aim to develop reliable and efficient machine learning models for keyphrase extraction. At Norna AB, we have a collection of product descriptions from different vendors without keyphrase annotations, which motivates the use of unsupervised methods. They should be capable of extracting useful keyphrases that capture the features of a product. To further explore the power of deep neural networks, we also implement several deep learning models. The dataset has two parts, the first part is called the fashion dataset where keyphrases are extracted by our unsupervised method. The second part is a public dataset in the domain of news. We find that deep learning models are also capable of extracting meaningful keyphrases and outperform the unsupervised model. Precision, recall and F1 score are used as evaluation metrics. The result shows that the model that uses LSTM and CRF achieves the optimal performance. We also compare the performance of different models with respect to keyphrase lengths and keyphrase numbers. The result indicates that all models perform better on predicting short keyphrases. We also show that our refined model has the advantage of predicting long keyphrases, which is challenging in this field. / Förmågan att extrahera användbar information från texter och presentera den i form av strukturerade attribut är ett viktigt steg mot att göra produktjämförelsesalgoritmen på ett smartare och bättre sätt. Vissa tidigare arbeten utnyttjar statistiska funktioner som ordfrekvens och grafmodeller för att förutsäga nyckelfraser. Under de senaste åren har djupa neurala nätverk visat sig vara de senaste metoderna för att hantera språkmodellering. Framgångsrika exempel inkluderar Long Short Term Memory (LSTM), Gated Recurrent Units (GRU), Bidirectional Encoder Representations from Transformers (BERT) och deras variationer. Dessutom kan vissa ordinbäddningstekniker som word2vec[1] också vara till hjälp för att förbättra prestandan. Förutom dessa tekniker är en datauppsättning av hög kvalitet också viktig för modellernas effektivitet. I detta projekt strävar vi efter att utveckla pålitliga och effektiva maskininlärningsmodeller för utvinning av nyckelfraser. På Norna AB har vi en samling produktbeskrivningar från olika leverantörer utan nyckelfrasnoteringar, vilket motiverar användningen av metoder utan tillsyn. De bör kunna extrahera användbara nyckelfraser som fångar funktionerna i en produkt. För att ytterligare utforska kraften i djupa neurala nätverk implementerar vi också flera modeller för djupinlärning. Datasetet har två delar, den första delen kallas modedataset där nyckelfraser extraheras med vår metod utan tillsyn. Den andra delen är en offentlig dataset i nyhetsdomänen. Vi finner att deep learning-modeller också kan extrahera meningsfulla nyckelfraser och överträffa den oövervakade modellen. Precision, återkallning och F1-poäng används som utvärderingsmått. Resultatet visar att modellen som använder LSTM och CRF uppnår optimal prestanda. Vi jämför också prestanda för olika modeller med avseende på keyphrase längder och nyckelfras nummer. Resultatet indikerar att alla modeller presterar bättre på att förutsäga korta tangentfraser. Vi visar också att vår raffinerade modell har fördelen att förutsäga långa tangentfraser, vilket är utmanande inom detta område.
50

Time-domain Deep Neural Networks for Speech Separation

Sun, Tao 24 May 2022 (has links)
No description available.

Page generated in 0.1137 seconds