• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 1771
  • 57
  • 52
  • 38
  • 36
  • 31
  • 18
  • 11
  • 10
  • 7
  • 4
  • 4
  • 2
  • 2
  • 1
  • Tagged with
  • 2558
  • 2558
  • 1059
  • 910
  • 800
  • 587
  • 555
  • 466
  • 465
  • 432
  • 417
  • 409
  • 389
  • 386
  • 356
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
71

Pretraining Deep Learning Models for Natural Language Understanding

Shao, Han 18 May 2020 (has links)
No description available.
72

Enabling Trimap-Free Image Matting via Multitask Learning

LI, CHENGQI January 2021 (has links)
Trimap-free natural image matting problem is an important computer vision task in which we extract foreground objects from given images without extra trimap input. Compared with trimap-based matting algorithms, trimap-free algorithms are easier to make false detection when the foreground object is not well defined. To solve the problem, we design a novel structure (SegMatting) to handle foreground segmentation and alpha matte prediction simultaneously, which is able to produce high-quality mattes based on RGB inputs alone. This entangled structure enables information exchange between the binary segmentation task and the alpha matte prediction task interactively, and we further design a hybrid loss to adaptively balance two tasks during the multitask learning process. Additionally, we adopt a salient object detection dataset to pretrain our network so that we could obtain a more accurate foreground segment before our training process. Experiments indicate that the proposed SegMatting qualitatively and quantitatively outperforms most previous trimap-free models with a significant margin, while remains competitive among trimap-based methods. / Thesis / Master of Science in Electrical and Computer Engineering (MSECE)
73

Deep adaptive anomaly detection using an active learning framework

Sekyi, Emmanuel 18 April 2023 (has links) (PDF)
Anomaly detection is the process of finding unusual events in a given dataset. Anomaly detection is often performed on datasets with a fixed set of predefined features. As a result of this, if the normal features bear a close resemblance to the anomalous features, most anomaly detection algorithms exhibit poor performance. This work seeks to answer the question, can we deform these features so as to make the anomalies standout and hence improve the anomaly detection outcome? We employ a Deep Learning and an Active Learning framework to learn features for anomaly detection. In Active Learning, an Oracle (usually a domain expert) labels a small amount of data over a series of training rounds. The deep neural network is trained after each round to incorporate the feedback from the Oracle into the model. Results on the MNIST, CIFAR-10 and Galaxy Zoo datasets show that our algorithm, Ahunt, significantly outperforms other anomaly detection algorithms used on a fixed, static, set of features. Ahunt can therefore overcome a poor choice of features that happen to be suboptimal for detecting anomalies in the data, learning more appropriate features. We also explore the role of the loss function and Active Learning query strategy, showing these are important, especially when there is a significant variation in the anomalies.
74

Deep Learning -Based Anomaly Detection System for Guarding Internet of Things Devices

Azumah, Sylvia w. 05 October 2021 (has links)
No description available.
75

Camera-based Recovery of Cardiovascular Signals from Unconstrained Face Videos Using an Attention Network

Deshpande, Yogesh Rajan 22 June 2023 (has links)
This work addresses the problem of recovering the morphology of blood volume pulse (BVP) information from a video of a person's face. Video-based remote plethysmography methods have shown promising results in estimating vital signs such as heart rate and breathing rate. However, recovering the instantaneous pulse rate signals is still a challenge for the community. This is due to the fact that most of the previous methods concentrate on capturing the temporal average of the cardiovascular signals. In contrast, we present an approach in which BVP signals are extracted with a focus on the recovery of the signal morphology as a generalized form for the computation of physiological metrics. We also place emphasis on allowing natural movements by the subject. Furthermore, our system is capable of extracting individual BVP instances with sufficient signal detail to facilitate candidate re-identification. These improvements have resulted in part from the incorporation of a robust skin-detection module into the overall imaging-based photoplethysmography (iPPG) framework. We present extensive experimental results using the challenging UBFC-Phys dataset and the well-known COHFACE dataset. The source code is available at https://github.com/yogeshd21/CVPM-2023-iPPG-Paper. / Master of Science / In this work we are trying to study and recover human health related metrics and the physiological signals which are at the core for the derivation of such metrics. A well know form of physiological signals is ECG (Electrocardiogram) signals and for our research we work with BVP (Blood Volume Pulse) signals. With this work we are proposing a Deep Learning based model for non-invasive retrieval of human physiological signals from human face videos. Most of the state of the art models as well as researchers try to recover averaged cardiac pulse based metrics like heart rate, breathing rate, etc. without focusing on the details of the recovered physiological signal. Physiological signals like BVP have details like systolic peak, diastolic peak and dicrotic notch, and these signals also have applications in various domains like human mental health study, emotional stimuli study, etc. Hence with this work we focus on retrieval of the morphology of such physiological signals and present a quantitative as well as qualitative results for the same. An efficient attention based deep learning model is presented and scope of reidentification using the retrieved signals is also explored. Along with significant implementations like skin detection model our proposed architecture also shows better performance than state of the art models for two very challenging datasets UBFC-Phys as well as COHFACE. The source code is available at https://github.com/yogeshd21/CVPM-2023-iPPG-Paper.
76

Investigation of Subsurface Systems of Polygonal Fractures

Zhu, Weiwei 11 1900 (has links)
Fractures are ubiquitous in the subsurface, and they provide dominant pathways for fluid flow in low permeability formations. Therefore, fractures usually play an essential role in many engineering fields, such as hydrology, waste disposal, geothermal reservoir and petroleum reservoir exploitation. Since fractures are invisible and have variable sizes from micrometers to kilometers, there is limited knowledge of their structure. We aim to deepen the understanding of fracture networks in the subsurface from their topological structures, hydraulic connectivity and characteristics at different scales. We adopt the discrete fracture network method and develop an efficient C++ code, HatchFrac, to make in-depth investigations possible. We start from generating stochastic fracture networks by constraining fracture geometries with different stochastic distributions. We apply percolation theory to investigate the global connectivity of fracture networks. We find that commonly adopted percolation parameters are unsuitable for the characterization of the percolation state of complex fracture networks. We implement the concept of global efficiency to quantify the impact of fracture geometries on the connectivity of fracture networks. Furthermore, we constrain the fracture networks with geological data and geomechanics principles. We investigate the correlation of fracture intensities with different dimensionality and find that it is not feasible to obtain correct 3D intensity parameters from 1D or 2D samples. We utilize a deep-learning technique and propose a pixel-based detection algorithm to automatically interpret fractures from raw outcrop images. Interpreted fracture maps provide abundant resources to investigate fracture intensities, lengths,orientations, and generations. For large scale faults, we develop a method to generate fault segments from a rough fault trace on a seismic map. Accurate fault geometries have significant impacts on damage zones and fault-related flow problems. For small scale fractures, we consider the impact of fracture sealing on the percolation state of orthogonal fracture networks. We emphasize the importance of non-critically stressed and partially sealed fractures, which are usually neglected because usually they are nonconductive. However, with significant stress perturbations, those noncritically stressed and partially sealed fractures can also contribute to the production by enlarging the stimulated reservoir volume.
77

Fully Convolutional Networks (FCNs) for Medical Image Segmentation

Zhewei, Wang January 2020 (has links)
No description available.
78

Strawberry Detection Under Various Harvestation Stages

Fitter, Yavisht 01 March 2019 (has links) (PDF)
This paper analyzes three techniques attempting to detect strawberries at various stages in its growth cycle. Histogram of Oriented Gradients (HOG), Local Binary Patterns (LBP) and Convolutional Neural Networks (CNN) were implemented on a limited custom-built dataset. The methodologies were compared in terms of accuracy and computational efficiency. Computational efficiency is defined in terms of image resolution as testing on a smaller dimensional image is much quicker than larger dimensions. The CNN based implementation obtained the best results with an 88% accuracy at the highest level of efficiency as well (600x800). LBP generated moderate results with a 74% detection accuracy at an inefficient rate (5000x4000). Finally, HOG’s results were inconclusive as it performed poorly early on, generating too many misclassifications.
79

Deformable 3D Brain MRI Registration with Deep Learning / Deformerbar 3D MRI-registrering med djupinlärning

Joos, Louis January 2019 (has links)
Traditional deformable registration methods have achieved impressive performances but are computationally time-consuming since they have to optimize an objective function for each new pair of images. Very recently some learning-based approaches have been proposed to enable fast registration by learning to estimate the spatial transformation parameters directly from the input images. Here we present a method for 3D fast pairwise registration of brain MR images. We model the deformation function with B-splines and learn the optimal control points using a U-Net like CNN architecture. An inverse-consistency loss has been used to enforce diffeomorphicity of the deformation. The proposed algorithm does not require supervised information such as segmented labels but some can be used to help the registration process. We also implemented several strategies to account for the multi-resolution nature of the problem. The method has been evaluated on MICCAI 2012 brain MRI datasets, and evaluated on both similarity and invertibility of the computed transformation.
80

SQUEEZE AND EXCITE RESIDUAL CAPSULE NETWORK FOR EMBEDDED EDGE DEVICES

Sami Naqvi (13154274) 08 September 2022 (has links)
<p>During recent years, the field of computer vision has evolved rapidly. Convolutional Neural Networks (CNNs) have become the chosen default for implementing computer vision tasks. The popularity is based on how the CNNs have successfully performed the wellknown</p> <p>computer vision tasks such as image annotation, instance segmentation, and others with promising outcomes. However, CNNs have their caveats and need further research to turn them into reliable machine learning algorithms. The disadvantages of CNNs become more evident as the approach to breaking down an input image becomes apparent. Convolutional neural networks group blobs of pixels to identify objects in a given image. Such a</p> <p>technique makes CNNs incapable of breaking down the input images into sub-parts, which could distinguish the orientation and transformation of objects and their parts. The functions in a CNN are competent at learning only the shift-invariant features of the object in an image. The discussed limitations provides researchers and developers a purpose for further enhancing an effective algorithm for computer vision.</p> <p>The opportunity to improve is explored by several distinct approaches, each tackling a unique set of issues in the convolutional neural network’s architecture. The Capsule Network (CapsNet) which brings an innovative approach to resolve issues pertaining to affine transformations</p> <p>by sharing transformation matrices between the different levels of capsules. While, the Residual Network (ResNet) introduced skip connections which allows deeper networks</p> <p>to be more powerful and solves vanishing gradient problem.</p> <p>The motivation of these fusion of these advantageous ideas of CapsNet and ResNet with Squeeze and Excite (SE) Block from Squeeze and Excite Network, this research work presents SE-Residual Capsule Network (SE-RCN), an efficient neural network model. The proposed model, replaces the traditional convolutional layer of CapsNet with skip connections and SE Block to lower the complexity of the CapsNet. The performance of the model is demonstrated on the well known datasets like MNIST and CIFAR-10 and a substantial reduction in the number of training parameters is observed in comparison to similar neural networks. The proposed SE-RCN produces 6.37 Million parameters with an accuracy of 99.71% on the MNIST dataset and on CIFAR-10 dataset it produces 10.55 Million parameters with 83.86% accuracy.</p>

Page generated in 0.0179 seconds